An inference engineer designs, builds, and optimizes the systems that serve machine learning models to users and applications in production. The role sits at the critical intersection of machine learning research and large-scale systems engineering, and it is responsible for transforming trained models into reliable, low-latency, and cost-efficient services. The core problem these engineers solve is delivering complex AI, particularly large language models and other generative systems, to a global audience while rigorously managing latency, throughput, hardware utilization, and cost. Within an engineering organization, these professionals are often part of dedicated inference, model serving, or AI infrastructure teams, working closely with research scientists, cloud infrastructure engineers, and product developers.
Day-to-day work involves owning the full inference stack, from low-level GPU kernels to global API gateways. Engineers routinely implement support for new model architectures, optimize inference runtimes using techniques like continuous batching and KV-cache management, and profile systems to eliminate performance bottlenecks. They build and maintain the distributed systems for model deployment, intelligent request routing, and cluster orchestration across thousands of accelerators. A significant portion of their work focuses on operational excellence: improving observability, automating deployments, responding to incidents, and ensuring system reliability meets strict service-level objectives for millions of requests.
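To make "KV-cache management" concrete, a quick back-of-envelope calculation shows why it dominates serving memory budgets. The sketch below is illustrative only: the shapes are roughly those of a Llama-2-7B-class model, and `kv_cache_bytes` is a hypothetical helper, not part of any serving framework.

```python
def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   seq_len: int, batch_size: int, bytes_per_elem: int = 2) -> int:
    """Estimate KV-cache size: one K and one V tensor per layer (fp16 by default)."""
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_elem
    return per_token * seq_len * batch_size

# Illustrative Llama-2-7B-like shapes: 32 layers, 32 KV heads, head_dim 128.
per_request = kv_cache_bytes(num_layers=32, num_kv_heads=32, head_dim=128,
                             seq_len=4096, batch_size=1)
print(f"~{per_request / 2**30:.1f} GiB of cache for a single 4k-token request")
```

At roughly 2 GiB per 4k-token request, a few dozen concurrent long-context requests, on top of the model weights themselves, can exhaust an 80 GB accelerator; that is precisely the pressure that paged attention and continuous batching are designed to relieve.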
Technically, the role demands proficiency across multiple layers. Programming commonly involves Python for model integration, Rust or Go for high-performance serving runtimes, and CUDA, Triton, or CUTLASS for writing and optimizing GPU kernels. Engineers must understand transformer architectures, attention mechanisms, and memory management patterns like paged attention. They work with orchestration tools like Kubernetes, inference engines such as vLLM or TensorRT, and cloud platforms including AWS, GCP, and Azure. Deep knowledge of hardware accelerators—GPUs, TPUs, and custom AI chips like Cerebras's WSE—is essential, as is experience with performance profiling tools and compiler internals for frameworks like PyTorch.
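As a small taste of that stack, here is a minimal sketch of vLLM's offline generation API, which applies continuous batching and paged attention under the hood; the model identifier is a placeholder for whatever checkpoint is being served.

```python
from vllm import LLM, SamplingParams

# Placeholder model id; any Hugging Face checkpoint supported by vLLM works here.
llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")
params = SamplingParams(temperature=0.7, max_tokens=128)

# vLLM batches these prompts internally (continuous batching), so throughput
# scales with concurrency without hand-rolled batching logic.
outputs = llm.generate(["What is paged attention?",
                        "Why does KV-cache size matter?"], params)
for out in outputs:
    print(out.outputs[0].text.strip())
```

The same engine also ships an OpenAI-compatible HTTP server, which is closer to how it is actually deployed behind the API gateways described above.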
Collaboration is fundamentally cross-functional. Inference engineers partner directly with ML researchers to co-design model architectures for efficient serving and to integrate new features like sparsity or mixture-of-experts. They work alongside infrastructure teams to scale GPU clusters, with product teams to design developer APIs, and with reliability engineers to establish observability and incident response protocols. Key soft skills include systems-level debugging under pressure, clear communication to translate technical trade-offs for diverse stakeholders, and a bias for action to drive projects from prototype to production impact.
For those seeking to enter or advance in this field, prioritizing a deep understanding of distributed systems fundamentals and modern LLM architecture is crucial. Aspiring engineers should gain hands-on experience with GPU programming, container orchestration, and building low-latency services. Differentiating as a strong candidate means demonstrating a track record of performance optimization, such as concrete examples of raising GPU utilization or cutting inference latency, coupled with the ability to own complex problems end-to-end, from reading a research paper to debugging a kernel to resolving a production outage. Mastery lies not in any single technology, but in the holistic skill of making cutting-edge AI models run efficiently and reliably at a global scale.
An AI Engineer is a specialized software engineer who designs, builds, and deploys production-grade systems powered by large language models and agentic reasoning. This role sits at the intersection of software engineering, machine learning, and product development, focused on translating cutting-edge AI research into reliable applications that solve real-world business problems. Unlike pure research roles, the AI Engineer is responsible for the full lifecycle of AI-powered features, from prototyping with frontier models to ensuring scalability, observability, and robustness in customer-facing environments. They often serve as a crucial bridge, embedding within product teams to enable AI-native experiences or working directly with enterprise clients to integrate AI into complex existing workflows.
The day-to-day work involves architecting and implementing end-to-end agent systems, which includes designing orchestration logic, integrating tool-use capabilities, and building guardrails for safe execution. Engineers in this role own features from conception to deployment, which entails prompt and context engineering, creating evaluation pipelines with golden datasets, and iteratively improving system performance based on metrics and user feedback. A significant portion of their work is infrastructural, building platforms and SDKs—such as those for AI observability, evaluation, and agent harnesses—that enable other developers to build AI applications more effectively. They ship production code across the stack, often working with technologies like Python, TypeScript, React, and various cloud services to deliver full-stack solutions.
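A minimal sketch of that orchestration shape, assuming a scripted stand-in for the model call (every name here is hypothetical, not a specific framework's API): the model either requests a tool or returns a final answer, while the loop enforces guardrails such as a step budget and an allowlist of tools.

```python
from typing import Callable

# Hypothetical tool registry; production tools would be vetted and sandboxed.
TOOLS: dict[str, Callable[[str], str]] = {
    # Demo only: never eval untrusted input in a real system.
    "calculator": lambda expr: str(eval(expr, {"__builtins__": {}})),
}

def call_model(messages: list[dict]) -> dict:
    """Scripted stand-in for an LLM call. A real system would send `messages`
    to a model configured with tool schemas and parse its structured reply."""
    if messages[-1]["role"] == "user":
        return {"type": "tool", "tool": "calculator", "input": "6 * 7"}
    return {"type": "final", "content": f"The answer is {messages[-1]['content']}."}

def run_agent(task: str, max_steps: int = 5) -> str:
    messages = [{"role": "user", "content": task}]
    for _ in range(max_steps):                    # guardrail: bounded iterations
        action = call_model(messages)
        if action["type"] == "final":
            return action["content"]
        tool = TOOLS.get(action["tool"])          # guardrail: allowlisted tools only
        result = tool(action["input"]) if tool else "error: unknown tool"
        messages.append({"role": "tool", "content": result})
    return "stopped: step budget exhausted"       # guardrail: fail closed

print(run_agent("What is 6 * 7?"))  # -> The answer is 42.
```

A production harness replaces the scripted `call_model` with a real model call and structured tool schemas, but the guardrail structure of the loop stays the same.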
Technically, proficiency in Python is nearly universal, alongside frameworks for building and orchestrating agents such as LangChain, LangGraph, and crewAI. Strong software engineering fundamentals are required, including experience with backend development, APIs, and often frontend technologies like React and TypeScript for building user interfaces. Knowledge of cloud platforms (AWS, GCP, Azure), containerization with Docker and Kubernetes, and database systems is essential for deployment. Crucially, AI Engineers must have hands-on experience with the practical application of LLMs, including techniques for retrieval-augmented generation (RAG), fine-tuning, function calling, and designing evaluation frameworks to measure agent performance and reliability.
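To illustrate the RAG pattern specifically, here is a deliberately toy sketch: the documents are made up and a bag-of-words overlap stands in for a real embedding model, but the retrieve-then-augment shape is the one production systems follow.

```python
import math
import re
from collections import Counter

# Toy corpus; a real system would chunk and index actual documents.
DOCS = [
    "vLLM uses paged attention to manage the KV cache efficiently.",
    "LangGraph models agent workflows as explicit state graphs.",
    "Retrieval-augmented generation grounds answers in retrieved context.",
]

def embed(text: str) -> Counter:
    """Bag-of-words stand-in for a learned embedding model."""
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def build_prompt(query: str, k: int = 1) -> str:
    """Retrieve the top-k documents and splice them into the prompt."""
    ranked = sorted(DOCS, key=lambda d: cosine(embed(query), embed(d)), reverse=True)
    context = "\n".join(ranked[:k])
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

print(build_prompt("How does vLLM manage the KV cache?"))
```

Swapping the toy `embed` for a real embedding model and the list for a vector store changes the components, not the shape; the evaluation frameworks mentioned above then measure whether retrieved context actually improves answers.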
Collaboration is a cornerstone of the role, requiring close partnership with product managers, designers, research scientists, and, in customer-facing positions, directly with client engineering teams. AI Engineers frequently act as trusted technical advisors, scoping projects, leading workshops, and guiding adoption. Strong communication skills are vital for translating complex technical concepts to diverse stakeholders and for codifying best practices into internal tools and documentation. The role demands a product-minded, iterative mindset, comfort with ambiguity, and the ability to make trade-offs between scope, speed, and quality in fast-moving environments.
For those seeking to enter or advance in this field, prioritizing hands-on experience in building and shipping agentic systems is paramount. This means going beyond simple API calls to create multi-step, tool-using workflows with proper evaluation and observability. Strong candidates differentiate themselves by demonstrating ownership of the full development lifecycle, a deep understanding of the strengths and failure modes of LLMs, and the ability to architect scalable, platform-level solutions. Building a portfolio that showcases production-grade systems, contributions to relevant open-source projects, and clear articulation of design trade-offs will signal the necessary blend of software craftsmanship and applied AI expertise.