YOURCITYvsAI
Compare · Trends
San Francisco
versus
Austin
6.9×
more AI engineering jobs in San Francisco. But that number hides a story about what kind of AI work each city actually does.
Live data 1,719 listings ghost-filtered
Left city
Right city
Role
Inference Engineer 145 listings analyzed

An inference engineer designs, builds, and optimizes the systems that serve machine learning models to users and applications in production. This role sits at the critical intersection of machine learning research and large-scale systems engineering, responsible for transforming trained models into reliable, low-latency, and cost-efficient services. The core problem they solve is delivering the computational power of complex AI—particularly large language models and generative AI—to a global audience while rigorously managing constraints of latency, throughput, hardware utilization, and cost. Within an engineering organization, these professionals are often part of dedicated inference, model serving, or AI infrastructure teams, working closely with research scientists, cloud infrastructure engineers, and product developers.

Day-to-day work involves owning the full inference stack, from low-level GPU kernels to global API gateways. Engineers routinely implement support for new model architectures, optimize inference runtimes using techniques like continuous batching and KV-cache management, and profile systems to eliminate performance bottlenecks. They build and maintain the distributed systems for model deployment, intelligent request routing, and cluster orchestration across thousands of accelerators. A significant portion of their work focuses on operational excellence: improving observability, automating deployments, responding to incidents, and ensuring system reliability meets strict service-level objectives for millions of requests.

Technically, the role demands proficiency across multiple layers. Programming commonly involves Python for model integration, Rust or Go for high-performance serving runtimes, and CUDA, Triton, or CUTLASS for writing and optimizing GPU kernels. Engineers must understand transformer architectures, attention mechanisms, and memory management patterns like paged attention. They work with orchestration tools like Kubernetes, inference engines such as vLLM or TensorRT, and cloud platforms including AWS, GCP, and Azure. Deep knowledge of hardware accelerators—GPUs, TPUs, and custom AI chips like Cerebras's WSE—is essential, as is experience with performance profiling tools and compiler internals for frameworks like PyTorch.

Collaboration is fundamentally cross-functional. Inference engineers partner directly with ML researchers to co-design model architectures for efficient serving and to integrate new features like sparsity or mixture-of-experts. They work alongside infrastructure teams to scale GPU clusters, with product teams to design developer APIs, and with reliability engineers to establish observability and incident response protocols. Key soft skills include systems-level debugging under pressure, clear communication to translate technical trade-offs for diverse stakeholders, and a bias for action to drive projects from prototype to production impact.

For those seeking to enter or advance in this field, prioritizing a deep understanding of distributed systems fundamentals and modern LLM architecture is crucial. Aspiring engineers should gain hands-on experience with GPU programming, container orchestration, and building low-latency services. Differentiating as a strong candidate involves demonstrating a track record of performance optimization—such as stories of boosting GPU utilization or slashing inference latency—coupled with the ability to own complex problems end-to-end, from reading a research paper to debugging a kernel to resolving a production outage. Mastery lies not in any single technology, but in the holistic skill of making cutting-edge AI models run efficiently and reliably at a global scale.

What you'll build
  • Apis & Integrations
    22.8%
  • Ml Infrastructure
    10.3%
Emerging skills (10-50%)
  • Distributed Systems
    41.4%
  • Gpu/cuda
    34.5%
  • Python
    33.8%
  • Kubernetes
    24.1%
  • Machine Learning
    21.4%
  • Performance Optimization
    19.3%
  • Llms
    16.6%
  • Rust
    16.6%
  • Load Balancing
    14.5%
  • Communication
    13.1%
Tools & Frameworks
kubernetes (37.2%) pytorch (22.8%) gpu/cuda (22.1%) rust (19.3%) python (19.3%) vllm (15.9%) aws (13.1%) google cloud (12.4%) sglang (10.3%)
Typical requirements
5+ yrs experience · Range: 4-6+
Ai Engineer 594 listings analyzed

An AI Engineer is a specialized software engineer who designs, builds, and deploys production-grade systems powered by large language models and agentic reasoning. This role sits at the intersection of software engineering, machine learning, and product development, focused on translating cutting-edge AI research into reliable applications that solve real-world business problems. Unlike pure research roles, the AI Engineer is responsible for the full lifecycle of AI-powered features, from prototyping with frontier models to ensuring scalability, observability, and robustness in customer-facing environments. They often serve as a crucial bridge, embedding within product teams to enable AI-native experiences or working directly with enterprise clients to integrate AI into complex existing workflows.

The day-to-day work involves architecting and implementing end-to-end agent systems, which includes designing orchestration logic, integrating tool-use capabilities, and building guardrails for safe execution. Engineers in this role own features from conception to deployment, which entails prompt and context engineering, creating evaluation pipelines with golden datasets, and iteratively improving system performance based on metrics and user feedback. A significant portion of their work is infrastructural, building platforms and SDKs—such as those for AI observability, evaluation, and agent harnesses—that enable other developers to build AI applications more effectively. They ship production code across the stack, often working with technologies like Python, TypeScript, React, and various cloud services to deliver full-stack solutions.

Technically, proficiency in Python is nearly universal, alongside frameworks for building and orchestrating agents such as LangChain, LangGraph, and crewAI. Strong software engineering fundamentals are required, including experience with backend development, APIs, and often frontend technologies like React and TypeScript for building user interfaces. Knowledge of cloud platforms (AWS, GCP, Azure), containerization with Docker and Kubernetes, and database systems is essential for deployment. Crucially, AI Engineers must have hands-on experience with the practical application of LLMs, including techniques for retrieval-augmented generation (RAG), fine-tuning, function calling, and designing evaluation frameworks to measure agent performance and reliability.

Collaboration is a cornerstone of the role, requiring close partnership with product managers, designers, research scientists, and, in customer-facing positions, directly with client engineering teams. AI Engineers frequently act as trusted technical advisors, scoping projects, leading workshops, and guiding adoption. Strong communication skills are vital for translating complex technical concepts to diverse stakeholders and for codifying best practices into internal tools and documentation. The role demands a product-minded, iterative mindset, comfort with ambiguity, and the ability to make trade-offs between scope, speed, and quality in fast-moving environments.

For those seeking to enter or advance in this field, prioritizing hands-on experience in building and shipping agentic systems is paramount. This means going beyond simple API calls to create multi-step, tool-using workflows with proper evaluation and observability. A strong candidate differentiates themselves by demonstrating ownership of the full development lifecycle, a deep understanding of the strengths and failure modes of LLMs, and the ability to architect scalable, platform-level solutions. Building a portfolio that showcases production-grade systems, contributions to relevant open-source projects, and a clear articulation of design trade-offs will signal the necessary blend of software craftsmanship and applied AI expertise.

What you'll build
  • Ai Agents
    28.8%
  • Apis & Integrations
    19.4%
  • Prototypes & Pocs
    16.0%
  • Documentation & Examples
    10.8%
  • Eval Pipelines
    10.1%
Emerging skills (10-50%)
  • Llms
    35.0%
  • Python
    31.0%
  • Communication
    26.4%
  • Software Engineering
    18.9%
  • Ai
    18.9%
  • Machine Learning
    17.0%
  • Typescript
    16.8%
  • Agent Systems
    16.0%
  • Collaboration
    14.5%
  • Distributed Systems
    13.6%
Tools & Frameworks
aws (13.3%) python (13.1%) google cloud (13.0%) typescript (11.1%) langchain (10.6%) azure (10.3%)
Typical requirements
5-6 yrs experience · Range: 4-7+
Click any job. They're all real.
Every listing links to the live posting. Ghost jobs filtered out. Updated daily at 6:30 AM.
San Francisco
Any AI Engineering Role jobs
1,501
Lila Sciences · San Francisco
Unknown AI Dev role Other NEW
ai_infra_engineer ai_infra_engineer
Vast.ai · San Francisco
Unknown AI Dev role Other NEW
inference_engineer inference_engineer
Flux Computing · San Francisco
Unknown Not AI Other NEW
inference_engineer inference_engineer
Flux Computing · San Francisco
Unknown Not AI Other NEW
inference_engineer inference_engineer
Loop AI - Delivery Intelligence Platform · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Scale AI · San Francisco
🚀 Series C+ AI App role Other NEW
ai_engineer ai_engineer
Capital One · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Capital One · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Capital One · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Anomali · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
GoFundMe · San Francisco
Unknown AI App role Other NEW
ml_engineer ml_engineer
TwelveLabs · San Francisco, CA / Hybrid
Unknown AI Dev role Other NEW
ai_engineer ai_engineer
Parafin · San Francisco, CA / Hybrid
Unknown AI Dev role Other NEW
ai_infra_engineer ai_infra_engineer
fal · San Francisco
🚀 Series C+ AI Dev role Other NEW
ai_infra_engineer ai_infra_engineer
Gilead Sciences · San Francisco
Unknown AI Dev role Other NEW
ai_infra_engineer ai_infra_engineer
Variance · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Harvey · San Francisco
🚀 Series C+ AI Dev role Other NEW
ai_engineer ai_engineer
Quintess AI · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Uplane · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Waymo · San Francisco
📈 Public AI App role Other NEW
ai_engineer ai_engineer
AegisAI Security · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Suno (suno.com) · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
vibecode.dev · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Front · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
WorkOS · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Harper (harperinsure.com) · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Autodesk · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Sila Nanotechnologies · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Campfire · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
LlamaIndex · San Francisco, CA / Hybrid
🅰️ Series A AI App role Other NEW
ai_engineer ai_engineer
Spellbrush · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Rhythms · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Charta Health · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Gem · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Lila Sciences · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Tanium · San Francisco
🚀 Series C+ AI App role Other NEW
ai_engineer ai_engineer
Frontier Medicines · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Alpharun · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Fluidstack · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
SS&C Technologies · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Ambi Robotics · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Notable (notablehealth.com) · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Hilbert's AI · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
E.L.F. Beauty · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
AlphaSense · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Mercor · San Francisco
🎯 Series B AI App role Other NEW
ai_engineer ai_engineer
Serval · San Francisco
🎯 Series B AI App role Other NEW
ai_engineer ai_engineer
Sauron · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Eliza · San Francisco
Unknown AI App role Other NEW
ai_engineer ai_engineer
Mulligan Funding · San Francisco, CA / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
← Prev 1 / 31 Next →
Austin
Any AI Engineering Role jobs
218
HackerOne · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
HackerOne · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Ericsson · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
PwC · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Neurophos · Austin
Unknown AI Dev role Other NEW
inference_engineer inference_engineer
Expedia Group · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
GlobalFoundries · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
GlobalFoundries · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
Advantest · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
Intel · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
Alzheimer's Association · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Alzheimer's Association · Austin, TX / Hybrid
Unknown AI App role Other NEW
ai_engineer ai_engineer
Crowe Global · Austin
Unknown AI App role Other NEW
ai_engineer ai_engineer
Arm · Austin
📈 Public AI App role Other NEW
ai_engineer ai_engineer
webAI · Austin
🅰️ Series A AI App role Other
ai_engineer ai_engineer
webAI · Austin, TX / Hybrid
🅰️ Series A AI App role Other
ai_engineer ai_engineer
NVIDIA · Austin
📈 Public Not AI Other
inference_engineer inference_engineer
Stantec · Austin, TX / Hybrid
Unknown AI Dev role Other
ai_engineer ai_engineer
Writer · Austin, TX / Hybrid
Unknown AI Dev role Other
ai_infra_engineer ai_infra_engineer
eBay · Austin
📈 Public AI Dev role Other
research_engineer research_engineer
Trend Micro · Austin
Unknown AI Dev role Other
ai_infra_engineer ai_infra_engineer
GoodLeap · Austin, TX / Hybrid
🚀 Series C+ AI App role Other
ai_engineer ai_engineer
Anthropic · Austin
🚀 Series C+ AI App role Other
ai_engineer ai_engineer
HackerOne · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
Tricentis · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Guidehouse · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Advantest · Austin
Unknown AI Dev role Other
ai_engineer ai_engineer
Search Discovery · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Arganteal Corporation · Austin
Unknown AI App role Other
ai_engineer ai_engineer
SambaNova Systems · Austin
🚀 Series C+ AI App role Other
ai_engineer ai_engineer
Eliza · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Ichor Systems, Inc. · Austin
Unknown AI App role Other
ai_engineer ai_engineer
NODA AI · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
SpotOn · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
RSM US LLP · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Emerson · Austin, TX / Hybrid
🏛️ Legacy Enterprise AI App role Other
ai_engineer ai_engineer
Prokeep · Austin, TX / Hybrid
🅰️ Series A AI App role Other
ai_engineer ai_engineer
Greenberg Traurig · Austin
Unknown AI App role Other
ai_engineer ai_engineer
Expedia Group · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
Dialpad · Austin
Unknown AI App role Other
ai_engineer ai_engineer
eBay · Austin
📈 Public AI App role Other
ai_engineer ai_engineer
Wolters Kluwer · Austin
Unknown AI App role Other
ai_engineer ai_engineer
PayPal · Austin
📈 Public AI App role Other
ai_engineer ai_engineer
BusPatrol · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
Crowe Global · Austin
Unknown AI App role Other
ai_engineer ai_engineer
SecurityScorecard · Austin
🚀 Series C+ AI App role Other
ai_engineer ai_engineer
SecurityScorecard · Austin
🚀 Series C+ AI App role Other
ai_engineer ai_engineer
Airtable · Austin
Unknown AI App role Other
ai_engineer ai_engineer
SEON · Austin, TX / Hybrid
Unknown AI App role Other
ai_engineer ai_engineer
Intel · Austin
Unknown AI Dev role Other
ai_infra_engineer ai_infra_engineer
← Prev 1 / 5 Next →
Built by
Martin Gonzalez
I built this to answer a personal question — is SF really that much better than Austin for AI engineering jobs? — and turned it into a live dataset for other AI engineers asking the same thing. If this is useful, if you spot a bug, or if you want to hire me, reach out.
martin.jose.gonzalez@gmail.com