Last week, I had two conversations that perfectly captured the paradox of the 2026 AI talent market. On Monday, a Series B startup told me they'd been searching for an LLM engineer for nine months without making a hire. On Tuesday, a data scientist with five years of experience told me he couldn't get interviews anywhere after three months of applications.
Both were telling the truth. The AI talent market isn't a single market—it's multiple parallel markets that barely overlap. Some roles have ten qualified candidates for every opening. Others have a fraction of one qualified candidate per role. The difference is understanding which is which[^1].
After analyzing hiring data from over 300 companies and compensation packages from more than 4,000 AI/ML engineers over the past year, I can give you a detailed map of this fractured landscape. The companies winning the AI talent war understand these dynamics. The ones struggling don't.
The Great Divergence
The AI talent market has fundamentally bifurcated since the ChatGPT explosion of late 2022.
Before generative AI went mainstream, "machine learning engineer" was a reasonably coherent job category. You trained models, deployed them, and improved them. The skills were broadly transferable. A computer vision engineer could become a recommendation systems engineer with some ramping.
That world is gone. Today, the AI talent market has split into distinct segments with almost no overlap in candidate pools or compensation levels.
LLM and foundation model engineers operate in an environment of extreme scarcity. There are roughly 52,000 open ML engineer roles in the US as of January 2026, up from 38,000 at the same point last year[^2]. But for roles requiring deep experience with large language models—actual training and fine-tuning experience, not just prompt engineering—there are approximately 0.6 qualified candidates per role.
ML infrastructure engineers who can build the systems that support large-scale model training and inference are nearly as scarce. These engineers need deep distributed systems experience combined with ML-specific knowledge, and supply falls far short of demand.
Applied ML engineers—people who can take existing models and build products with them—are in high demand but more available. These are excellent engineers, but the market for their skills functions far more normally than the foundation model space does.
General data scientists, meanwhile, face oversupply. The data science bootcamp boom of 2018-2022 produced far more data scientists than the market needed. Combined with AI tools that can now do basic data analysis, competition for general data science roles is fierce.
The practical implication: compensation has diverged wildly. A senior ML engineer focused on traditional machine learning might earn $310,000-$480,000 in total compensation. A senior engineer with LLM training experience at comparable scope earns $400,000-$700,000—sometimes more.
Compensation Reality by Role
Let me give you specific numbers based on our verified compensation data.
Entry-level ML engineers with zero to two years of experience typically earn $140,000-$180,000 in base salary with $20,000-$50,000 in annual equity value. Total compensation ranges from $160,000 to $230,000 depending on company type and location.
Mid-level ML engineers with two to five years of experience command $180,000-$240,000 base with $50,000-$100,000 in equity. Total compensation runs $230,000-$340,000.
Senior ML engineers at the five to eight year mark earn $230,000-$300,000 base with $80,000-$180,000 in equity, bringing total compensation to $310,000-$480,000.
Staff ML engineers with eight or more years of experience reach $280,000-$380,000 base and $150,000-$350,000 in equity. Total compensation ranges from $430,000 to $730,000.
Principal ML engineers at the top of the technical ladder earn $350,000-$450,000 base with $250,000-$500,000 in equity—total compensation of $600,000 to $950,000.
Now here's where it gets interesting. For roles specifically in the LLM and generative AI space, these numbers jump substantially.
An LLM engineer with solid fine-tuning experience earns $300,000-$550,000 in total compensation—a 22% increase year-over-year. Foundation model engineers at companies actually training frontier models earn $400,000-$800,000, up 35% from last year. RLHF specialists—the people who can do reinforcement learning from human feedback—command $350,000-$600,000, up an extraordinary 40% year-over-year[^3].
AI safety engineers are increasingly in demand as companies recognize the reputational and regulatory risks of AI systems. Compensation for these roles runs $280,000-$500,000, up 28% from 2025.
Even prompt engineers—a role that didn't meaningfully exist three years ago—now command $180,000-$280,000 at senior levels. That's up 15% year-over-year, though growth is slowing as the role becomes more commoditized.
Where You Work Matters Enormously
Company type is the single largest determinant of AI compensation, more significant than geography or exact role.
AI labs—OpenAI, Anthropic, DeepMind, and the handful of other organizations actually training frontier models—pay extraordinary compensation. A mid-career engineer at OpenAI has median total compensation around $650,000. Research scientists average $850,000. Distinguished engineers and researchers exceed $1.2 million.
Anthropic runs slightly lower, with engineers around $580,000 median and research scientists at $750,000. DeepMind comes in at approximately $520,000 for engineers and $700,000 for researchers.
These numbers sound outrageous until you consider the alternative: OpenAI and Anthropic are competing for the same few hundred people in the world who can meaningfully contribute to frontier model development. If one of those people goes to a competitor, it potentially affects the trajectory of the most transformative technology of our era. The compensation reflects the stakes.
FAANG AI teams pay less but still extraordinarily well. Google AI and Meta AI offer $400,000-$900,000 for senior engineers depending on level. The trade-offs: more bureaucracy, less access to the most cutting-edge work, but also more stability, better work-life balance, and exceptional benefits.
AI unicorns—well-funded private companies building AI-first products—typically pay $320,000-$750,000 for senior talent. The equity upside can be substantial if the company succeeds, but the risk is higher than established players.
Traditional tech companies with AI teams offer $250,000-$500,000. The work may be less cutting-edge, but these roles often provide better balance and real-world problems to solve.
Early-stage AI startups present the most variable risk-reward profile. Base salaries run $180,000-$260,000, but equity grants of 0.2% to 1.5% could be worth nothing or worth millions. I generally advise candidates to treat early-stage equity as lottery tickets with some expected value, not guaranteed compensation.
Geographic Arbitrage Is Shrinking
For years, engineers in lower cost-of-living areas could capture significant value by working remotely for companies paying Bay Area rates. That arbitrage is narrowing.
San Francisco remains the anchor market at a 1.0 salary index. A senior ML engineer earns approximately $380,000 in median total compensation in the Bay Area.
New York runs about 8% lower at $350,000 median, though finance AI applications carry premiums that can exceed Bay Area levels for specialized roles. Seattle comes in around 12% below SF at $335,000.
Austin and Denver, which saw enormous tech migration during the pandemic, have compressed toward national rates at roughly 78-80% of Bay Area levels—$295,000-$305,000 for senior ML engineers.
Remote roles based on a national average now pay approximately 75% of Bay Area rates, or about $285,000. That's up from 65-70% a few years ago as companies have standardized their remote compensation bands.
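The index arithmetic above can be sketched in a few lines. This is a hypothetical illustration only: the index values and the $380,000 Bay Area median come from the figures cited in this section, and the dictionary below is my own construction, not an official dataset.

```python
# Hypothetical sketch of the geographic pay-index arithmetic described above.
# The anchor figure and index values are taken from this article's estimates.
SF_MEDIAN = 380_000  # senior ML engineer median total comp, Bay Area

location_index = {
    "San Francisco": 1.00,
    "New York": 0.92,         # ~8% below SF
    "Seattle": 0.88,          # ~12% below SF
    "Austin / Denver": 0.79,  # roughly 78-80% of SF
    "Remote (national)": 0.75,
}

for city, idx in location_index.items():
    print(f"{city}: ~${SF_MEDIAN * idx:,.0f}")
```

Multiplying the anchor by each index reproduces the medians quoted above (e.g. New York at roughly $350,000 and remote roles at about $285,000).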
The implication: if you're an AI engineer in a lower cost-of-living area, your relative advantage is shrinking. Companies are getting more sophisticated about geographic pay differentials, and the talent they're willing to pay full freight for is increasingly concentrated in high-cost areas.
Skills That Command Premiums
Not all ML skills are created equal. Certain specializations carry significant compensation premiums over general ML engineering.
LLM fine-tuning commands a 25-40% premium over general ML engineering. The ability to take a base model and effectively specialize it for a specific use case is in extremely high demand and relatively short supply.
Multi-modal models—systems that work across text, images, audio, and video—carry a 20-35% premium. This is the emerging frontier as AI systems become more general.
RLHF and RLAIF—reinforcement learning from human and AI feedback—carry the highest premiums at 30-45%. This is the secret sauce that makes models like ChatGPT actually useful, and very few people have genuine production experience with it.
Distributed training—the ability to train models across large GPU clusters—carries a 20-30% premium. As models get larger, this becomes more critical.
AI safety and alignment work commands 25-40% premiums, driven partly by genuine technical difficulty and partly by the growing recognition that safety is a business-critical function.
Meanwhile, some skills that were highly valued just a few years ago have declined significantly. Traditional NLP—anything pre-transformer—has dropped about 40% in demand as LLMs replaced most use cases. Basic data science has declined 25% as the work has become commoditized and AI tools can now do simple analysis. Classical computer vision approaches have dropped 30% as deep learning became standard.
Hiring Strategies That Work
If you're trying to build an AI team, you need to compete differently than you do for general software engineering talent.
Research paper authors are the highest-quality sourcing channel. People publishing at NeurIPS, ICML, and other top venues are demonstrably at the frontier. Outreach to paper authors has approximately a 15% response rate—low by general recruiting standards, but the quality is exceptional.
Open source contributors on GitHub and especially HuggingFace represent another high-quality channel. People contributing meaningful code to AI open source projects have demonstrated both skill and initiative. Response rates run around 18%.
Internal upskilling is increasingly important. Some companies find it easier to take their best software engineers and train them in ML than to compete for experienced ML engineers externally. This is high investment but yields engineers who understand your codebase and culture.
Acqui-hires—buying small AI startups primarily for their team—remain effective but expensive. If you need to build a team quickly and have capital to deploy, this can be the fastest path.
Standard job postings have the lowest effectiveness. The AI engineers you want are typically not actively looking, and general job boards don't attract the specialized talent you need.
The timeline reality: expect 81 days on average to fill an ML engineer role. For LLM specialists, budget 120+ days. For research scientists, sometimes longer. If you're thinking about AI hiring, start earlier than you think you need to.
Red Flags on Both Sides
When evaluating AI candidates, watch for specific warning signs.
Kaggle-only experience can indicate someone who's done competitive ML but hasn't dealt with production systems—data pipelines, monitoring, iteration cycles, and all the unsexy infrastructure that makes ML work in the real world.
Inability to explain model choices suggests surface-level understanding. Ask not just what architecture they used, but why that architecture versus alternatives. The depth of reasoning matters.
A lack of failed-project stories is concerning. AI projects fail frequently for legitimate reasons. Someone who can't describe a project that didn't work either lacks experience or lacks self-awareness.
Dismissiveness about MLOps suggests someone who will create systems that are impossible to maintain and improve. The best AI engineers understand that deployment and monitoring are at least as important as initial model development.
Candidates should watch for employer red flags as well.
"AI-powered" marketing without substance suggests a company that wants the AI hype without the investment. Ask what models are actually in production and what the ML team's responsibilities include.
No existing ML infrastructure means you'll be building everything from scratch—appropriate for founding engineers, but potentially frustrating if you want to do ML work rather than infrastructure work.
Unrealistic timelines reveal executives who don't understand ML development cycles. If they expect ChatGPT-quality output in three months from a new team, calibrate your expectations accordingly.
No compute budget is a dealbreaker for serious ML work. Training and inference require GPUs. If the company hasn't thought about this, they're not ready to do AI seriously.
Negotiation Leverage for AI Candidates
If you're an AI engineer evaluating offers, understand what's negotiable.
Base salary is moderately flexible—typically 5-15% depending on company stage. Use market data and competing offers for maximum leverage.
Signing bonuses are highly flexible, especially at well-funded companies. If the base won't move, signing bonus often will.
Equity is very flexible at startups (ask for 25-50% more) and moderately flexible at public companies (ask for additional RSUs).
Competing offers are your strongest leverage. Asking for a 15-25% increase while holding a competing offer from an AI lab succeeds about 55% of the time, yielding average gains of 12-18%.
Rare specializations give you leverage even without competing offers. If you have genuine RLHF experience, you can cite the scarcity of your skills and the market data that supports a premium.
Research publications establish credibility and expand your options. Papers at top venues signal that you can do work at the frontier.
The AI talent market will remain bifurcated for the foreseeable future. As long as the gap between general ML skills and frontier ML skills remains wide, compensation and hiring dynamics will diverge.
For candidates, the implication is clear: specializing in the highest-demand areas—LLMs, multi-modal, RLHF, safety—offers dramatically better economics than general ML engineering. For companies, the implication is equally clear: you need different strategies for different parts of the market, and budget expectations that account for the extreme premiums in the hottest areas.
The startup that spent nine months looking for an LLM engineer eventually succeeded by offering a significant equity stake to a strong ML engineer and investing in their ramp-up. The data scientist who couldn't get interviews repositioned himself toward MLOps—a space with more opportunities—and had multiple offers within six weeks.
The market rewards those who understand its structure.
References
[^1]: SmithSpektrum analysis of AI/ML hiring data from 300+ companies, 2024-2026.

[^2]: LinkedIn Talent Insights, "AI/ML Labor Market Report," January 2026.

[^3]: Levels.fyi, "AI/ML Compensation Data," verified against SmithSpektrum placement data, 2026.

[^4]: Stanford HAI, "AI Index Report 2026: Talent and Labor Market."

[^5]: Hired, "State of AI Hiring," 2026.
Looking to build or scale your AI team? Contact SmithSpektrum for specialized AI/ML recruiting and compensation benchmarking.
Author: Irvan Smith, Founder & Managing Director at SmithSpektrum