What frontier AI labs
are actually building

Every open role is a signal. We scraped 3,000+ job descriptions from 25 frontier AI labs and classified them by category, stack, and intent — to surface what's being built before the press releases.

3,156
Roles Tracked
25
Labs
14
Categories
May '26
Last Scraped
Labs — role breakdown
What each lab is building — derived from hiring signals
OpenAIHardware push
Building its own silicon. 60 Hardware/Silicon roles alongside 70 Model Infra — unusual for a pure software lab. They're de-risking dependence on Nvidia while simultaneously running the largest enterprise GTM in AI (195 roles). Trust & Safety (58) is a compliance requirement for government contracts, not mission.
Signal: Hardware/Silicon = 9% of all roles · GTM = 29%
AnthropicEnterprise compliance machine
The Trust & Safety (48) + Legal & Compliance (25) + Safety & Alignment (18) cluster = 91 roles. That's 21% of all hiring. This isn't idealism — it's the price of selling to Fortune 500 and government. Finance & Corp Dev at 43 roles signals major partnership/M&A activity. They're building the compliance moat nobody else wants to pay for.
Signal: Compliance cluster = 21% of roles · Finance = 10%
xAIData flywheel first
29% of roles are Data — the highest share of any lab. This is a deliberate data acquisition and annotation play: building proprietary training data at scale using X's real-time social corpus. Hardware (40) and Model Infra (32) are secondary. xAI is betting that data quality wins the next model generation, not architecture.
Signal: Data = 29% · Hardware = 16% · almost zero GTM
DeepMindResearch org, not product co
Applied Research (15) + Fundamental Research (5) + Safety & Alignment (6) = 39% of all hiring. Product (16) exists but GTM is barely 1 role. DeepMind isn't trying to win enterprise revenue — they're publishing the science that others commercialise. Robotics (3) is early signal of Gemini Robotics investments.
Signal: Research = 39% · GTM = 1 role total
MistralEuropean enterprise GTM
GTM is 31% of roles, with EMEA-specific sales tags dominating. Applied Research (21) is unusually high for a commercial lab — they're maintaining research credibility to justify open-weight releases while selling proprietary APIs to European enterprises who can't use US-hosted models for regulatory reasons. Geographic Expansion (7) is all EU.
Signal: GTM = 31% · EMEA tags · Applied Research = 13%
CohereEnterprise B2B infrastructure play
Applied Research (22) and Model Infra (15) alongside GTM (28) — they're positioning as the "safe enterprise choice" with on-prem deployment capability. Data (13) roles signal RLHF/fine-tuning pipelines for customer-specific models. Geography (8) is English-speaking enterprise markets only.
Signal: Applied Research = 17% · on-prem infra tags
PerplexityRunning their own models
Model Infrastructure is their #1 category (19 of 65 roles = 29%) — more than GTM. For a consumer search product, this is a clear signal they've moved beyond API calls and are training and serving their own models end-to-end. Applied Research (11) confirms active model development. Search tags suggest RAG infrastructure at scale.
Signal: Model Infra = 29% · Applied Research = 17% · search tags
Cognition (Devin)Aggressive commercial push
GTM (25) + Geographic Expansion (12) = 61% of all hiring. APAC expansion tags are prominent. Cognition is past the demo phase and in full enterprise sales mode — they're trying to land dev-tool contracts before coding assistants commoditise. Almost no research hiring = they're shipping, not discovering.
Signal: GTM+Geo = 61% of roles · APAC tags · no research hiring
Scale AIUS government AI contractor
Public sector and government tags dominate their GTM (31) and Product (46) roles. Robotics/Embodied AI (10) alongside Applied Research (30) shows they're building data pipelines for autonomous vehicles and defense robotics — not just LLM RLHF. Scale is positioning as the data infrastructure layer for the US military's AI programs.
Signal: government/public sector tags · Robotics = 6% · Applied Research = 17%
ElevenLabsVoice localisation land-grab
Geographic Expansion is 29% of all roles — an extraordinary share. Combined with Voice/Audio (5), this reads as aggressive multilingual voice cloning expansion into non-English markets before competitors. GTM (70) is the largest absolute count in the dataset relative to size. They're signing enterprise deals for voice AI in every major language simultaneously.
Signal: Geo Expansion = 29% · GTM = 50% · voice AI + CRM tags
CoreWeaveThe AI cloud everyone runs on
91 Model Infrastructure roles — more than any other company in this dataset. CoreWeave isn't an AI lab; it's the compute substrate that AI labs rent. GPU, Kubernetes, data center, and cloud infra tags dominate. Every lab that can't afford or justify building its own compute (everyone except OpenAI and xAI) is a CoreWeave customer.
Signal: Model Infra = 35% · GPU/Kubernetes/data center tags
EtchedTransformer-specific chip
66% of all roles are Hardware/Silicon — ASIC, signal integrity, HPC, supercomputing tags. Etched is building a chip that only runs transformers, betting that the architecture is locked in and specialised silicon will be 10× cheaper and faster than general GPUs. Model Infra (19) roles are writing the software stack for their custom hardware.
Signal: Hardware = 66% · ASIC/HPC tags · no GTM = pre-revenue
Physical Intelligence (π)Physical world AI
Robotics/Embodied AI (9) + Hardware (5) = 70% of all roles. The hardware here isn't silicon — it's actuators, sensors, and mechanical systems. π is building foundation models for robot manipulation, betting that the same scaling laws that worked for language will work for physical control. Tiny company, very focused signal.
Signal: Robotics+Hardware = 70% · mechanical/operations tags
SesameConsumer AI hardware device
Hardware (8) + Product (9) = 52% of roles with supply-chain and manufacturing tags. Sesame is building a physical consumer device — not just software. Voice/Audio (2) alongside hardware suggests a smart speaker / ambient AI form factor. The supply-chain hiring is a tell: they're going to manufacturing, not just prototype.
Signal: Hardware+Product = 52% · supply-chain/manufacturing tags
SunoMusic AI going mobile
Product (13) is their top category, with Android/Kotlin/mobile tags prominently in their tech stack. They're building a mobile-first music creation product — not a B2B API. Voice/Audio (5) + Data (4) signal active model training. Trust & Safety (4) suggests content moderation for user-generated music at scale.
Signal: Product = 30% · Android/mobile tags · Trust & Safety = 9%
WriterEnterprise AI adoption vendor
GTM is 69% of all roles — the highest concentration in the dataset. Customer success, AI transformation, and AI adoption tags suggest they're not just selling seats but running implementation projects for large enterprises deploying AI. Applied Research (6) maintains model quality while the sales machine drives revenue.
Signal: GTM = 69% · AI transformation/adoption tags = implementation-led sales
GoodfireInterpretability → product
Applied Research (4) and Safety/Alignment (1) alongside GTM (6) = they're commercialising interpretability research. Goodfire is the only lab in this dataset where positioning and product marketing appear explicitly in tech tags — they know the hard part isn't the science, it's convincing enterprises that interpretability is worth paying for.
Signal: Applied Research = 22% · positioning/product marketing tags
Liquid AIPost-transformer architecture bet
Applied Research (4) dominates, with fine-tuning, post-training, multimodal, VLM, and vision tags. Liquid is betting on alternative architectures (liquid neural networks) that are more efficient than transformers for edge deployment. Voice/Audio (2) suggests multimodal expansion. Very research-heavy for a company of this size.
Signal: Research = 21% · VLM/multimodal/fine-tuning tags · alternative architectures
Cross-lab megatrends
The model wars are over. The revenue war has begun.
GTM is the #1 or #2 hiring category at 9 of 25 labs. OpenAI (195), Anthropic (130), ElevenLabs (70), Writer (35), Mistral (50) — this isn't growth-stage hiring. It's land-and-expand at enterprise scale. The differentiation has shifted from "better model" to "better sales motion."
OpenAIAnthropicMistralElevenLabsWriter
Compliance is a product, not a tax.
Anthropic's 91-role compliance cluster (Trust & Safety + Legal + Safety & Alignment) is a deliberate enterprise moat. Regulated industries — banking, healthcare, government — will only buy from vendors who can pass security audits. Anthropic is pricing that in before competitors realise they have to.
AnthropicOpenAICohere
Two compute bets: build it vs rent it.
OpenAI (60 Hardware roles) and xAI (40) are building custom silicon. Everyone else is implicitly betting on CoreWeave (91 Model Infra roles) or Etched's transformer chip. The compute stack is bifurcating: a few giants own it, everyone else rents.
OpenAIxAICoreWeaveEtched
Physical AI is real and hiring now.
Physical Intelligence, Scale AI (robotics data), and Sesame (consumer hardware) are all building for the physical world. The next wave isn't another chatbot — it's models that control things. Scale's robotics data pipeline is the unsexy but critical piece that makes all of this work.
Physical IntelligenceScale AISesame
Geography is the new moat for voice AI.
ElevenLabs is hiring for geographic expansion faster than any other category. Voice AI is inherently local — accent, language, cultural nuance. The lab that localises first in Hindi, Arabic, Portuguese locks in distribution before models become commodities.
ElevenLabsMistralCognition
xAI is a data company who happens to make a model.
29% data roles, minimal GTM, X's real-time corpus as training data. xAI's actual product isn't Grok — it's the world's best real-time social training set. The model is the proof of concept for a data business that nobody else can replicate.
xAI