Future Prediction — What AI Labs Are Building

What each lab is building — derived from hiring signals

OpenAIHardware push

Building its own silicon. 60 Hardware/Silicon roles alongside 70 Model Infra — unusual for a pure software lab. They're de-risking dependence on Nvidia while simultaneously running the largest enterprise GTM in AI (195 roles). Trust & Safety (58) is a compliance requirement for government contracts, not mission.

Signal: Hardware/Silicon = 9% of all roles · GTM = 29%

AnthropicEnterprise compliance machine

The Trust & Safety (48) + Legal & Compliance (25) + Safety & Alignment (18) cluster = 91 roles. That's 21% of all hiring. This isn't idealism — it's the price of selling to Fortune 500 and government. Finance & Corp Dev at 43 roles signals major partnership/M&A activity. They're building the compliance moat nobody else wants to pay for.

Signal: Compliance cluster = 21% of roles · Finance = 10%

xAIData flywheel first

29% of roles are Data — the highest share of any lab. This is a deliberate data acquisition and annotation play: building proprietary training data at scale using X's real-time social corpus. Hardware (40) and Model Infra (32) are secondary. xAI is betting that data quality wins the next model generation, not architecture.

Signal: Data = 29% · Hardware = 16% · almost zero GTM

DeepMindResearch org, not product co

Applied Research (15) + Fundamental Research (5) + Safety & Alignment (6) = 39% of all hiring. Product (16) exists but GTM is barely 1 role. DeepMind isn't trying to win enterprise revenue — they're publishing the science that others commercialise. Robotics (3) is early signal of Gemini Robotics investments.

Signal: Research = 39% · GTM = 1 role total

MistralEuropean enterprise GTM

GTM is 31% of roles, with EMEA-specific sales tags dominating. Applied Research (21) is unusually high for a commercial lab — they're maintaining research credibility to justify open-weight releases while selling proprietary APIs to European enterprises who can't use US-hosted models for regulatory reasons. Geographic Expansion (7) is all EU.

Signal: GTM = 31% · EMEA tags · Applied Research = 13%

CohereEnterprise B2B infrastructure play

Applied Research (22) and Model Infra (15) alongside GTM (28) — they're positioning as the "safe enterprise choice" with on-prem deployment capability. Data (13) roles signal RLHF/fine-tuning pipelines for customer-specific models. Geography (8) is English-speaking enterprise markets only.

Signal: Applied Research = 17% · on-prem infra tags

PerplexityRunning their own models

Model Infrastructure is their #1 category (19 of 65 roles = 29%) — more than GTM. For a consumer search product, this is a clear signal they've moved beyond API calls and are training and serving their own models end-to-end. Applied Research (11) confirms active model development. Search tags suggest RAG infrastructure at scale.

Signal: Model Infra = 29% · Applied Research = 17% · search tags

Cognition (Devin)Aggressive commercial push

GTM (25) + Geographic Expansion (12) = 61% of all hiring. APAC expansion tags are prominent. Cognition is past the demo phase and in full enterprise sales mode — they're trying to land dev-tool contracts before coding assistants commoditise. Almost no research hiring = they're shipping, not discovering.

Signal: GTM+Geo = 61% of roles · APAC tags · no research hiring

Scale AIUS government AI contractor

Public sector and government tags dominate their GTM (31) and Product (46) roles. Robotics/Embodied AI (10) alongside Applied Research (30) shows they're building data pipelines for autonomous vehicles and defense robotics — not just LLM RLHF. Scale is positioning as the data infrastructure layer for the US military's AI programs.

Signal: government/public sector tags · Robotics = 6% · Applied Research = 17%

ElevenLabsVoice localisation land-grab

Geographic Expansion is 29% of all roles — an extraordinary share. Combined with Voice/Audio (5), this reads as aggressive multilingual voice cloning expansion into non-English markets before competitors. GTM (70) is the largest absolute count in the dataset relative to size. They're signing enterprise deals for voice AI in every major language simultaneously.

Signal: Geo Expansion = 29% · GTM = 50% · voice AI + CRM tags

CoreWeaveThe AI cloud everyone runs on

91 Model Infrastructure roles — more than any other company in this dataset. CoreWeave isn't an AI lab; it's the compute substrate that AI labs rent. GPU, Kubernetes, data center, and cloud infra tags dominate. Every lab that can't afford or justify building its own compute (everyone except OpenAI and xAI) is a CoreWeave customer.

Signal: Model Infra = 35% · GPU/Kubernetes/data center tags

EtchedTransformer-specific chip

66% of all roles are Hardware/Silicon — ASIC, signal integrity, HPC, supercomputing tags. Etched is building a chip that only runs transformers, betting that the architecture is locked in and specialised silicon will be 10× cheaper and faster than general GPUs. Model Infra (19) roles are writing the software stack for their custom hardware.

Signal: Hardware = 66% · ASIC/HPC tags · no GTM = pre-revenue

Physical Intelligence (π)Physical world AI

Robotics/Embodied AI (9) + Hardware (5) = 70% of all roles. The hardware here isn't silicon — it's actuators, sensors, and mechanical systems. π is building foundation models for robot manipulation, betting that the same scaling laws that worked for language will work for physical control. Tiny company, very focused signal.

Signal: Robotics+Hardware = 70% · mechanical/operations tags

SesameConsumer AI hardware device

Hardware (8) + Product (9) = 52% of roles with supply-chain and manufacturing tags. Sesame is building a physical consumer device — not just software. Voice/Audio (2) alongside hardware suggests a smart speaker / ambient AI form factor. The supply-chain hiring is a tell: they're going to manufacturing, not just prototype.

Signal: Hardware+Product = 52% · supply-chain/manufacturing tags

SunoMusic AI going mobile

Product (13) is their top category, with Android/Kotlin/mobile tags prominently in their tech stack. They're building a mobile-first music creation product — not a B2B API. Voice/Audio (5) + Data (4) signal active model training. Trust & Safety (4) suggests content moderation for user-generated music at scale.

Signal: Product = 30% · Android/mobile tags · Trust & Safety = 9%

WriterEnterprise AI adoption vendor

GTM is 69% of all roles — the highest concentration in the dataset. Customer success, AI transformation, and AI adoption tags suggest they're not just selling seats but running implementation projects for large enterprises deploying AI. Applied Research (6) maintains model quality while the sales machine drives revenue.

Signal: GTM = 69% · AI transformation/adoption tags = implementation-led sales

GoodfireInterpretability → product

Applied Research (4) and Safety/Alignment (1) alongside GTM (6) = they're commercialising interpretability research. Goodfire is the only lab in this dataset where positioning and product marketing appear explicitly in tech tags — they know the hard part isn't the science, it's convincing enterprises that interpretability is worth paying for.

Signal: Applied Research = 22% · positioning/product marketing tags

Liquid AIPost-transformer architecture bet

Applied Research (4) dominates, with fine-tuning, post-training, multimodal, VLM, and vision tags. Liquid is betting on alternative architectures (liquid neural networks) that are more efficient than transformers for edge deployment. Voice/Audio (2) suggests multimodal expansion. Very research-heavy for a company of this size.

Signal: Research = 21% · VLM/multimodal/fine-tuning tags · alternative architectures

Cross-lab megatrends

The model wars are over. The revenue war has begun.

GTM is the #1 or #2 hiring category at 9 of 25 labs. OpenAI (195), Anthropic (130), ElevenLabs (70), Writer (35), Mistral (50) — this isn't growth-stage hiring. It's land-and-expand at enterprise scale. The differentiation has shifted from "better model" to "better sales motion."

OpenAIAnthropicMistralElevenLabsWriter

Compliance is a product, not a tax.

Anthropic's 91-role compliance cluster (Trust & Safety + Legal + Safety & Alignment) is a deliberate enterprise moat. Regulated industries — banking, healthcare, government — will only buy from vendors who can pass security audits. Anthropic is pricing that in before competitors realise they have to.

AnthropicOpenAICohere

Two compute bets: build it vs rent it.

OpenAI (60 Hardware roles) and xAI (40) are building custom silicon. Everyone else is implicitly betting on CoreWeave (91 Model Infra roles) or Etched's transformer chip. The compute stack is bifurcating: a few giants own it, everyone else rents.

OpenAIxAICoreWeaveEtched

Physical AI is real and hiring now.

Physical Intelligence, Scale AI (robotics data), and Sesame (consumer hardware) are all building for the physical world. The next wave isn't another chatbot — it's models that control things. Scale's robotics data pipeline is the unsexy but critical piece that makes all of this work.

Physical IntelligenceScale AISesame

Geography is the new moat for voice AI.

ElevenLabs is hiring for geographic expansion faster than any other category. Voice AI is inherently local — accent, language, cultural nuance. The lab that localises first in Hindi, Arabic, Portuguese locks in distribution before models become commodities.

ElevenLabsMistralCognition

xAI is a data company who happens to make a model.

29% data roles, minimal GTM, X's real-time corpus as training data. xAI's actual product isn't Grok — it's the world's best real-time social training set. The model is the proof of concept for a data business that nobody else can replicate.

xAI

What frontier AI labsare actually building

What frontier AI labs
are actually building