Karpathy Joins Anthropic, and the Training Loop Starts to Close

Karpathy's first assignment at Anthropic is to point the current model at the job of building the next one. The research cycle is being compressed from the inside — and the question worth holding any lab accountable on is whether anyone is still paid to stand outside the loop and grade what comes out.

On 19 May 2026, Andrej Karpathy told the world he had joined Anthropic. He co-founded OpenAI, built Tesla's Autopilot and Full Self-Driving program, returned to OpenAI for a year, then left in 2024 to start Eureka Labs around AI tutoring. The move was covered within hours by TechCrunch, Axios and CNBC, and read mostly as a talent-war headline. A prize researcher changed jerseys.

That reading is not wrong. It is shallow. The sentence that matters sits in Anthropic's own description of the job. Karpathy joins the pre-training team under Nick Joseph and — in Anthropic's framing — will help build a group focused on using Claude to speed up pre-training research and experimentation. His own note on X said he wanted to "get back to R&D." Put those together. A celebrated researcher's first assignment is to point the current model at the job of building the next one.

The mandate is the story

Pre-training is the expensive part. It is the long base-model run that gives Claude its raw knowledge before any fine-tuning or alignment work happens on top. It is the most capital-intensive, least glamorous, hardest-to-iterate stage of the entire pipeline — and therefore the place where a faster feedback loop is worth the most money. Staffing a marquee hire there, with an explicit brief to bring the model into the work, is a deliberate choice, not a press-release flourish.

I want to be careful with attribution, because this is the part everyone will inflate. "Using Claude to speed up pre-training research" is Anthropic's phrasing, not a measured result. Nobody has published a number showing how much faster a frontier run gets when a model designs the experiments. Treat it as a stated intention and watch for the evidence.

It is not an isolated intention. In March 2026, MIT Technology Review reported that OpenAI has made an "automated researcher" a top priority — chief scientist Jakub Pachocki described an autonomous research intern targeted for September 2026 and a full multi-agent research system aimed at 2028. IEEE Spectrum tracks the same theme under an older and blunter name: recursive self-improvement. And a survey of 25 researchers across Google DeepMind, OpenAI, Anthropic, Meta and several universities found 20 of them naming the automation of AI research itself as one of the most severe near-term risks they could name (arXiv preprint). The Karpathy hire is not a one-company quirk. It is one lab staffing for a direction the whole frontier has already chosen.

Why the economics make this hard to resist

Follow the money. Epoch AI's cost work puts the amortized cost of a frontier training run rising about 3.5x a year since 2020 — a doubling roughly every seven months. A 2026-class run sits somewhere between $200M and $500M. The underlying study projects the largest runs past a billion dollars by 2027.

The brief behind this piece put it plainly: training used to cost millions, then tens and hundreds of millions. That trajectory is the reason the labs want models inside the research loop, and it owes nothing to science fiction. When one run costs $300M, a failed experiment is not a learning opportunity. It is a write-off. Anything that triages which experiments deserve compute, writes the training and evaluation code, and reads the diagnostics faster than a human team pays for itself the first time it works. The case for AI-accelerated research is a cost-control case before it is a capability case.

There is a speed claim folded in as well. A new model generation used to mean a wait measured in months — data assembled, runs scheduled, results argued over. Compress the experiment cycle and that wait shortens. Real, and worth taking seriously. But I would discount the most excitable version of it. The brief speculated about "templated" models you simply instruct into existence — preset architectures, the AI doing the rest. We are not there, and pretending otherwise helps no one. What is real is narrower and still consequential: the cycle time of an experiment is collapsing, and its cost is being attacked from the inside.

The moat was never the credential

Here I part company with the most-quoted worry, and the brief raised it directly — that a PhD in machine learning was a moat, and an AI capable of doing the research dissolves it.

The credential was never the moat. Judgment is. Knowing which experiment is worth $300M of compute, which evaluation measures the capability you care about rather than a flattering proxy for it, what a genuine result looks like next to a merely plausible one — that is the scarce thing, and a model that writes excellent training code does not yet supply it.

I do not want to be glib, though, because something is disappearing, and the pay data shows the market has already worked out what. OpenAI is reported to offer base salaries up to $685,000 for research roles (Seoul Economic Daily); Meta's hiring run reportedly reached packages worth as much as $300M over four years for a handful of individuals (DeepLearning.AI's The Batch). Those numbers — and they deserve a pinch of salt, since recruiting figures are reported by parties with an interest in the headline — are not paying for execution. They pay for the few people who set research direction. The far larger population who execute — run the sweep, clean the data, code the ablation — is exactly the tier an automated researcher absorbs first.

So the brief is right that a moat is eroding. It has the wrong moat. Not the PhD. The mid-tier execution job that a PhD used to be a safe ticket into.

My stake, since this column should take one: I would bet against OpenAI's 2028 "fully automated research system" arriving on schedule — roadmaps in this field slip, reliably. I would bet heavily that by 2027 the median pre-training experiment is designed, coded and triaged by a model, with a human signing off. If I sat on an AI lab's board, I would stop approving headcount that executes experiments and start defending headcount that can tell a real result from a flattering one.

The quality risk nobody is pricing

The brief asked me not to flinch from the second danger: the quality and nature of models trained by other models. This is the part I take most seriously, and the part the celebratory coverage skips.

There is a well-evidenced failure mode. Recursive training on a model's own unverified output degrades it — the literature calls it model collapse. The ICLR 2025 "Strong Model Collapse" work showed that a synthetic-data fraction as small as one part in a thousand can stall performance, and that larger models can amplify the effect rather than dampen it. More recent work on escaping collapse through verification confirms the obvious counter — keep verified, human-grounded data and an independent check inside the loop.

Precision matters here. Using Claude to accelerate research is not the same act as training Claude on Claude's text. One is a colleague reading over your shoulder; the other is a photocopy of a photocopy. But the two blur at the edges, and that blur is the real exposure. When the model that designs the training signal, the model that runs the experiment and the model that judges the result all come from the same family, the independent grader is gone. Errors stop being caught because nothing outside the loop is positioned to catch them. Sakana AI's "AI Scientist", which produced a paper that cleared human peer review, is genuinely impressive and also a quiet warning: when an AI writes the paper and an AI reviews the paper, the peer-review signal itself decays.

This is the centaur-versus-autopilot choice, and it is a design decision, not a prophecy. A centaur keeps a human — or at minimum an independent model lineage — outside the loop, grading. An autopilot lets the loop close on itself because that is cheaper. The cost curve pushes hard toward autopilot. Resisting that push is now an engineering responsibility, not a matter of taste.

What I would actually watch

The Karpathy hire is a real signal, and the honest version of it is neither the talent-war shrug nor the recursive-superintelligence panic. The research cycle is being compressed and partly automated. That is good for a cost curve that had become indefensible, and bad for anyone whose job was to execute experiments rather than choose them. The question worth holding a lab accountable on is not whether a model can help train the next model — it visibly can, partially, today. It is whether anyone is still paid to stand outside the loop and grade what comes out. Keep that person funded. When that role gets automated too, start worrying.

Tarry Singh is the founder and CEO of Real AI, an enterprise AI advisory and deployment firm working with global enterprises on production agent systems, model risk, and AI sovereignty strategy. He also leads Earthscan for Energy AI startup, and is a founding contributor to the EU-funded HCAIM and PANORAIMA programmes for responsible AI education across European universities. He writes at tarrysingh.com.

Cartouche

Karpathy Joins Anthropic, and the Training Loop Starts to Close · Dispatches, 21 May 2026 · T. Singh

← Back to dispatches

Edit this post →