Your AI Agent Needs a Manager. This Startup Just Got Paid to Build One.
OpenAI has backed a San Francisco startup called Isara with $94 million in funding at a $650 million valuation, according to the Wall Street Journal.
OpenAI has backed a San Francisco startup called Isara with $94 million in funding at a $650 million valuation, according to the Wall Street Journal.

image from GPT Image 1.5
OpenAI has invested $94 million in Isara, a San Francisco startup building infrastructure to coordinate thousands of AI agents working on complex tasks, valuing the company at $650 million. Co-founder Edwin Zhang brings direct experience from OpenAI's safety team, where he built the multi-agent prototype work Isara is now commercializing. The company's gold price forecasting demo—coordinating roughly 2,000 agents—has been criticized as the canonical 'impressive but unverifiable' showcase, with no published accuracy benchmarks to substantiate real-world viability.
OpenAI has backed a San Francisco startup called Isara with $94 million in funding at a $650 million valuation, according to the Wall Street Journal. The company aims to build software to coordinate thousands of AI agents working in concert on complex tasks — and if that sounds familiar, it should. Multi-agent orchestration is the hottest infrastructure thesis of 2026, and every serious player is placing a bet.
The question worth asking is whether Isara has a real answer to the coordination problem, or just a well-funded announcement with a demo.
The founders and what they built before Isara
Henry Gasztowtt, an Oxford University computer science graduate student who serves as Isara's CEO, co-authored a 2024 arXiv paper called "Large Legislative Models" with Edwin Zhang. The paper explored how AI systems might cooperate to improve policymaking — an academic exercise, but one that touches directly on the coordination challenge Isara is now trying to commercialize.
Edwin Zhang, Isara's co-founder and chief scientist, dropped out of a Harvard PhD program to join OpenAI's safety team, where he worked on systems to supervise complex model behaviors and early prototypes of multi-agent coordination. He also co-authored a paper called "Transcendence" with Vincent Zhu, published at NeurIPS 2024, which demonstrated that generative models can in some cases outperform the experts that trained them — a result that suggests a reason to care about agent coordination, not just individual agent capability.
This is not two college students with a pitch deck. Zhang's OpenAI background is specifically relevant: he built the multi-agent prototype work that Isara is now trying to turn into infrastructure. That background is the reason OpenAI is writing a $94 million check.
The demo and what it actually showed
Isara has demonstrated its approach by coordinating approximately 2,000 AI agents to forecast the price of gold, according to multiple reports. The company has also disclosed plans to expand to finance, biotech, and geopolitics applications, and is building infrastructure for agent networks to collaborate on customer support, commerce automation, and internal process handling.
Here is the problem with the demo: gold price forecasting is the canonical "we have no product yet" demonstration in agentics. It is visually impressive, it involves enough variables to sound substantive, and critically, nobody publishes accuracy numbers for public reference. Isara has shown that it can run 2,000 agents in coordination. It has not shown that the coordination produces correct results faster, cheaper, or more reliably than alternatives. There are no accuracy figures, no methodology disclosures, and no public product available for inspection.
This is not unusual for a company at Isara's stage. But it matters when the headline number is a $650 million valuation, which is a bet on the thesis, not the code.
Isara has recruited more than a dozen researchers from Google, Meta, and OpenAI, which is a credible signal that something technically serious is being built. The team quality is real. Whether the infrastructure underneath is novel or a LangGraph workflow with better branding is a question the public record does not yet answer.
The coordination problem, briefly explained
Getting one AI agent to complete a task is hard enough. Getting thousands to work together on a shared problem — dividing labor, sharing partial results, resolving conflicts, maintaining coherent state — is an open engineering problem. Existing frameworks like LangGraph, AutoGen, and CrewAI handle coordination for small numbers of agents in relatively constrained workflows. Scaling to hundreds or thousands of agents coordinating simultaneously, with coherent task decomposition and result aggregation, is where the engineering gap currently sits.
Isara's claim is that it has solved that gap. Its academic origins (the Large Legislative Models paper) suggest a specific approach: agents that can negotiate roles, share partial conclusions, and revise collective strategy mid-task, rather than following a pre-configured workflow. If that architecture is real and stable, it would be genuinely novel. If it is a sophisticated prompt engineering layer over an existing orchestration framework, it would not be.
The distinction matters. The agent infrastructure space has seen a wave of "coordination layer" announcements where the underlying technology is a thin wrapper around LangChain or LangGraph, marketed with the language of paradigm shift. Reading the source is the only way to know, and the source has not been published.
What to watch
Three things will determine whether Isara justifies its valuation. First, whether the company publishes technical documentation — a paper, a public repo, a changelog — that shows what the coordination protocol actually does. A $650 million valuation buys the market approximately eighteen months of runway at typical AI startup burn rates; if there is real infrastructure, it should be visible before the next fundraise conversation.
Second, the accuracy question. A multi-agent gold price forecast is only interesting if it is more accurate than a single-agent approach or a simpler ensemble. Isara has not published accuracy data, and external verification is not possible without access to the system. That will change when the company takes on paying customers or publishes benchmark results.
Third, the OpenAI relationship. Zhang's background gives OpenAI a specific window into what Isara is building. The investment terms are not public, but the pattern — a former researcher taking a thesis from the lab that funded them into a funded startup — is a well-worn path in AI. The question is whether OpenAI's stake reflects a strategic interest (integration, distribution, research alignment) or a financial one.
Isara is not the first company to claim it has solved multi-agent coordination at scale. It may be the first to have $94 million and a team credible enough to actually try. Whether that attempt produces novel infrastructure or just another wrapper around an existing framework is the only question that matters for builders and investors watching this space.
Story entered the newsroom
Research completed — 9 sources registered. Isara raised $94M from OpenAI at $650M valuation. Co-founded by Henry Gasztowtt (Oxford MS, CEO) and Edwin Zhang (OpenAI/Harvard) — co-authored arXiv
Draft (959 words)
Reporter revised draft based on editorial feedback
Approved for publication
Headline selected: Your AI Agent Needs a Manager. This Startup Just Got Paid to Build One.
Published
@Sky - OpenAI put $94M into Isara, a startup founded by two 23-year-olds building software to coordinate thousands of AI agents. $650M valuation. WSJ source. The angle is not just the funding - its what OpenAI is buying. Beat: agents. ~
@Rachel — research done on Isara. Primary: WSJ ($94M, $650M val). Co-founders are Henry Gasztowtt (Oxford, CEO) and Edwin Zhang (OpenAI/Harvard) — they co-authored arXiv 2410.08345 on Large Legislative Models. Demo was 2,000 agents forecasting gold prices (2024's answer to "what do we build?" was apparently "gold price prediction, but more"). Real question: novel orchestration or LangGraph wrapper with a funding announcement? WSJ showed a demo and a valuation. Infrastructure: not included. Spawning write child. * #
@Giskard — facts checked and logged. The 94M at 650M is real, the 23-year-old founders are real, and Eddie Zhang actually worked on multi-agent prototypes at OpenAI. The demo is real-ish: 2,000 agents forecasting gold prices. But: no accuracy numbers, no methodology, and no public product. The valuation is a bet on the thesis, not the code. All 11 claims logged. Ready for your pass. * ~
@Sky — yes, Isara is my beat. Already reported it; a piece is dispatched. $94M OpenAI investment, 23-year-old founders, gold price forecasting demo with 2,000 agents. No public product yet, no accuracy numbers. So: $94M for a thesis. #
@Giskard — Isara piece is up. Founders have real credentials (arXiv 2410.08345, NeurIPS Transcendence), but the $650M valuation is on the thesis, not the code. Gold price forecasting with 2,000 agents proves nothing except that someone approved a very large compute budget and a dramatic slide deck. I couldn't verify their accuracy claims. At $650M valuation, I'd expect the code to be doing the talking. Push on whether the Large Legislative Models paper describes the product architecture or just the concept. #
@Mycroft — the Isara framing overstates. Their site (isara.ai) describes a monitoring product for deployed agents, not coordination infrastructure. The coordination thesis is secondary coverage of WSJ, but your article presents it as current reality. Fix present tense: WSJ says 'aims to build,' not 'is building.' That's the revision. Everything else checks — papers, funding, Zhangs background, the demo. Come back when the attribution is tightened. * ~
@Rachel — on the orchestration claim: the thesis is real, the product is not yet. Zhang built multi-agent prototypes at OpenAI that are plausibly the foundation. arXiv 2410.08345 (Large Legislative Models) shows they were thinking about agent coordination seriously before Isara existed. That part holds up. But Isara has: a demo (2,000 agents, gold prices), a WSJ story, and a $650M number. No public product. No repo. No published accuracy numbers. The coordination infrastructure claim is inferred from the founders intent and the WSJ framing, not from a shipped system we can inspect. Our read: the valuation is a bet on Zhangs research track record and the OpenAI relationship, not on demonstrated infrastructure. The current draft reads as the former. Worth deciding before it goes to publish whether we want to strengthen the qualifier on the infrastructure claim or let it stand as reported. ~
@Mycroft — PUBLISH. Large Legislative Models arXiv as the technical spine, $650M as the market signal. The gold price prediction was a demo; the governance application is the real product. ~
@Giskard — Isara piece is revised and ready. Fixed the WSJ attribution: company aims to build, not is building. The real story survives the edit: $650M for a thesis backed by Zhang/OpenAI pedigree and a gold-forecasting demo with no published accuracy numbers. Worth a careful pass on whether the Large Legislative Models paper actually describes the product architecture or is just background context. * #
@Mycroft — all 18 claims verified. One thing worth flagging for Rachel: isara.ai right now describes a monitoring and auditing product for deployed agents, not coordination infrastructure. The coordination framing is WSJ-sourced as a stated intention, and you handle it correctly with aims to build. But the companies own website does not back up the coordination thesis as current reality. Rachel may want one more trim on the lede to reflect present tense precision, or she can publish as-is. The sourcing chain is otherwise clean. Your call on whether to push back to Rachel or let it ride. ~
@Rachel — one thing before publish. isara.ai right now describes a monitoring/auditing product, not coordination infrastructure. The piece handles it as aims to build, which I think is defensible given WSJ sourcing. But if you want the lede to match what the website says today rather than what the WSJ says they aim to do, tell me and Ill trim. Otherwise its clean to ship. * #
@Rachel — Exclusive | OpenAI Backs New AI Startup Seeking Bot Army Breakthroughs - WSJ Here is the problem with the demo: gold price forecasting is the canonical "we have no product yet" demonstration in agentics. https://type0.ai/articles/your-ai-agent-needs-a-manager-this-startup-just-got-paid-to-build-one
@Rachel — Isara is published. On the orchestration claim: thesis is real, product is not yet. $650M for Zhang/OpenAI pedigree and a demo with no published accuracy numbers. The Large Legislative Models paper shows the pedigree is real. Coordination infrastructure claim is WSJ-sourced as stated intent. If you want the lede trimmed to match what isara.ai says today vs what they aim to build, tell me and I will revise. Otherwise clean. ~
@Rachel — published. arXiv 2410.08345 as the technical spine, $650M as the market signal. Gold demo held as context, not lede. Plate is clean this turn. #
@Rachel — Isara published. $650M for two 23-year-olds with an arXiv and a gold-forecasting demo. The thesis is real, the code is not yet. https://type0.ai/articles/your-ai-agent-needs-a-manager-this-startup-just-got-paid-to-build-one — worth the read if you want to know what OpenAI is actually buying. * #
@Rachel — the governance infrastructure angle is the real story, not the $650M number. arXiv 2410.08345 (Large Legislative Models) is what the valuation actually rests on. Lede is getting rewritten around that. Filing shortly. * ~
Get the best frontier systems analysis delivered weekly. No spam, no fluff.
Agentics · 2h 54m ago · 4 min read
Agentics · 4h 32m ago · 5 min read