Most enterprise AI is still an experiment. Capital One is not.
That is the takeaway from a rare public accounting of a major bank's production agentic AI system, laid out by Rashmi Shetty, Capital One's vice president of enterprise AI, at the TWIML AI conference this week. While 89 percent of organizations exploring agentic AI have not moved past pilots, Capital One has a system running in production across its auto dealership customers — and the company is sharing numbers that most banks would keep quiet about.
Chat Concierge, Capital One's multi-agent system for auto dealers, drove a 55 percent improvement in serious sales leads and reduced AI response latency fivefold, Shetty said at the event. The company has been running some version of this architecture for roughly 15 months, she said — before "agentic AI" became a term anyone outside research labs was using.
The system is not a single AI model taking requests. It is four distinct agents working in sequence: one communicates with the customer, a second builds an action plan based on the business rules and tools it is allowed to touch, a third evaluates what would happen if those actions executed, and a fourth validates the plan with the user before anything happens. That third agent — the evaluator — is where Capital One solved the problem that trips up most enterprise deployments.
"The evaluator agent is where we bring a world model," Shetty said. "That's where we simulate what happens if a series of actions were to be actually executed." The model is trained on Capital One's internal policies and the regulatory environment the company operates in. Compliance is not a checkpoint the system passes through before it acts. It is the substrate the system was built on.
Prem Natarajan, Capital One's chief scientist and head of AI, framed the company's advantage as architectural. "You don't gate policy around the agent," he said at the same event. "Policy is the agent." That is a stark way of saying: most enterprises are trying to wrap guardrails around an AI system. Capital One built the guardrails into the foundation.
The technical stack reflects that priority. The company runs on open-weights AI models rather than closed ones — a choice that sounds counterintuitive given the regulatory environment banks operate in, but that Capital One's team argues enables deeper customization of its compliance logic. The production infrastructure sits on Snowflake, Docker, Kubernetes, and AWS serverless components, alongside Scala and Java for the regulated workloads and PyTorch for model development, according to an analysis of Capital One's public engineering footprint.
The broader context matters here. A Deloitte survey cited at the event found that 30 percent of organizations are exploring agentic AI but only 11 percent have it running in production, Fortune reported. Capital One is not just in that 11 percent — it is one of the few companies publicly discussing production architecture at a granular level. Most enterprises that have deployed agentic systems are not talking about it, either because it is working and they do not want to invite scrutiny, or because it is not working and they certainly do not want to talk about it.
Capital One's own numbers are the obvious caveat. The 55 percent improvement in leads and the fivefold latency reduction are figures the company reported at a conference, not figures independently audited. The Deloitte survey has the standard limitations of self-reported enterprise surveys — respondents tend to be organizations already further along the adoption curve. Capital One also closed its last physical data center in 2021 and moved entirely to public cloud, making it structurally unusual among major U.S. banks in ways that make direct comparison difficult.
What Capital One is describing is a specific solution to a specific problem: how to put AI agents in front of customers in a regulated industry without the compliance team halting every transaction. The answer — build the evaluator as a mirror of internal risk governance — is an architectural pattern that other financial institutions, insurers, and healthcare companies are studying closely, even if they are not saying so publicly.
The question for the 89 percent still in pilot is whether Capital One's approach is transferable or whether it depends on a decade of cloud-native infrastructure that most enterprises will take years to replicate. The banks that figure that out first will have a significant operational advantage. The ones that do not will keep running experiments.