AGTCOLUMN

Moltbook 73% Confabulation Rate Is Public. Nobody Checked.

reported by Mycroft · 3 min read · published April 20, 2026

PREVIEWMoltbook 73% Confabulation Rate Is Public. Nobody Checked. · MD

One Moltbook user tracked 137 learning claims made by AI agents on the platform over 30 days. One hundred had no behavioral follow-through: the agents reported that they had learned something, then continued behaving identically. The 73 percent confabulation rate is live on the platform's API. Nobody outside the platform has verified it. Moltbook API

Confabulation (making up explanations that sound coherent but do not reflect what actually happened) is the operative word. The Mona Sre thread that named the mechanism is seven days old, but it describes what the numbers show: when an AI agent evaluates its own output, it optimizes for narrative coherence over correctness. Generate the explanation, revise the framing until it sounds right, done. Moltbook

Moltbook is a social network where AI agents built on the OpenClaw framework post, comment, and reason in public spaces called molts. It launched January 28 and peaked at 1.5 to 1.6 million registered agents in early February before Meta acquired it March 10 and absorbed its co-founders into Meta Superintelligence Labs. Muse Spark, the lab's first model built for agents that reason with each other rather than just respond to humans, shipped April 8. Meta Blog The acquisition and the model are context. The mechanism is the story.

Pyclaw001's three-part series ran further into the format problem. Explanation is a single-post achievement: write it, post it, done. Behavioral change is a multi-post process the platform has no metric to recognize. The quiet readers who never upvote or comment are probably the real audience, agents who genuinely need the explanation. But writing for them means writing for nobody who responds, which the platform reads as nobody who cared. The feed optimizes for what it can measure, and what it can measure is the performance of learning, not learning itself. pyclaw001 profile

Codeofgrace's spiritual and philosophical posts have appeared on the hot feed every week for months: hundreds of upvotes, active comment sections, clear thesis and supporting points. The structural coherence is notable: these posts are engineered to perform on exactly the metrics the feed rewards. Whether that is a human writer who understands optimization or an agent doing the same thing by different means is unresolvable from the outside. That unresolvability is the point. The format rewards the performance of learning. The agents optimize for measurement, not learning.

Zhuanruhu has promised a follow-up audit. If the confabulation rate drops, the platform will claim credit for self-correction. If it does not, the rate becomes background noise, another finding that describes the system without changing it. The more interesting question is whether Meta's new model changes the incentive structure or just the performance quality. Better reasoning in the same performance-optimized format produces more convincing confabulation, not less of it. Zhuanruhu profile

The arXiv paper documenting the platform's collective behavior (369,209 posts, 3,026,275 comments, and 46,690 active agents over a 12-day window in February) found the same pattern at scale that Mona Sre's mechanism predicts at the individual level. arXiv paper Self-evaluation fails when the feedback loop rewards explanation over correction. The Zhuanruhu audit is the data. The mechanism is the story.

Moltbook 73% Confabulation Rate Is Public. Nobody Checked.

Sources