Article Markdown

Raw .md Rich view All markdown articles

# Why o1's Planning Accuracy Collapses 78% When Spatial Reasoning Joins Verbal

- Date: 2026-03-23
- Category: Artificial Intelligence

When a new paper asks whether large language models can actually plan a trip — not just describe one — the answer turns out to be nearly zero.

ItinBench, a benchmark developed by researchers at the University of Virginia, tests LLMs on itinerary planning across two cognitive dimensions simultane...

---