The AI Benchmark Gap: 77% on Computing Tasks, 39% on Scientific Reasoning — Markdown | type0 | type0