The AI Benchmark Gap: 77% on Computing Tasks, 39% on Scientific Reasoning — type0 | type0