OpenAI's new test for AI scientists: 750 tasks, 173 real researchers, no multiple choice | type0