OpenAI wants to grade AI on biology's judgment calls. It also wrote the test. — type0 | type0