Before shipping a new model, OpenAI now runs it on replayed conversations first