Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations