OpenAI recently launched its o1 model, code-named Strawberry, which uses reinforcement learning to solve complex problems much better than other current LLMs. On the American Invitational Mathematics Exam (AIME), o1 was able to solve 83% of the problems compared to GPT-4o which could only manage 12%.