Key Points
The company today announced a new advance that signals a shift in approacha model that can reason logically through many difficult problems and is significantly smarter than existing AI without a major scale-up...
The new model, dubbed OpenAI o1, can solve problems that stump existing AI models, including OpenAIs most powerful existing model, GPT-4o..
The new model was code-named Strawberry within OpenAI, and it is not a successor to GPT-4o but rather a complement to it, the company says.. Murati says that OpenAI is currently building its next master model, GPT-5, which will be considerably larger than its predecessor..
Murati says OpenAI o1 uses reinforcement learning, which involves giving a model positive feedback when it gets answers right and negative feedback when it does not, in order to improve its reasoning process..
The [new] model is learning to think for itself, rather than kind of trying to imitate the way humans would think, as a conventional LLM does, Chen says.. OpenAI says its new model performs markedly better on a number of problem sets, including ones focused on coding, math, physics, biology, and chemistry..