News

The FrontierMath benchmark from Epoch AI tests generative models on difficult math problems. Find out how OpenAI’s o3 and ...
A discrepancy between first- and third-party benchmark results for OpenAI's o3 AI model is raising questions about the ...
OpenAI’s newest LLM, o3, is facing scrutiny after independent tests found it solved a far fewer number of tough math problems ...
In December 2024, OpenAI held a livestream on YouTube and other social media platforms, announcing the o3 AI model. At the time, the company highlighted the improved set of capabilities in the large ...
OpenAI’s o3 model is under scrutiny after third-party tests revealed far lower performance than previously claimed.
OpenAI’s o3 model shows inflated benchmark results; real-world tests reflect performance far below initial FrontierMath ...
When OpenAI unveiled o3 in December, the company claimed the model could answer just over a fourth of questions on FrontierMath, a challenging set of math problems. That score blew the competition ...
Catch up on the top artificial intelligence news and commentary by Wall Street analysts on publicly traded companies in the space with this ...
OpenAI is under scrutiny once again over claims it has made about its o3 model, with the company being accused of not being truthful.