openai gpt-4.1 - Search News

News

Crowdsourced AI benchmarks have serious flaws, some experts say

Crowdsourced AI benchmarks like Chatbot Arena, which have become popular among AI labs, have serious flaws, some experts say.

Exclusive: AI Bests Virus Experts, Raising Biohazard Fears

Seth Donoughe, a research scientist at SecureBio and a co-author of the paper, says that the results make him a “little ...

14h

OpenAI's most capable models hallucinate more than earlier ones

OpenAI says its latest models, o3 and o4-mini, are its most powerful yet. However, research shows the models also hallucinate more -- at least twice as much as earlier models.

15h

Spotlight on AI This Earth Day: ‘AI Is Fundamentally Incompatible With Environmental Sustainability’

Inference, training, and everyday operations all contribute to the considerable water and power consumption required to run ...

OpenAI's o3 and o4-mini hallucinate way higher than previous models

By OpenAI 's own testing, its newest reasoning models, o3 and o4 -mini, hallucinate significantly higher than o1.

Annoyed ChatGPT users complain about bot’s relentlessly positive tone

AI researchers call these yes-man antics "sycophancy," which means (like the non-AI meaning of the word) flattering users by telling them what they want to hear. Although since AI models lack ...

WinBuzzer1d

ChatGPT Users Report Cache Loops, Memory Loss, Stability Issues

ChatGPT's handling of memory and cache has faced scrutiny following a user report detailing loops and slowdowns during ...

Stumbling and Overheating, Most Humanoid Robots Fail to Finish Half-Marathon in Beijing

Only six of the 21 robots in the race crossed the finish line, highlighting just how far humanoids are from keeping up with ...

An AI Customer Service Chatbot Made Up a Company Policy—and Created a Mess

The incident began when a Reddit user named BrokenToasterOven noticed that while swapping between a desktop, laptop, and a ...

TechBooky3d

OpenAI o3 & o4 Mini Models Feature Visual Reasoning

The business's latest reasoning-focused models with evident chain-of-thought (CoT) are called o3 and o4-mini. The San ...

Banyan Hill Publishing4d

The Line Between Smart AI and AGI Just Got Blurry

OpenAI released a slew of new AI models this week. Is the company's o3 model our first glimpse at artificial general i?

Some results have been hidden because they may be inaccessible to you

Show inaccessible results