On Monday, OpenAI made a higher-performing and more efficient version of the artificial intelligence technology that powers its popular generative tool ChatGPT available to all users for free.
The update to OpenAI’s flagship product came a day before Google is set to make announcements about Gemini, the search engine giant’s own AI tool that competes directly with ChatGPT.
“We’re very, very excited to bring GPT-4o to all of our free users out there,” Chief Technology Officer Mira Murati said at the highly anticipated launch event in San Francisco.
The new model, GPT-4o (the “O” stands for omni), will be rolled out in OpenAI’s products over the following few weeks, according to the company, with premium clients having unlimited access to the tool.
The company said that the model could generate content or understand commands in voice, text, or images.
“The new voice (and video) mode is the best computer interface I’ve ever used. It feels like AI from the movies,” said OpenAI CEO Sam Altman in a blog post.
Altman has previously pointed to the Scarlett Johansson character in the movie “Her” as an inspiration for where he would like AI interactions to go.
“Talking to a computer has never felt really natural for me; now it does,” he added.
Murati and engineers from OpenAI demonstrated the new powers of GPT-4o at the virtual event, asking questions and posing challenges to the beefed-up version of the ChatGPT chatbot.
The demo mainly featured OpenAI staff members putting questions to the voice-enabled ChatGPT, which responded with jokes and human-like banter.
The bot served as an interpreter from English to Italian, interpreted facial expressions and walked one user through a difficult algebra problem.
The company said that GPT-4o had the same powers as the previous version when it came to text, reasoning, and coding intelligence, and set industry-leading benchmarks for multilingual conversations, audio, and vision.
In one demonstration, ChatGPT successfully interpreted an employee’s surroundings, speaking in a friendly, feminine voice, not unlike the AI bot in the film “Her”.
“Hmmm from what I can see it looks like you’re in some kind of recording or production set-up with lights, tripods… you might be gearing up to shoot a video or make an announcement?” the ChatGPT bot said.
‘Take our time’
Anticipation was high in recent weeks that OpenAI would release an AI-amped version of an online search tool to compete with Google’s search engine, but on Friday Altman said this would not be the case.
Observers were also waiting for the launch of GPT-5, but Altman said last week that his company would “take our time on releases of major new models.”
The event is just the latest episode in the AI arms race that has seen OpenAI backer Microsoft propelled past Apple as the world’s biggest company by market capitalization.
OpenAI and Microsoft are in a heated rivalry with Google to be generative AI’s major player, but Facebook-owner Meta and upstart Anthropic are also making big moves to compete.
All the companies are scrambling to come up with ways to cover generative AI’s exorbitant costs, much of that money going to chip giant Nvidia and its powerful GPU semiconductors.
Making the new model available to all users may raise questions about OpenAI’s path to monetization.
Until now, only lower-performing versions of OpenAI’s or Google’s chatbots were available to customers for free, with doubts that everyday smartphone users are ready to pay a subscription to maintain access to the technology.
The AI makers are also feeling pressure from creators, who are demanding payment for the content used to train the models.
OpenAI has signed content partnerships with the Associated Press, the Financial Times and Axel Springer, but is also caught in a major lawsuit with The New York Times.
AI companies have also been confronted with separate lawsuits from artists, musicians, and authors in US courtrooms.