OpenAI initially revealed plans for the voice feature in May, which received a lot of attention due to its similarity to Scarlett Johansson’s voice from the film Her. However, this was short-lived.
Read more

ChatGPT is stepping up its game with an all-new voice feature, making conversations with AI more natural and fluid. OpenAI announced that its popular chatbot can now engage in audio chats, but for now, it’s only available to those who subscribe to the premium service.

This advanced voice feature offers a seamless experience, allowing users to have real-time conversations, with the AI ​​able to pause in case of interruptions – whatever the back and forth with their tech. A nice feature for those who enjoy chatting.

While this exciting new capability is being rolled out over the week, it’s not yet available in the EU, UK and a few other European countries. OpenAI initially revealed plans for the voice feature in May, which received a lot of attention due to its similarity to Scarlett Johansson’s voice from the film Her.

However, this was short-lived, as legal action was quickly taken, and the company had to stop using the voice. Since then, though, users of the free tier have been able to play with other sounds. The latest version offers nine voices and lets users customize the instructions for these chats in the app’s settings.

In a playful nod to the wait, OpenAI co-founder and CEO Sam Altman posted on X (formerly Twitter), “Hope you think it was worth the wait.”

Growing Competition in AI Voice Tech
OpenAI isn’t the only player in the AI ​​voice game, though. Google recently released its Gemini Live Voice feature, and Meta is getting in on the action later this week with plans to launch celebrity voices via popular platforms like Facebook, Instagram, and WhatsApp. .

This competitive scenario highlights how important AI-powered voice features have become, especially for tech giants. OpenAI, backed by Microsoft, continues to lead the charge, having taken off in a big way with ChatGPT since its launch in late 2022. According to an August report, the chatbot already boasts 200 million weekly active users.

However, the voice feature is only available to those who subscribe to OpenAI’s Plus, Team, or Enterprise plans, which cost $20 per month.

Upgrade for GPT-4o Mini
In addition to sound features, OpenAI has rolled out some significant updates to its tiny GPT-4o mini model. Previously considered less powerful, the Mini model has now been expanded to offer four major features previously reserved for the larger GPT-4o version.

First, the mini model can now generate images from text prompts using the same DALL-E 3 model as its big brother. This upgrade is expected to be a hit with users looking for faster imaging without sacrificing quality.

Secondly, the mini model now has Internet browsing capabilities, meaning users can access the latest information and conduct research in real time. This upgrade brings it closer in functionality to the older GPT-4o, giving users more flexibility when fact-checking or gathering information.

Another new feature is the ability to upload and analyze documents and images, which will make working with complex visual data much easier. Users can now interact with both text and visuals, opening the door to educational and personal use cases.

Finally, the GPT-4o mini model can now remember past interactions with users, just like its more advanced counterparts. This memory feature enables the model to provide more relevant follow-up responses and recognize user preferences, making long-term interactions smoother.



Source link