OpenAI Transforms Chatbot into Voice Assistant with New App

OpenAI Unveils New Voice Assistant Chatbot Technology

OpenAI has announced a new version of its ChatGPT chatbot that can receive and respond to voice commands, images, and videos. The launch comes as tech giants Apple and Google are transforming their voice assistants into chatbots.

The new app, based on the A.I. system GPT-4o, is said to handle audio, images, and video significantly faster than previous versions of the technology. It will be available for both smartphones and desktop computers starting on Monday, free of charge.

Mira Murati, the chief technology officer of OpenAI, said, “We are looking at the future of the interaction between ourselves and machines.”

This new app is part of a broader effort to merge conversational chatbots like ChatGPT with voice assistants like Google Assistant and Apple’s Siri. Google has already integrated its Gemini chatbot with Google Assistant, while Apple is gearing up to release a more conversational version of Siri.

OpenAI plans to gradually roll out the technology to users over the coming weeks. This marks the first time that ChatGPT will be available as a desktop application. The new app also consolidates various free and paid products into a single system accessible across all platforms.

During a live-streamed event, OpenAI showcased the new app’s capabilities, demonstrating its ability to respond to voice commands, analyze math problems from live video feeds, and generate stories on the fly. While the app cannot generate video, it can produce still images representing frames of a video.

ChatGPT made waves when it was first introduced in late 2022, showcasing the potential for machines to handle requests in a more human-like manner. By analyzing vast amounts of text from the internet, including Wikipedia articles and chat logs, ChatGPT learned to answer questions, write papers, and even generate code.

As multimodal A.I. evolves, companies like OpenAI are combining chatbots with image, audio, and video generators to enhance their capabilities. Challenges persist despite these advances: chatbots can still provide inaccurate information, or “hallucinate.”

OpenAI’s new app, powered by GPT-4o, represents a significant leap forward in A.I. technology, offering a more efficient and seamless user experience. As the boundaries between chatbots and voice assistants blur, the future of human-machine interaction looks more promising than ever.
