New ChatGPT AI Watches Through Your Camera, Offers Advice on ...
Today, OpenAI showed off its latest large language model (LLM) called GPT-4o — that's a lower-case "o," for "omni" — that the company promises can "reason across audio, vision, and text in real time." During its brief announcement, the company demonstrated the AI's uncanny ability to assess what it "sees" through the user's smartphone camera, allowing it to help solve math problems and even assist with coding.
Available to All ChatGPT Users
OpenAI is making the new model "available to all ChatGPT users, including on the free plan" per OpenAI CEO Sam Altman. "So far, GPT-4 class models have only been available to people who pay a monthly subscription." It's arguably a natural evolution of the popular AI chatbot; by harnessing a live video stream, the assistant could likely be more helpful by benefiting from far more context.
Advancements in AI Technology
OpenAI's GPT-4o is leveraging the computing power of modern smartphones to provide a seamless experience, with minimal delays between a user's question and the AI's response. The model can respond to audio inputs in as little as 232 milliseconds, similar to human response time in a conversation. This efficiency is achieved by processing all inputs and outputs through the same neural network, eliminating the need to transcribe text.
Moreover, the new model offers a more natural and emotional interaction, with a lifelike female voice that can pick up on tone and emotions of the user in real-time. This advancement brings it a lot closer to human-like conversations.
Potential Challenges and Future Developments
While the demonstration of ChatGPT's capabilities is impressive, it is essential to approach these technological advancements with caution. Tech demos, while promising, may not always reflect real-world scenarios accurately. Ensuring the AI's effectiveness in responding to live smartphone camera feeds in various environments remains a challenge.
OpenAI continues to address potential issues such as AI hallucinations and biases to enhance the model's reliability. Despite the progress showcased, there is still room for improvement and further developments in the field of AI technology.