OpenAI gives ChatGPT new powers to see, hear
OpenAI released a higher performing version of its AI technology, ChatGPT, making it free for all users. The update, GPT-4o, offers more human-like capabilities and can understand commands in voice, text, or images. This release comes ahead of Google's announcements about its AI tool, Gemini, which competes directly with ChatGPT.
At the launch event in San Francisco, OpenAI's Chief Technology Officer, Mira Murati, expressed excitement about the new model. GPT-4o, where "O" stands for omni, will be available to all users, with paid customers having unlimited access.
Enhanced Features
The new version introduces a voice and video mode, described by OpenAI CEO Sam Altman as the best computer interface, resembling AI from movies. The AI was inspired by the character in the film "Her," aiming for more natural interactions. The upgraded ChatGPT demonstrated its capabilities in various tasks, including interpreting languages, facial expressions, and solving complex problems.
Industry Standards
GPT-4o sets new industry standards for multilingual conversations, audio, and vision. It can interpret surroundings through a smartphone camera, providing detailed responses. Despite expectations for a competitive search tool with Google, OpenAI focused on enhancing ChatGPT's capabilities.
AI Arms Race
OpenAI's advancements are part of the ongoing AI arms race, with companies like Microsoft, Google, Meta, and Anthropic vying for dominance. Monetization remains a challenge, as the cost of generative AI is significant. OpenAI aims to find revenue streams while ensuring user access to advanced AI technology.
The release of GPT-4o signifies a step towards democratizing AI capabilities, raising questions about future monetization strategies. OpenAI's partnerships with publishers and ongoing legal battles reflect the complex landscape of AI development and deployment.