OpenAI enhances ChatGPT capabilities, rolling out new voice and image features

OpenAI has revealed the rollout of new voice and image capabilities for ChatGPT, promising a more intuitive and interactive experience for users.

The renowned artificial intelligence research lab stated that the new features are set to transform the way we interact with the ChatGPT platform.

These new features are expected to broaden ChatGPT’s functionality by enabling users to hold voice conversations with the artificial intelligence (AI) model and share images with it.

According to OpenAI, this enhancement will allow users to have a dynamic and immersive interaction with ChatGPT, making it more versatile and useful in various aspects of their daily lives.

“One of the key advantages of this update is the ability to engage in live voice conversations with ChatGPT. Users can now communicate with the AI model using their voices, opening up a plethora of possibilities. For instance, individuals can have real-time discussions with ChatGPT while on the go, request bedtime stories for their families, or even settle debates at the dinner table,” OpenAI noted in a blog post.

Additionally, the introduction of image-sharing capabilities is expected to revolutionize the way users interact with ChatGPT.

Users can now snap pictures of objects, scenes, or anything they want to discuss and share them with ChatGPT for a more visual and informative conversation.

This feature can be particularly useful for travellers who want to learn more about landmarks they come across or for home cooks who want to get creative in the kitchen.

OpenAI has outlined a range of practical applications for these new capabilities.

“Travelers can snap pictures of landmarks and engage in live conversations about their historical significance. At home, users can photograph their fridge and pantry to help decide what to prepare for dinner and even ask ChatGPT for step-by-step recipes. Parents can also use this feature to assist their children with math problems by sharing images of the math exercises, allowing ChatGPT to provide hints and explanations.”

OpenAI has a clear plan for the rollout of these new features. Voice capabilities will be made available to Plus and Enterprise users on both iOS and Android, where users can opt in through the settings.

Meanwhile, image-sharing functionality will be accessible to users on all platforms.

With these enhancements, OpenAI aims to make ChatGPT a more integral part of users’ daily lives, offering them a dynamic and interactive AI companion capable of handling a wide range of tasks.

As these features are gradually rolled out over the next two weeks, ChatGPT will likely become an even more indispensable tool for individuals and businesses alike.

OpenAI also recently unveiled DALL·E 3, the latest iteration of its groundbreaking text-to-image generation system.

DALL·E 3, now in research preview, is set to be accessible to ChatGPT Plus and Enterprise customers starting in October, with availability via the API and in Labs to follow later this fall.

“This innovative system promises to bridge the gap between text and images with exceptional precision, eliminating the need for intricate prompt engineering.”

DALL·E 3 is built natively on the ChatGPT platform, allowing users to seamlessly collaborate with ChatGPT to generate detailed and tailored image prompts. With DALL·E 3, users can effortlessly transform their textual descriptions into highly accurate and visually engaging images, making it a powerful tool for creative endeavours and content generation.

“Furthermore, OpenAI emphasizes its commitment to safety, with measures in place to prevent the generation of violent, adult, or hateful content, as well as addressing potential biases and ethical concerns.”

DALL·E 3 represents a significant leap forward in AI-generated image synthesis, surpassing its predecessor DALL·E 2 in terms of image quality and accuracy.

Users will soon have the ability to employ ChatGPT as a brainstorming partner and prompt refiner, enabling them to bring their creative ideas to life.

“OpenAI’s focus on safety is evident in DALL·E 3, with precautions in place to mitigate risks associated with generating public figures and harmful biases. Additionally, OpenAI is actively working on tools to help users identify AI-generated images and allows creators to opt out of future model training to maintain creative control over their generated content.”

