Astonishing Demos of OpenAI's GPT-4o

OpenAI has unveiled GPT-4o, its latest flagship AI model. It can not only speak in a disturbingly lifelike way, complete with surprise, chuckles and the like, but can also use your camera to deduce what’s going on around you. It has to be seen and heard to be believed.

OpenAI won’t allow the videos to be embedded, so head to OpenAI’s GPT-4o page to get your socks knocked off. The demo atop the page gives you the general gist, and there are more demonstrations below (customer service, interview preparation, sarcasm, two AIs talking to each other, etc.). Perhaps most disturbing is when the AI starts talking to a dog, more or less perfectly nailing the way humans speak to dogs.

“GPT-4o (‘o’ for ‘omni’) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation.”

Source: core77

