Researchers recently designed an artificial intelligence system that can predict a person’s body movements from nothing but the sound of their voice.
The researchers collected 144 hours of video of 10 people speaking, including a nun, a chemistry teacher, and five TV show hosts (Conan O’Brien, Ellen DeGeneres, John Oliver, Jon Stewart, and Seth Meyers). They used an existing algorithm to produce skeletal figures representing the positions of the speakers’ arms and hands. They then trained their own algorithm on this data so that it could predict gestures from fresh audio of the speakers.
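To make the idea concrete, here is a toy sketch of that audio-to-gesture setup. It is not the researchers' model: it swaps in a plain least-squares linear fit, uses randomly generated numbers in place of real audio features and skeleton keypoints, and every name in it (`fit_audio_to_pose`, `predict_pose`, the array sizes) is made up for illustration.

```python
import numpy as np


def add_bias(features):
    """Append a constant column so the linear fit can learn an offset."""
    return np.hstack([features, np.ones((features.shape[0], 1))])


def fit_audio_to_pose(audio_features, pose_keypoints):
    """Fit a least-squares linear map from per-frame audio features
    to flattened (x, y) keypoints. A stand-in for a real learned model."""
    weights, *_ = np.linalg.lstsq(add_bias(audio_features), pose_keypoints, rcond=None)
    return weights


def predict_pose(weights, audio_features):
    """Predict flattened keypoints for new audio frames."""
    return add_bias(audio_features) @ weights


# Synthetic stand-ins (assumptions): 200 video frames, 8 audio features
# per frame, and 5 arm/hand joints with an (x, y) position each.
rng = np.random.default_rng(0)
n_frames, n_audio, n_joints = 200, 8, 5
audio = rng.normal(size=(n_frames, n_audio))
true_map = rng.normal(size=(n_audio, n_joints * 2))
poses = audio @ true_map + 0.01 * rng.normal(size=(n_frames, n_joints * 2))

# "Training": learn the mapping from paired audio and skeleton data.
weights = fit_audio_to_pose(audio, poses)

# "Fresh audio": predict a skeleton for 10 frames never seen in training.
fresh_audio = rng.normal(size=(10, n_audio))
predicted = predict_pose(weights, fresh_audio).reshape(10, n_joints, 2)
```

In this sketch, `predicted` holds one set of joint positions per audio frame, which is the same kind of output as the skeletal figures described above. A real system would need far richer audio features and a much more expressive model than a straight-line fit.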
This research is amazing. I can’t picture someone’s gestures just from hearing a recording of them; as a human being, the most I can recognize is the emotion. This technology is truly astonishing.
Video: Science Magazine / YouTube