Alibaba Group has introduced a new artificial intelligence tool. Called Emote Portrait Alive (EMO), the tool can turn any photo into a video and give that video a voice. It can make the people in photographs speak with mouth movements that match the chosen audio.
EMO can automatically adjust the pace of speech to match the audio source used in the video. As a result, gestures and facial expressions appear more consistent in the video.
Experts note that the tool consists of two components. The first analyzes the reference image and generates moving frames from it. The second analyzes the audio file and identifies its key points. A video is then created by matching these key points with the frames.
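The matching step described above can be illustrated with a small sketch. This is not EMO's actual code, and the article gives no implementation details. It is a toy Python example that assumes the audio key points are already available as timestamps and simply maps each one to the nearest video frame. The names `AudioKeyPoint` and `match_key_points_to_frames`, the labels, and the 25 fps frame rate are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class AudioKeyPoint:
    """Hypothetical record for one important moment in the audio track."""
    time_s: float  # position in the audio, in seconds
    label: str     # illustrative tag, e.g. a stressed syllable


def match_key_points_to_frames(key_points, fps, frame_count):
    """Map each audio key point to the index of the closest video frame.

    Indices are clamped so that key points beyond the end of the clip
    land on the last available frame.
    """
    matches = {}
    for kp in key_points:
        idx = round(kp.time_s * fps)              # nearest frame index
        idx = max(0, min(frame_count - 1, idx))   # keep inside the clip
        matches.setdefault(idx, []).append(kp.label)
    return matches


# Toy input: four key points in a 2-second clip at an assumed 25 fps.
points = [
    AudioKeyPoint(0.0, "start"),
    AudioKeyPoint(0.52, "stress"),
    AudioKeyPoint(1.2, "peak"),
    AudioKeyPoint(3.0, "end"),  # past the clip, so clamped to the last frame
]
frames = match_key_points_to_frames(points, fps=25, frame_count=50)
print(frames)
```

A real system would learn this alignment from data rather than rely on fixed timestamps; the sketch only shows the idea of pairing audio events with frames.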
The researchers compiled data from a wide variety of sources to train EMO, using a large dataset of more than 250 hours of talking video. The dataset was drawn from sources such as speeches, films, television shows, and singing performances.
Experiments found that EMO performs significantly better than other methods on metrics such as video quality and expressiveness. In addition, user studies found that the videos produced by EMO are more natural and emotional than those produced by other systems.
Although the world's leading technology companies are competing in the artificial intelligence field, there are concerns about the misuse of such technologies. That is why researchers continue to work on solutions to detect and prevent the misuse of synthetic videos.
You can take a look at sample videos of EMO here.
Source link: https://webrazzi.com/2024/02/29/alibaba-dan-fotograflari-konusturan-yapay-zeka-araci-emo/