Microsoft, a leading provider of technology, has unveiled VALL-E, a new artificial intelligence (AI) tool that can quickly reproduce any voice.
According to Gizmochina, the tool was trained using 60,000 hours of English speech data.
Furthermore, it can mimic the speaker’s tone and emotions, which prior models could not.
Concerns exist, though, regarding the ethical implications of this novel technology.
VALL-E and similar technology could make spam calls more believable by convincingly mimicking the voices of real people a prospective victim knows.
Another potential misuse is impersonating politicians and other well-known people, which could lead to the dissemination of false information on social media.
Additionally, some banks use voice recognition technology to verify a caller’s identity, and AI-generated voices could make it harder to determine whether a caller is legitimate.
Therefore, according to the paper, it is crucial that Microsoft develop policies to control the use of VALL-E and ensure that it is used for good and not for nefarious purposes.