While my last post focused on flipping and manipulating vocals, this post focuses on... creating them from thin air?
This is a computer-generated clip of Jay-Z rapping The Tragedy of Darth Plagueis the Wise.
The creator uses Google's Tacotron 2 text-to-speech (TTS) system to make these amazing, yet also terrifying, clips.
I say terrifying, but I should clarify that these powerful tools don't need to be terrifying. Check out this VentureBeat article that dives into the security, privacy, and ethical considerations of AI imitation.
For more info, the creator of the video, u/disumbrationist, also posts in the r/VocalSynthesis subreddit. There you'll find videos that were removed from YouTube for copyright strikes, as well as discussions on speech synthesis, deepfakes, AI, and machine learning.