Tacotron-2 and WaveGlow models for Audio Deepfake generation
AI models learn from recorded speech data to create natural-sounding voices that can read digital text. Audio deepfakes are therefore closely related to text-to-speech (TTS) technology, which converts text input into spoken audio. It can read aloud PDFs, websites,