Google Translatotron can translate speech in speaker's voice

Google is developing a new translation model called Translatotron that converts speech in one language directly into speech in another while preserving the speaker's voice. It skips the usual intermediate steps of transcribing speech to text, translating that text, and synthesizing speech from the result. Translatotron uses a sequence-to-sequence network that takes the voice input as a spectrogram and generates a new spectrogram in the target language.
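To make the spectrogram idea concrete, here is a minimal sketch of how a waveform can be turned into a magnitude spectrogram, the kind of time-frequency representation such a model reads and writes. It is not Translatotron's actual preprocessing: the function name `magnitude_spectrogram`, the frame length, the hop size, and the test tone are all illustrative assumptions, and the translation network itself is not shown.

```python
import cmath
import math


def magnitude_spectrogram(samples, frame_len=64, hop=32):
    """Split samples into overlapping frames and return per-frame DFT magnitudes.

    Hypothetical helper for illustration; frame_len and hop are arbitrary choices.
    """
    # A Hann window tapers each frame's edges to reduce spectral leakage.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len) for n in range(frame_len)]
    n_bins = frame_len // 2 + 1  # keep non-negative frequencies only
    spectrogram = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = [samples[start + n] * window[n] for n in range(frame_len)]
        # Naive discrete Fourier transform of the windowed frame.
        row = [
            abs(sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                    for n in range(frame_len)))
            for k in range(n_bins)
        ]
        spectrogram.append(row)
    return spectrogram


# Toy input (an assumption, not real speech): a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(512)]
spec = magnitude_spectrogram(tone)

# Each row is one time frame; each column is one frequency bin.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 64
```

Each row of `spec` describes which frequencies are present during one short slice of audio. A speech-to-speech model of the kind described above would map a sequence of such rows in the source language to a new sequence in the target language, which a separate step then converts back to audio.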
