Invention Application
- Patent Title: END-TO-END SPEECH CONVERSION
-
Application No.: US17310732Application Date: 2019-11-26
-
Publication No.: US20220122579A1Publication Date: 2022-04-21
- Inventor: Fadi Biadsy , Ron J. Weiss , Aleksandar Kracun , Pedro J. Moreno Mengibar
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- International Application: PCT/US2019/063334 WO 20191126
- Main IPC: G10L13/02
- IPC: G10L13/02 ; G10L21/10 ; G10L25/30 ; G06N3/08 ; H04L51/02

Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for end to end speech conversion are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance of one or more first terms spoken by a user. The actions further include providing the first audio data as an input to a model that is configured to receive first given audio data in a first voice and output second given audio data in a synthesized voice without performing speech recognition on the first given audio data. The actions further include receiving second audio data of a second utterance of the one or more first terms spoken in the synthesized voice. The actions further include providing, for output, the second audio data of the second utterance of the one or more first terms spoken in the synthesized voice.
Public/Granted literature
- US12300216B2 End-to-end speech conversion Public/Granted day:2025-05-13
Information query