END-TO-END SPEECH CONVERSION

Invention Application

US20220122579A1 END-TO-END SPEECH CONVERSION (Granted)

Patent Title: END-TO-END SPEECH CONVERSION
Application No.: US17310732

Application Date: 2019-11-26
Publication No.: US20220122579A1

Publication Date: 2022-04-21
Inventor: Fadi Biadsy, Ron J. Weiss, Aleksandar Kracun, Pedro J. Moreno Mengibar
Applicant: Google LLC
Applicant Address: US CA Mountain View
Assignee: Google LLC
Current Assignee: Google LLC
Current Assignee Address: US CA Mountain View
International Application: PCT/US2019/063334 WO 2019-11-26
Main IPC: G10L13/02
IPC: G10L13/02; G10L21/10; G10L25/30; G06N3/08; H04L51/02

Abstract:

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for end to end speech conversion are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance of one or more first terms spoken by a user. The actions further include providing the first audio data as an input to a model that is configured to receive first given audio data in a first voice and output second given audio data in a synthesized voice without performing speech recognition on the first given audio data. The actions further include receiving second audio data of a second utterance of the one or more first terms spoken in the synthesized voice. The actions further include providing, for output, the second audio data of the second utterance of the one or more first terms spoken in the synthesized voice.

Public/Granted literature

US12300216B2 End-to-end speech conversion Publication/Grant Date: 2025-05-13

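IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L13/00	Speech synthesis; text-to-speech systems
G10L13/02	.Methods for producing synthetic speech; speech synthesisers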