Improving Speech Recognition with Speech Synthesis-based Model Adapation

Invention application

US20230058447A1 Improving Speech Recognition with Speech Synthesis-based Model Adapation (in force)

Patent title: Improving Speech Recognition with Speech Synthesis-based Model Adapation
Application number: US17445537

Filing date: 2021-08-20
Publication number: US20230058447A1

Publication date: 2023-02-23
Inventors: Andrew Rosenberg, Bhuvana Ramabhadran
Applicant: Google LLC
Applicant address: Mountain View, CA, US
Patentee: Google LLC
Current patentee: Google LLC
Current patentee address: Mountain View, CA, US
Main classification: G10L21/007
IPC classifications: G10L21/007; G10L15/26; G10L25/30; G06N3/08

Abstract:

A method for training a speech recognition model includes obtaining sample utterances of synthesized speech in a target domain, obtaining transcribed utterances of non-synthetic speech in the target domain, and pre-training the speech recognition model on the sample utterances of synthesized speech in the target domain to attain an initial state for warm-start training. After pre-training the speech recognition model, the method also includes warm-start training the speech recognition model on the transcribed utterances of non-synthetic speech in the target domain to teach the speech recognition model to learn to recognize real/human speech in the target domain.
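
The two-stage schedule described in the abstract (pre-train on synthesized speech, then warm-start on transcribed real speech) can be illustrated with a toy sketch. This is not the patented implementation: a two-parameter linear regressor stands in for the speech recognition model, and the data sets, the `train` and `mse` helpers, the learning rates, and the epoch counts are all hypothetical choices made for illustration. It only shows the idea that weights learned on plentiful synthetic data serve as the initial state for training on scarce real data.

```python
import random


def train(weights, data, lr=0.1, epochs=20):
    """Run plain SGD on squared error for y = w0 + w1 * x and return new weights."""
    w0, w1 = weights
    for _ in range(epochs):
        for x, y in data:
            err = (w0 + w1 * x) - y
            w0 -= lr * err
            w1 -= lr * err * x
    return (w0, w1)


def mse(weights, data):
    """Mean squared error of the linear model on a data set."""
    w0, w1 = weights
    return sum(((w0 + w1 * x) - y) ** 2 for x, y in data) / len(data)


rng = random.Random(0)

# Hypothetical stand-in for synthesized target-domain utterances:
# plentiful, noise-free, but slightly mismatched from the real data.
synthetic = [(x, 2.0 * x + 1.0)
             for x in (rng.uniform(-1, 1) for _ in range(200))]

# Hypothetical stand-in for transcribed non-synthetic utterances:
# scarce and noisy, drawn from the true target relationship.
real = [(x, 2.2 * x + 0.9 + rng.gauss(0, 0.05))
        for x in (rng.uniform(-1, 1) for _ in range(10))]

init = (0.0, 0.0)

# Stage 1: pre-train on synthetic data to obtain the warm-start state.
pretrained = train(init, synthetic, lr=0.1, epochs=20)

# Stage 2: warm-start training on real data, beginning from that state.
warm = train(pretrained, real, lr=0.05, epochs=5)

# Baseline for comparison: the same real-data budget from a cold start.
cold = train(init, real, lr=0.05, epochs=5)

print("cold-start MSE on real data:", mse(cold, real))
print("warm-start MSE on real data:", mse(warm, real))
```

In this toy setting the warm-started model ends with a lower error on the real data than the cold-started one under the same real-data training budget, which mirrors the motivation given in the abstract.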

Publication/grant documents

US11823697B2 Improving speech recognition with speech synthesis-based model adapation. Publication/grant date: 2023-11-21

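IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L21/00	Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility (G10L19/00 takes precedence)
G10L21/003	. Changing voice quality, e.g. pitch or formants
G10L21/007	.. characterised by the process used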