SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)

发明公开

EP4086904A1 SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S) 审中-公开

请登陆查看更多内容

专利标题： SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)
申请号： EP22181074.0

申请日： 2019-12-04
公开(公告)号： EP4086904A1

公开(公告)日： 2022-11-09
发明人: MORENO, Ignacio Lopez , WANG, Quan , PELECANOS, Jason , WAN, Li , GRUENSTEIN, Alexander , ERDOGAN, Hakan
申请人： Google LLC
申请人地址： US Mountain View, CA 94043 1600 Amphitheatre Parkway
代理机构： Robinson, David Edward Ashdown
主分类号： G10L25/78
IPC分类号： G10L25/78 ; G10L25/30 ; G10L17/18 ; G10L15/07 ; G10L15/20 ; G10L17/04 ; G10L17/20

SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)

摘要：

Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models which are personalizable to any user of a client device. Various implementations include personalizing a SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent speech model using teacher student learning.

信息查询

Global Dossier Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/78	.语音信号存在或不存在的检测（在双向扩音电话系统中通过语音频率切换传输的方向入H04M9/10）