- Patent title: SPEAKER AWARENESS USING SPEAKER DEPENDENT SPEECH MODEL(S)
- Application number: EP22181074.0
- Filing date: 2019-12-04
- Publication number: EP4086904A1
- Publication date: 2022-11-09
- Inventors: MORENO, Ignacio Lopez; WANG, Quan; PELECANOS, Jason; WAN, Li; GRUENSTEIN, Alexander; ERDOGAN, Hakan
- Applicant: Google LLC
- Applicant address: 1600 Amphitheatre Parkway, Mountain View, CA 94043, US
- Representative: Robinson, David Edward Ashdown
- Main classification: G10L25/78
- IPC classification: G10L25/78; G10L25/30; G10L17/18; G10L15/07; G10L15/20; G10L17/04; G10L17/20
Abstract:
Techniques disclosed herein enable training and/or utilizing speaker dependent (SD) speech models that are personalizable to any user of a client device. Various implementations include personalizing an SD speech model for a target user by processing, using the SD speech model, a speaker embedding corresponding to the target user along with an instance of audio data. The same SD speech model can be personalized for an additional target user by processing, using the SD speech model, an additional speaker embedding, corresponding to the additional target user, along with another instance of audio data. Additional or alternative implementations include training the SD speech model based on a speaker independent (SI) speech model using teacher-student learning.
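The two ideas in the abstract can be illustrated with a toy sketch. Everything below is hypothetical and not taken from the application: the single-layer models, the two-dimensional audio frames and embeddings, the squared-error loss, and the choice to feed the teacher a clean frame and the student a noisy one are all assumptions made only to keep the example small. The sketch shows one set of SD weights being conditioned on different speaker embeddings, and an SD "student" being fitted to the output of a frozen SI "teacher".

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def si_model(weights, audio_frame):
    """Hypothetical speaker independent (teacher) model: audio only."""
    return sigmoid(sum(w * x for w, x in zip(weights, audio_frame)))

def sd_model(weights, audio_frame, speaker_embedding):
    """Hypothetical speaker dependent (student) model.

    The speaker embedding is concatenated with the audio frame, so the
    same weights serve any user; only the embedding changes.
    """
    inputs = list(audio_frame) + list(speaker_embedding)
    return sigmoid(sum(w * x for w, x in zip(weights, inputs)))

def distillation_loss(teacher_out, student_out):
    """Squared error between teacher and student outputs (an assumption)."""
    return (teacher_out - student_out) ** 2

def train_step(weights, audio_frame, speaker_embedding, teacher_out, lr):
    """One gradient-descent step on the student; the teacher stays frozen."""
    inputs = list(audio_frame) + list(speaker_embedding)
    s = sd_model(weights, audio_frame, speaker_embedding)
    # d(loss)/d(w_i) = -2 * (t - s) * s * (1 - s) * x_i
    scale = -2.0 * (teacher_out - s) * s * (1.0 - s)
    return [w - lr * scale * x for w, x in zip(weights, inputs)]

# Illustrative values only.
teacher_weights = [1.5, -0.5]
student_weights = [0.0, 0.0, 0.0, 0.0]
clean_frame = [0.9, 0.1]   # target user's speech alone (assumed teacher input)
noisy_frame = [0.7, 0.6]   # same speech plus other sounds (assumed student input)
embedding_a = [1.0, 0.0]   # embedding for target user A
embedding_b = [0.0, 1.0]   # embedding for a different user B

teacher_out = si_model(teacher_weights, clean_frame)
initial_loss = distillation_loss(
    teacher_out, sd_model(student_weights, noisy_frame, embedding_a))

for _ in range(200):
    student_weights = train_step(
        student_weights, noisy_frame, embedding_a, teacher_out, lr=0.5)

final_loss = distillation_loss(
    teacher_out, sd_model(student_weights, noisy_frame, embedding_a))

# Same weights, same audio, different embedding: the output changes.
out_a = sd_model(student_weights, noisy_frame, embedding_a)
out_b = sd_model(student_weights, noisy_frame, embedding_b)
```

In this sketch no per-user retraining happens at inference time: switching from `embedding_a` to `embedding_b` is the only change needed to condition the model on a different user, which is the sense in which a single SD model is personalizable to any user.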