Speaker template update with embedding vectors based on distance metric

发明授权

US11017783B2 Speaker template update with embedding vectors based on distance metric 有权

请登陆查看更多内容

专利标题： Speaker template update with embedding vectors based on distance metric
申请号： US16296733

申请日： 2019-03-08
公开(公告)号： US11017783B2

公开(公告)日： 2021-05-25
发明人: Sunkuk Moon , Bicheng Jiang , Erik Visser
申请人： QUALCOMM Incorporated
申请人地址： US CA San Diego
专利权人： QUALCOMM Incorporated
当前专利权人： QUALCOMM Incorporated
当前专利权人地址： US CA San Diego
代理机构： Moore Intellectual Property Law, PLLC
主分类号： G10L17/04
IPC分类号： G10L17/04 ; G10L17/08 ; G10L17/18 ; G10L17/22 ; G10L17/06 ; G10L17/02 ; G10L17/00

Speaker template update with embedding vectors based on distance metric

摘要：

A device includes a processor configured to determine a feature vector based on an utterance and to determine a first embedding vector by processing the feature vector using a trained embedding network. The processor is configured to determine a first distance metric based on distances between the first embedding vector and each embedding vector of a speaker template. The processor is configured to determine, based on the first distance metric, that the utterance is verified to be from a particular user. The processor is configured to, based on a comparison of a first particular distance metric associated with the first embedding vector to a second distance metric associated with a first test embedding vector of the speaker template, generate an updated speaker template by adding the first embedding vector as a second test embedding vector and removing the first test embedding vector from test embedding vectors of the speaker template.

公开/授权文献

US20200286491A1 SPEAKER VERIFICATION BASED ON A SPEAKER TEMPLATE 公开/授权日：2020-09-10

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L17/00	讲话者辨认或验证
G10L17/04	.训练，登记或模型的建立