End-to-end speaker recognition using deep neural network

Invention Grant

US11468901B2 End-to-end speaker recognition using deep neural network 有权

Please log in to see more content

Patent Title: End-to-end speaker recognition using deep neural network
Application No.: US16536293

Application Date: 2019-08-08
Publication No.: US11468901B2

Publication Date: 2022-10-11
Inventor: Elie Khoury , Matthew Garland
Applicant: PINDROP SECURITY, INC.
Applicant Address: US GA Atlanta
Assignee: PINDROP SECURITY, INC.
Current Assignee: PINDROP SECURITY, INC.
Current Assignee Address: US GA Atlanta
Agency: Foley & Lardner LLP
Main IPC: G10L17/08
IPC: G10L17/08 ; G10L15/16 ; G10L17/02 ; G10L17/04 ; G10L17/22 ; G06N3/04 ; G06N3/08 ; G10L17/18

End-to-end speaker recognition using deep neural network

Abstract:

The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.

Public/Granted literature

US20190392842A1 END-TO-END SPEAKER RECOGNITION USING DEEP NEURAL NETWORK Public/Granted day:2019-12-26

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L17/00	讲话者辨认或验证
G10L17/06	.决策方法，模式适配策略
G10L17/08	..在探测模型和基准模板二者间使用特定距离或失真度量