Convolutional neural network with phonetic attention for speaker verification

Invention Grant

US11776548B2 Convolutional neural network with phonetic attention for speaker verification 有权

Please log in to see more content

Patent Title: Convolutional neural network with phonetic attention for speaker verification
Application No.: US17665862

Application Date: 2022-02-07
Publication No.: US11776548B2

Publication Date: 2023-10-03
Inventor: Yong Zhao , Tianyan Zhou , Jinyu Li , Yifan Gong , Jian Wu , Zhuo Chen
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: WORKMAN NYDEGGER
Main IPC: G10L17/14
IPC: G10L17/14 ; G10L17/18 ; G06N3/08 ; G10L17/02

Convolutional neural network with phonetic attention for speaker verification

Abstract:

Embodiments may include determination, for each of a plurality of speech frames associated with an acoustic feature, of a phonetic feature based on the associated acoustic feature, generation of one or more two-dimensional feature maps based on the plurality of phonetic features, input of the one or more two-dimensional feature maps to a trained neural network to generate a plurality of speaker embeddings, and aggregation of the plurality of speaker embeddings into a speaker embedding based on respective weights determined for each of the plurality of speaker embeddings, wherein the speaker embedding is associated with an identity of the speaker.

Public/Granted literature

US20220157324A1 CONVOLUTIONAL NEURAL NETWORK WITH PHONETIC ATTENTION FOR SPEAKER VERIFICATION Public/Granted day:2022-05-19

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L17/00	讲话者辨认或验证
G10L17/06	.决策方法，模式适配策略
G10L17/14	..在说话者识别或确认之前使用语音分类或语音识别