Invention Grant
- Patent Title: Convolutional neural network with phonetic attention for speaker verification
-
Application No.: US17665862Application Date: 2022-02-07
-
Publication No.: US11776548B2Publication Date: 2023-10-03
- Inventor: Yong Zhao , Tianyan Zhou , Jinyu Li , Yifan Gong , Jian Wu , Zhuo Chen
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: WORKMAN NYDEGGER
- Main IPC: G10L17/14
- IPC: G10L17/14 ; G10L17/18 ; G06N3/08 ; G10L17/02

Abstract:
Embodiments may include determination, for each of a plurality of speech frames associated with an acoustic feature, of a phonetic feature based on the associated acoustic feature, generation of one or more two-dimensional feature maps based on the plurality of phonetic features, input of the one or more two-dimensional feature maps to a trained neural network to generate a plurality of speaker embeddings, and aggregation of the plurality of speaker embeddings into a speaker embedding based on respective weights determined for each of the plurality of speaker embeddings, wherein the speaker embedding is associated with an identity of the speaker.
Public/Granted literature
- US20220157324A1 CONVOLUTIONAL NEURAL NETWORK WITH PHONETIC ATTENTION FOR SPEAKER VERIFICATION Public/Granted day:2022-05-19
Information query