- Patent Title: Generalized negative log-likelihood loss for speaker verification
- Application No.: US17031755
- Application Date: 2020-09-24
- Publication No.: US11328733B2
- Publication Date: 2022-05-10
- Inventor: Saeed Mosayyebpour Kaskari, Atabak Pouya
- Applicant: SYNAPTICS INCORPORATED
- Applicant Address: US CA San Jose
- Assignee: SYNAPTICS INCORPORATED
- Current Assignee: SYNAPTICS INCORPORATED
- Current Assignee Address: US CA San Jose
- Agency: Paradice & Li LLP
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G10L15/16 ; G10L17/04 ; G10L15/02 ; G10L17/18

Abstract:
Systems and methods for speaker verification comprise optimizing a neural network by minimizing a generalized negative log-likelihood function, including receiving a training batch of audio samples comprising a plurality of utterances for each of a plurality of speakers, extracting features from the audio samples to generate a batch of features, processing the batch of features using a neural network to generate a plurality of embedding vectors configured to differentiate audio samples by speaker, computing a generalized negative log-likelihood (GNLL) loss value for the training batch based, at least in part, on the embedding vectors, and modifying weights of the neural network to reduce the GNLL value. Computing the GNLL value may include generating a centroid vector for each of the plurality of speakers, based at least in part on the embedding vectors.
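
The abstract's loss step (per-speaker centroids computed from the embedding vectors, then a negative log-likelihood over the batch) can be illustrated with a minimal NumPy sketch. The abstract does not give the GNLL formula, so the loss below is an assumed stand-in: a softmax negative log-likelihood over scaled cosine similarity between each utterance embedding and every speaker centroid. The function names, the `scale` parameter, and the batch layout `(speakers, utterances, dim)` are all hypothetical and are not taken from the patent.

```python
import numpy as np


def speaker_centroids(emb):
    """Mean embedding per speaker: (speakers, utterances, dim) -> (speakers, dim)."""
    return emb.mean(axis=1)


def centroid_nll_loss(emb, scale=10.0):
    """Assumed stand-in for the GNLL value (NOT the patented formula).

    Treats the softmax over scaled cosine similarities to all speaker
    centroids as a likelihood and returns the mean negative
    log-likelihood of each utterance's true speaker.
    """
    n_spk = emb.shape[0]
    # L2-normalise embeddings and centroids so dot products are cosines.
    emb = emb / np.linalg.norm(emb, axis=-1, keepdims=True)
    cen = speaker_centroids(emb)
    cen = cen / np.linalg.norm(cen, axis=-1, keepdims=True)
    # sim[j, i, k]: utterance i of speaker j against centroid k.
    sim = scale * np.einsum("jid,kd->jik", emb, cen)
    # Numerically stable log-softmax over the centroid axis.
    sim = sim - sim.max(axis=-1, keepdims=True)
    log_prob = sim - np.log(np.exp(sim).sum(axis=-1, keepdims=True))
    # Select the log-probability of the true speaker for every utterance.
    idx = np.arange(n_spk)
    true_log_prob = log_prob[idx, :, idx]  # shape (speakers, utterances)
    return float(-true_log_prob.mean())


# Toy batch: 3 speakers, 4 utterances each, 8-dimensional embeddings.
rng = np.random.default_rng(0)
separated = np.eye(8)[:3][:, None, :] + 0.01 * rng.normal(size=(3, 4, 8))
collapsed = np.ones((3, 4, 8))  # every speaker maps to the same vector

loss_separated = centroid_nll_loss(separated)
loss_collapsed = centroid_nll_loss(collapsed)
```

In this sketch, embeddings that cluster tightly by speaker give a loss near zero, while embeddings that do not separate speakers at all give a loss of log(number of speakers). The weight update described in the abstract would, in practice, be carried out by differentiating such a loss with an automatic-differentiation framework; that step is omitted here.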
Public/Granted literature
- US20220093106A1 GENERALIZED NEGATIVE LOG-LIKELIHOOD LOSS FOR SPEAKER VERIFICATION Public/Granted day: 2022-03-24