SPEAKER EMBEDDING CONVERSION FOR BACKWARD AND CROSS-CHANNEL COMPATABILITY

Invention Application

US20230005486A1 SPEAKER EMBEDDING CONVERSION FOR BACKWARD AND CROSS-CHANNEL COMPATABILITY (Active)

Patent Title: SPEAKER EMBEDDING CONVERSION FOR BACKWARD AND CROSS-CHANNEL COMPATABILITY
Application No.: US17855149

Application Date: 2022-06-30
Publication No.: US20230005486A1

Publication Date: 2023-01-05
Inventor: Tianxiang Chen, Elie Khoury
Applicant: Pindrop Security, Inc.
Applicant Address: US GA Atlanta
Assignee: Pindrop Security, Inc.
Current Assignee: Pindrop Security, Inc.
Current Assignee Address: US GA Atlanta
Main IPC: G10L17/02
IPC: G10L17/02; G10L17/04; G10L17/18

Abstract:

Embodiments include a computer executing voice biometric machine-learning for speaker recognition. The machine-learning architecture includes embedding extractors that extract embeddings for enrollment or for verifying inbound speakers, and embedding convertors that convert enrollment voiceprints from a first type of embedding to a second type of embedding. The embedding convertor maps the feature vector space of the first type of embedding to the feature vector space of the second type of embedding. The embedding convertor takes as input enrollment embeddings of the first type of embedding and generates as output converted enrolled embeddings that are aggregated into a converted enrolled voiceprint of the second type of embedding. To verify an inbound speaker, a second embedding extractor generates an inbound voiceprint of the second type of embedding, and scoring layers determine a similarity between the inbound voiceprint and the converted enrolled voiceprint, both of which are the second type of embedding.
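
The flow described in the abstract (convert enrollment embeddings, aggregate them into a converted voiceprint, then score against an inbound voiceprint of the second type) can be illustrated with a minimal sketch. The abstract does not specify the form of the embedding convertor, so this example substitutes a plain least-squares linear map as a stand-in. The function names, the dimensions, the mean aggregation, the cosine scoring, and the synthetic data are all assumptions made for illustration and are not taken from the application.

```python
import numpy as np


def fit_linear_convertor(first_type, second_type):
    """Fit W so that first_type @ W approximates second_type (least squares).

    Assumption: a linear map stands in for the learned embedding convertor.
    """
    W, *_ = np.linalg.lstsq(first_type, second_type, rcond=None)
    return W


def convert_enrollment(enroll_first, W):
    """Convert first-type enrollment embeddings and aggregate them.

    Aggregation by mean followed by length normalization is an assumption.
    """
    converted = enroll_first @ W           # one converted embedding per row
    voiceprint = converted.mean(axis=0)    # converted enrolled voiceprint
    return voiceprint / np.linalg.norm(voiceprint)


def cosine_score(a, b):
    """Cosine similarity, used here as a hypothetical scoring layer."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


# Synthetic demo data (hypothetical): paired embeddings of both types.
rng = np.random.default_rng(0)
dim_first, dim_second = 8, 6
true_map = rng.normal(size=(dim_first, dim_second))
paired_first = rng.normal(size=(200, dim_first))
paired_second = paired_first @ true_map + 0.01 * rng.normal(size=(200, dim_second))

W = fit_linear_convertor(paired_first, paired_second)

# Enrollment was done with the first extractor; convert it once.
speaker = rng.normal(size=dim_first)
enroll_first = speaker + 0.05 * rng.normal(size=(3, dim_first))
converted_voiceprint = convert_enrollment(enroll_first, W)

# Inbound voiceprints come from the second extractor.
inbound_same = speaker @ true_map + 0.05 * rng.normal(size=dim_second)
inbound_other = rng.normal(size=dim_first) @ true_map

score_same = cosine_score(inbound_same, converted_voiceprint)
score_other = cosine_score(inbound_other, converted_voiceprint)
```

In this sketch both operands of the score live in the second embedding space, which mirrors the point of the abstract: existing enrollments are converted rather than re-collected, so a newer extractor can be used for inbound audio without re-enrolling speakers.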


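IPC Classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L17/00	Speaker identification or verification
G10L17/02	. Preprocessing operations, e.g. segment selection; pattern representation or modelling, e.g. based on linear discriminant analysis (LDA) or principal components; feature selection or extraction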