Speech processing using embedding data

Invention Grant

US11282495B2 Speech processing using embedding data 有权

Please log in to see more content

Patent Title: Speech processing using embedding data
Application No.: US16712567

Application Date: 2019-12-12
Publication No.: US11282495B2

Publication Date: 2022-03-22
Inventor: Hongda Mao , George Yu-Chien Lin , Sundararajan Srinivasan , Chu-Cheng Hsieh
Applicant: Amazon Technologies, Inc.
Applicant Address: US WA Seattle
Assignee: Amazon Technologies, Inc.
Current Assignee: Amazon Technologies, Inc.
Current Assignee Address: US WA Seattle
Agency: Pierce Atwood LLP
Main IPC: G10L13/027
IPC: G10L13/027 ; G10L13/00 ; G10L17/04 ; G10L17/18

Abstract:

A first neural network model of a user device processes audio data to extract audio embeddings that represent vocal characteristics of a user of an utterance represented in the audio data. The audio embeddings may then be hashed to remove characteristics specific to the user while still maintaining a unique set of characteristics. The hashed embeddings may be sent to a remote system, which may use them to identify the user.

Public/Granted literature

US20210183358A1 SPEECH PROCESSING Public/Granted day:2021-06-17

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L13/00	语音合成；文本-语音合成系统
G10L13/02	.产生合成语音的方法；语音合成设备
G10L13/027	..概念－语音合成；从基于机器的概念产生自然词语（产生文本以外的语音合成参数的入G10L13/08）