TRAINING SPEECH RECOGNITION SYSTEMS USING WORD SEQUENCES

Patent application

US20220059077A1 TRAINING SPEECH RECOGNITION SYSTEMS USING WORD SEQUENCES (in force)


Patent title: TRAINING SPEECH RECOGNITION SYSTEMS USING WORD SEQUENCES
Application number: US16997587

Filing date: 2020-08-19
Publication number: US20220059077A1

Publication date: 2022-02-24
Inventor: David Thomson
Applicant: Sorenson IP Holdings, LLC
Applicant address: US UT Salt Lake City
Patentee: Sorenson IP Holdings, LLC
Current patentee: Sorenson IP Holdings, LLC
Current patentee address: US UT Salt Lake City
Main classification: G10L15/065
IPC classification: G10L15/065 ; G10L15/06 ; G10L15/19 ; G10L25/51 ; G06F21/60 ; H04L9/08


Abstract:

A method may include obtaining a text string that is a transcription of audio data and selecting a sequence of words from the text string as a first word sequence. The method may further include encrypting the first word sequence and comparing the encrypted first word sequence to multiple encrypted word sequences. Each of the multiple encrypted word sequences may be associated with a corresponding one of multiple counters. The method may also include, in response to the encrypted first word sequence corresponding to one of the multiple encrypted word sequences based on the comparison, incrementing the counter associated with that encrypted word sequence and adapting a language model of an automatic transcription system using the multiple encrypted word sequences and the multiple counters.
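
The counting steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not name an encryption scheme or a sequence length, so the sketch assumes three-word sequences and substitutes a deterministic keyed hash (HMAC-SHA256) for the encryption step, which still lets equal word sequences be matched without storing their plaintext. The names `SECRET_KEY`, `word_sequences`, `protect`, `update_counters`, and `relative_frequencies` are hypothetical, and starting a new counter for an unseen sequence is an assumption the abstract does not state.

```python
import hashlib
import hmac

# Hypothetical key for illustration; a real system would load it from secure storage.
SECRET_KEY = b"example-key-not-for-production"
N = 3  # assumed word-sequence length; the abstract does not fix one


def word_sequences(text, n=N):
    """Return every run of n consecutive words in the transcription."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def protect(sequence, key=SECRET_KEY):
    """Keyed hash standing in for the encryption step (an assumption)."""
    return hmac.new(key, sequence.encode("utf-8"), hashlib.sha256).hexdigest()


def update_counters(text, counters):
    """Compare each protected sequence to the stored ones and count matches."""
    for seq in word_sequences(text):
        token = protect(seq)
        if token in counters:
            counters[token] += 1  # match found: increment its counter
        else:
            counters[token] = 1   # assumption: unseen sequences start a new counter
    return counters


def relative_frequencies(counters):
    """Turn counts into relative frequencies, one possible input to LM adaptation."""
    total = sum(counters.values())
    if total == 0:
        return {}
    return {token: count / total for token, count in counters.items()}


counters = {}
update_counters("please call me back tomorrow", counters)
update_counters("please call me back later", counters)
freqs = relative_frequencies(counters)
```

After the two calls, the table holds four protected three-word sequences; the two that occur in both transcriptions have a count of 2. Only the protected tokens and their counts are kept, so the frequencies could feed a language-model update without the table exposing the original words.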


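IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/06	. Creation of reference templates; training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)
G10L15/065	.. Adaptation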