- Patent title: TRAINING SPEECH RECOGNITION SYSTEMS USING WORD SEQUENCES
- Application number: US16997587
- Filing date: 2020-08-19
- Publication number: US20220059077A1
- Publication date: 2022-02-24
- Inventor: David Thomson
- Applicant: Sorenson IP Holdings, LLC
- Applicant address: US UT Salt Lake City
- Patentee: Sorenson IP Holdings, LLC
- Current patentee: Sorenson IP Holdings, LLC
- Current patentee address: US UT Salt Lake City
- Main classification: G10L15/065
- IPC classification: G10L15/065 ; G10L15/06 ; G10L15/19 ; G10L25/51 ; G06F21/60 ; H04L9/08

Abstract:
A method may include obtaining a text string that is a transcription of audio data and selecting a sequence of words from the text string as a first word sequence. The method may further include encrypting the first word sequence and comparing the encrypted first word sequence to multiple encrypted word sequences. Each of the multiple encrypted word sequences may be associated with a corresponding one of multiple counters. The method may also include, in response to the encrypted first word sequence corresponding to one of the multiple encrypted word sequences based on the comparison, incrementing the counter associated with that encrypted word sequence, and adapting a language model of an automatic transcription system using the multiple encrypted word sequences and the multiple counters.
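
The counting flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and it rests on several assumptions: a deterministic keyed digest (HMAC-SHA256) stands in for the "encrypting" step so that equal word sequences yield equal tokens, the table of encrypted sequences is seeded in advance, sequences with no match in the table are skipped, and relative frequencies stand in for language model adaptation. All names (`SECRET_KEY`, `word_sequences`, `protect`, `update_counts`, `relative_frequencies`) are hypothetical.

```python
import hashlib
import hmac

# Hypothetical key for illustration only; a real system would manage keys securely.
SECRET_KEY = b"demo-key-not-for-production"


def word_sequences(text, n):
    """Return every run of n consecutive words (n-grams) in the text."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def protect(sequence, key=SECRET_KEY):
    """Keyed digest used here as a stand-in for encrypting a word sequence.

    Assumption: the scheme must be deterministic so that protected
    sequences can be compared without revealing the plaintext words.
    """
    return hmac.new(key, sequence.encode("utf-8"), hashlib.sha256).hexdigest()


def update_counts(transcript, counts, n=2):
    """Increment the counter of each protected sequence already in the table."""
    for seq in word_sequences(transcript, n):
        token = protect(seq)
        if token in counts:      # compare against the stored protected sequences
            counts[token] += 1   # match found: increment its counter
        # Assumption: sequences with no match are ignored in this sketch.
    return counts


def relative_frequencies(counts):
    """Turn counters into relative frequencies, a stand-in for LM adaptation."""
    total = sum(counts.values())
    if total == 0:
        return {token: 0.0 for token in counts}
    return {token: c / total for token, c in counts.items()}


# Usage: seed a table with two protected bigrams, then process one transcript.
table = {protect("thank you"): 0, protect("you for"): 0}
update_counts("Thank you for calling thank you", table, n=2)
weights = relative_frequencies(table)
```

Because the table stores only digests and counters, the statistics can be accumulated and fed to the model without keeping the plaintext word sequences alongside them, which is the privacy motivation the abstract points to.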