COMPUTER SYSTEMS EXHIBITING IMPROVED COMPUTER SPEED AND TRANSCRIPTION ACCURACY OF AUTOMATIC SPEECH TRANSCRIPTION (AST) BASED ON A MULTIPLE SPEECH-TO-TEXT ENGINES AND METHODS OF USE THEREOF

Invention application

US20230114591A1 COMPUTER SYSTEMS EXHIBITING IMPROVED COMPUTER SPEED AND TRANSCRIPTION ACCURACY OF AUTOMATIC SPEECH TRANSCRIPTION (AST) BASED ON A MULTIPLE SPEECH-TO-TEXT ENGINES AND METHODS OF USE THEREOF (status: in force)

Patent title: COMPUTER SYSTEMS EXHIBITING IMPROVED COMPUTER SPEED AND TRANSCRIPTION ACCURACY OF AUTOMATIC SPEECH TRANSCRIPTION (AST) BASED ON A MULTIPLE SPEECH-TO-TEXT ENGINES AND METHODS OF USE THEREOF
Application number: US18060351

Filing date: 2022-11-30
Publication number: US20230114591A1

Publication date: 2023-04-13
Inventors: Tejas Shastry, Matthew Goldey, Svyat Vergun
Applicant: GREEN KEY TECHNOLOGIES, INC.
Applicant address: GB LONDON
Patentee: GREEN KEY TECHNOLOGIES, INC.
Current patentee: GREEN KEY TECHNOLOGIES, INC.
Current patentee address: GB LONDON
Main classification: G10L15/26
IPC classifications: G10L15/26; G10L15/04; G10L15/22; G10L17/00; G10L25/78

Abstract:

In some embodiments, an exemplary inventive system for improving the computer speed and accuracy of automatic speech transcription includes at least a computer processor configured to perform: generating a recognition model specification for a plurality of distinct speech-to-text transcription engines, where each distinct speech-to-text transcription engine corresponds to a respective distinct speech recognition model; receiving at least one audio recording representing speech of a person; segmenting the audio recording into a plurality of audio segments; determining a respective distinct speech-to-text transcription engine to transcribe a respective audio segment; receiving, from the respective transcription engine, a hypothesis for the respective audio segment; accepting the hypothesis, which removes the need to submit the respective audio segment to another distinct speech-to-text transcription engine and results in the improved computer speed and accuracy of automatic speech transcription; and generating a transcript of the audio recording from the respective accepted hypotheses for the plurality of audio segments.
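
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the fixed-length segmentation, the ordered list of engines, the `confidence` score, and the acceptance threshold are all assumptions introduced for illustration, and every name (`Hypothesis`, `segment_audio`, `transcribe`) is hypothetical. The abstract does not state how an engine is selected or on what basis a hypothesis is accepted.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Hypothesis:
    text: str
    confidence: float  # assumed score in [0, 1]; not specified by the abstract


# Hypothetical engine interface: an audio segment in, a hypothesis out.
Engine = Callable[[List[float]], Hypothesis]


def segment_audio(samples: List[float], segment_len: int) -> List[List[float]]:
    """Split samples into fixed-length chunks (a stand-in for real segmentation)."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    return [samples[i:i + segment_len] for i in range(0, len(samples), segment_len)]


def transcribe(
    samples: List[float],
    engines: Dict[str, Engine],
    order: List[str],
    segment_len: int,
    threshold: float,
) -> Tuple[str, List[str]]:
    """Return the transcript and the name of the engine used for each segment."""
    if not order:
        raise ValueError("at least one engine is required")
    accepted_texts: List[str] = []
    engines_used: List[str] = []
    for segment in segment_audio(samples, segment_len):
        best_name = order[0]
        best = engines[best_name](segment)
        for name in order[1:]:
            if best.confidence >= threshold:
                break  # accepted: the segment is not sent to another engine
            candidate = engines[name](segment)
            if candidate.confidence > best.confidence:
                best, best_name = candidate, name
        accepted_texts.append(best.text)
        engines_used.append(best_name)
    return " ".join(accepted_texts), engines_used
```

Under these assumptions, the speed gain comes from the early exit: a segment whose first hypothesis clears the threshold is never submitted to the remaining engines.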

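IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L15/00	Speech recognition (G10L17/00 takes precedence)
G10L15/26	.Speech-to-text systems (G10L15/08 takes precedence)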