COMPUTER SYSTEMS EXHIBITING IMPROVED COMPUTER SPEED AND TRANSCRIPTION ACCURACY OF AUTOMATIC SPEECH TRANSCRIPTION (AST) BASED ON A MULTIPLE SPEECH-TO-TEXT ENGINES AND METHODS OF USE THEREOF
摘要:
In some embodiments, an exemplary inventive system for improving computer speed and accuracy of automatic speech transcription includes at least components of: a computer processor configured to perform: generating a recognition model specification for a plurality of distinct speech-to-text transcription engines; where each distinct speech-to-text transcription engine corresponds to a respective distinct speech recognition model; receiving at least one audio recording representing a speech of a person; segmenting the audio recording into a plurality of audio segments; determining a respective distinct speech-to-text transcription engine to transcribe a respective audio segment; receiving, from the respective transcription engine, a hypothesis for the respective audio segment; accepting the hypothesis to remove a need to submit the respective audio segment to another distinct speech-to-text transcription engine, resulting in the improved computer speed and the accuracy of automatic speech transcription and generating a transcript of the audio recording from respective accepted hypotheses for the plurality of audio segments.
信息查询
0/0