TOKEN CONFIDENCE SCORES FOR AUTOMATIC SPEECH RECOGNITION

    公开(公告)号:US20230245649A1

    公开(公告)日:2023-08-03

    申请号:US17649810

    申请日:2022-02-03

    CPC classification number: G10L15/1815 G10L15/02 G10L15/26 G10L2015/025

    Abstract: Methods and systems for correction of a likely erroneous word in a speech transcription are disclosed. By evaluating token confidence scores of individual words or phrases, the automatic speech recognition system can replace a low-confidence score word with a substitute word or phrase. Among various approaches, neural network models can be used to generate individual confidence scores. Such word substitution can enable the speech recognition system to automatically detect and correct likely errors in transcription. Furthermore, the system can indicate the token confidence scores on a graphic user interface for labeling and dictionary enhancement.

Patent Agency Ranking