-
Publication No.: US11211065B2
Publication Date: 2021-12-28
Application No.: US16265148
Filing Date: 2019-02-01
Inventor: Tejas Godambe, Aravind Ganapathiraju
Abstract: A system and method are presented for the automatic filtering of test utterance mismatches in automatic speech recognition (ASR) systems. Test data are evaluated for match between audio and text in a language-independent manner. Utterances having mismatch are identified and isolated for either removal or manual verification to prevent incorrect measurements of the ASR system performance. In an embodiment, contiguous stretches of low probabilities in every utterance are searched for and removed. Such segments may be intra-word or cross-word. In another embodiment, scores may be determined using log DNN probability for every word in each utterance. Words may be sorted in the order of the scores and those utterances containing the least word scores are removed.
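The first embodiment in the abstract (searching each utterance for contiguous stretches of low probabilities) can be illustrated with a minimal sketch. It assumes that a forced alignment has already produced one log-probability per frame for the reference transcript. The function names, the threshold, and the minimum run length are hypothetical illustration values, not taken from the patent, and the sketch flags an utterance rather than deciding between removal and manual verification.

```python
import math
from typing import List, Tuple


def low_probability_runs(
    frame_log_probs: List[float],
    threshold: float,
    min_frames: int,
) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges, end exclusive, in which every frame
    scores below `threshold` and the run spans at least `min_frames` frames.

    A run may fall inside one word or straddle a word boundary; this sketch
    does not look at word boundaries at all.
    """
    runs = []
    start = None  # index where the current low-probability run began
    for i, log_prob in enumerate(frame_log_probs):
        if log_prob < threshold:
            if start is None:
                start = i
        else:
            if start is not None and i - start >= min_frames:
                runs.append((start, i))
            start = None
    # Close a run that extends to the last frame.
    if start is not None and len(frame_log_probs) - start >= min_frames:
        runs.append((start, len(frame_log_probs)))
    return runs


def is_suspect(
    frame_log_probs: List[float],
    threshold: float = math.log(0.05),  # assumed cut-off, for illustration only
    min_frames: int = 5,                # assumed minimum run length
) -> bool:
    """Flag an utterance that contains at least one long low-probability run."""
    return bool(low_probability_runs(frame_log_probs, threshold, min_frames))


# Usage: six consecutive poorly matched frames inside an otherwise good utterance.
good = [math.log(0.8)] * 5
bad = [math.log(0.01)] * 6
print(low_probability_runs(good + bad + good, math.log(0.05), 5))
print(is_suspect(good + bad + good))
```

Requiring a minimum run length keeps isolated low-scoring frames, which occur even in correctly transcribed audio, from flagging an utterance on their own.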
-
2.
Publication No.: US20190244611A1
Publication Date: 2019-08-08
Application No.: US16265148
Filing Date: 2019-02-01
Inventor: Tejas Godambe, Aravind Ganapathiraju
IPC: G10L15/22, G10L15/16, G10L15/14, G10L15/02, G10L15/197
CPC classification number: G10L15/22, G10L15/02, G10L15/14, G10L15/16, G10L15/197, G10L2015/025
Abstract: A system and method are presented for the automatic filtering of test utterance mismatches in automatic speech recognition (ASR) systems. Test data are evaluated for match between audio and text in a language-independent manner. Utterances having mismatch are identified and isolated for either removal or manual verification to prevent incorrect measurements of the ASR system performance. In an embodiment, contiguous stretches of low probabilities in every utterance are searched for and removed. Such segments may be intra-word or cross-word. In another embodiment, scores may be determined using log DNN probability for every word in each utterance. Words may be sorted in the order of the scores and those utterances containing the least word scores are removed.
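The second embodiment in the abstract (scoring every word with log DNN probabilities, sorting by score, and removing the utterances that contain the lowest-scoring words) can be sketched as follows. Several details are assumptions made for illustration and are not stated in the abstract: a word's score is taken as the mean log-probability of the frames aligned to it, each utterance is ranked by its worst word, and a fixed fraction `drop_fraction` of utterances is set aside. All names are hypothetical.

```python
from typing import Dict, List, Tuple

# One utterance = a list of words; one word = the log DNN probabilities
# of the frames aligned to that word.
Utterance = List[List[float]]


def word_score(frame_log_probs: List[float]) -> float:
    """Mean frame log-probability of one word (assumed scoring rule)."""
    return sum(frame_log_probs) / len(frame_log_probs)


def filter_utterances(
    utterances: Dict[str, Utterance],
    drop_fraction: float,
) -> Tuple[List[str], List[str]]:
    """Split utterance ids into (kept, flagged).

    Utterances are ranked by their lowest word score, and the lowest-ranked
    `drop_fraction` of them is flagged for removal or manual verification.
    """
    worst_word = {
        utt_id: min(word_score(word) for word in words)
        for utt_id, words in utterances.items()
    }
    ranked = sorted(worst_word, key=worst_word.get)  # lowest score first
    n_flagged = int(len(ranked) * drop_fraction)
    return ranked[n_flagged:], ranked[:n_flagged]


# Usage: utterance "b" contains one word that the audio does not support.
test_set = {
    "a": [[-0.1, -0.2], [-0.3]],
    "b": [[-0.1], [-5.0, -6.0]],
    "c": [[-0.2], [-0.4]],
    "d": [[-0.5]],
}
kept, flagged = filter_utterances(test_set, drop_fraction=0.25)
print(flagged)
```

Ranking by the worst word rather than by the utterance average means a single badly mismatched word is enough to surface an utterance, even when the rest of the transcript matches the audio well.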
-