-
公开(公告)号:US09508341B1
公开(公告)日:2016-11-29
申请号:US14476075
申请日:2014-09-03
Applicant: Amazon Technologies, Inc.
CPC classification number: G10L15/18 , G10L13/00 , G10L15/187
Abstract: Features are disclosed for active learning to identify the words which are likely to improve the guessing and automatic speech recognition (ASR) after manual annotation. When a speech recognition system needs pronunciations for words, a lexicon is typically used. For unknown words, pronunciation-guessing (G2P) may be included to provide pronunciations in an unattended (e.g., automatic) fashion. However, having manually (e.g., by a human) annotated pronunciations provides better ASR than having automatic pronunciations that may, in some instances, be wrong. The included active learning features help to direct these limited annotation resources.
Abstract translation: 公开了用于主动学习的特征以在手动注释之后识别可能改善猜测和自动语音识别(ASR)的单词。 当语音识别系统需要发音时,通常使用词典。 对于未知单词,可以包括发音猜测(G2P),以无人值守(例如,自动)的方式提供发音。 然而,手动(例如,由人类)注释的发音提供比具有在某些情况下是错误的自动发音更好的ASR。 包括的主动学习功能有助于指导这些有限的注释资源。