-
1.
公开(公告)号:US09324323B1
公开(公告)日:2016-04-26
申请号:US13715139
申请日:2012-12-14
Applicant: Google Inc.
Inventor: Daniel M. Bikel , Kapil R. Thadini , Fernando Pereira , Maria Shugrina , Fadi Biadsy
IPC: G10L15/26 , G10L15/183 , G10L15/197
CPC classification number: G10L15/183 , G10L15/197
Abstract: Speech recognition techniques may include: receiving audio; identifying one or more topics associated with audio; identifying language models in a topic space that correspond to the one or more topics, where the language models are identified based on proximity of a representation of the audio to representations of other audio in the topic space; using the language models to generate recognition candidates for the audio, where the recognition candidates have scores associated therewith that are indicative of a likelihood of a recognition candidate matching the audio; and selecting a recognition candidate for the audio based on the scores.
Abstract translation: 语音识别技术可以包括:接收音频; 识别与音频相关联的一个或多个主题; 识别对应于所述一个或多个主题的主题空间中的语言模型,其中基于所述音频的表示与所述主题空间中的其他音频的表示的接近度来识别所述语言模型; 使用语言模型来生成用于音频的识别候选,其中识别候选具有与之相关联的分数,其指示与音频匹配的识别候选者的可能性; 以及基于分数来选择音频的识别候选。