-
公开(公告)号:US09646606B2
公开(公告)日:2017-05-09
申请号:US14048199
申请日:2013-10-08
Applicant: Google Inc.
Inventor: Fuchun Peng , Ben Shahshahani , Howard Scott Roy
IPC: G10L15/00 , G10L15/08 , G10L15/18 , G10L15/22 , G10L15/183
CPC classification number: G10L15/08 , G10L15/00 , G10L15/18 , G10L15/1815 , G10L15/183 , G10L15/22
Abstract: In some implementations, data that indicates multiple candidate transcriptions for an utterance is received. For each of the candidate transcriptions, data relating to use of the candidate transcription as a search query is received, a score that is based on the received data is provided to a trained classifier, and a classifier output for the candidate transcription is received. One or more of the candidate transcriptions may be selected based on the classifier outputs.
-
公开(公告)号:US20170308519A1
公开(公告)日:2017-10-26
申请号:US13922438
申请日:2013-06-20
Applicant: Google Inc.
Inventor: Fuchun Peng , Ben Shahshahani , Howard Scott Roy
IPC: G06F17/27
CPC classification number: G06F16/33
Abstract: A server accesses an initial query associated with a classification, the classification corresponding to a likely intent of the initial query. The server obtains a set of queries, wherein each query in the set of queries is identified as having resulted in one or more users selecting a resource that was selected by one or more users in response to submitting the initial query. The server then determines a metric for one or more queries in the set of queries, wherein the metric for each of the one or more queries in the set of queries is based on a similarity between the respective query and the initial query. Next, the server selects a subset of queries from the set of queries based on the metric for each selected query satisfying a threshold and associates the selected subset of queries with the classification of the initial query.
-
公开(公告)号:US09224385B1
公开(公告)日:2015-12-29
申请号:US13919170
申请日:2013-06-17
Applicant: Google Inc.
Inventor: Matthew Sharifi , Ben Shahshahani , Dominik Roblek
CPC classification number: G10L25/51 , G10H2210/046 , G10H2220/011 , G10H2240/141 , G10L15/26 , G10L21/10
Abstract: Methods, systems, and computer programs are presented for unified recognition of speech and music. One method includes an operation for starting an audio recognition mode by a computing device while receiving an audio stream. Segments of the audio stream are analyzed as the audio stream is received, where the analysis includes simultaneous checking for speech and music. Further, the method includes an operation for determining a first confidence score for speech and a second confidence score for music. As the audio stream is received, additional segments are analyzed until the end of the audio stream or until the first and second confidence scores indicate that the audio stream has been identified as speech or music. Further, results are presented on a display based on the identification of the audio stream, including text entered if the audio stream was speech or song information if the audio stream was music.
Abstract translation: 提出方法,系统和计算机程序,用于统一识别语音和音乐。 一种方法包括在接收音频流的同时由计算设备启动音频识别模式的操作。 当接收到音频流时,分析音频流的分段,其中分析包括语音和音乐的同时检查。 此外,该方法包括用于确定用于语音的第一可信度得分和用于音乐的第二可信度得分的操作。 当音频流被接收时,分析附加段直到音频流的结束,或者直到第一和第二置信度得分指示音频流已经被识别为语音或音乐。 此外,如果音频流是音乐,则在显示器上显示结果,该显示器基于音频流的标识,包括输入的文本,如果音频流是语音或歌曲信息。
-
公开(公告)号:US20150012271A1
公开(公告)日:2015-01-08
申请号:US14048199
申请日:2013-10-08
Applicant: Google Inc.
Inventor: Fuchun Peng , Ben Shahshahani , Howard Scott Roy
IPC: G10L15/26
CPC classification number: G10L15/08 , G10L15/00 , G10L15/18 , G10L15/1815 , G10L15/183 , G10L15/22
Abstract: In some implementations, data that indicates multiple candidate transcriptions for an utterance is received. For each of the candidate transcriptions, data relating to use of the candidate transcription as a search query is received, a score that is based on the received data is provided to a trained classifier, and a classifier output for the candidate transcription is received. One or more of the candidate transcriptions may be selected based on the classifier outputs.
Abstract translation: 在一些实现中,接收指示用于话语的多个候选转录的数据。 对于每个候选转录,接收与使用候选转录作为搜索查询相关的数据,将基于接收到的数据的分数提供给训练有素的分类器,并且接收候选转录的分类器输出。 候选转录中的一个或多个可以基于分类器输出来选择。
-
-
-