-
公开(公告)号:US20170185286A1
公开(公告)日:2017-06-29
申请号:US14982887
申请日:2015-12-29
Applicant: Google Inc.
Inventor: Francoise Beaufays , Yu Ouyang , David Rybach , Michael D. Riley , Lars Hellsten
IPC: G06F3/0488 , G06F3/01 , G06F3/041
CPC classification number: G06F3/04886 , G06F3/017 , G06F3/0237 , G06F3/0416 , G06F2203/04106
Abstract: Methods, systems, and apparatus for receiving data indicating a location of a particular touchpoint representing a latest received touchpoint in a sequence of received touchpoints; identifying candidate characters associated with the particular touchpoint; generating, for each of the candidate characters, a confidence score; identifying different candidate sequences of characters each including for each received touchpoint, one candidate character associated with a location of the received touchpoint, and one of the candidate characters associated with the particular touchpoint; for each different candidate sequence of characters, determining a language model score and generating a transcription score based at least on the confidence score for one or more of the candidate characters in the candidate sequence of characters and the language model score for the candidate sequence of characters; selecting, and providing for output, a representative sequence of characters from among the candidate sequences of characters based at least on the transcription scores.
-
公开(公告)号:US20180366112A1
公开(公告)日:2018-12-20
申请号:US15681801
申请日:2017-08-21
Applicant: Google Inc.
Inventor: Petar Aleksic , Michael D. Riley , Pedro J. Moreno Mengibar , Leonid Velikovich
IPC: G10L15/18 , G10L15/22 , G10L15/197 , G10L15/14
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for tagging during speech recognition. A word lattice that indicates probabilities for sequences of words in an utterance is obtained. A conditional probability transducer that indicates a frequency that sequences of both the words and semantic tags for the words appear is obtained. The word lattice and the conditional probability transducer are composed to construct a word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words. The word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words is used to generate a transcription that includes the words in the utterance and the semantic tags for the words.
-
公开(公告)号:US09093067B1
公开(公告)日:2015-07-28
申请号:US13685228
申请日:2012-11-26
Applicant: Google Inc.
Inventor: Martin Jansche , Michael D. Riley , Andrew M. Rosenberg , Terry Tai
IPC: G10L13/08 , G10L13/027
CPC classification number: G10L13/027 , G10L13/10
Abstract: The subject matter of this specification can be implemented in a computer-implemented method that includes receiving utterances and transcripts thereof. The method includes analyzing the utterances and transcripts to determine certain attributes, such as distances between prosodic contours for pairs of utterances. A model can be generated that can be used to estimate a distance between a determined prosodic contour for a received utterance and an unknown prosodic contour for a synthesized utterance when given a distance between attributes for text associated with the received utterance and the synthesized utterance.
Abstract translation: 本说明书的主题可以以计算机实现的方法来实现,该方法包括接收其话语和抄本。 该方法包括分析话语和抄本以确定某些属性,例如话语对的韵律轮廓之间的距离。 当给定与所接收的话语相关联的文本的属性与合成话语之间的距离时,可以生成可以用于估计用于接收到的话语的确定的韵律轮廓与合成话语的未知韵律轮廓之间的距离的模型。
-
-