Verification of mappings between phoneme sequences and words

    公开(公告)号:US09837070B2

    公开(公告)日:2017-12-05

    申请号:US14186400

    申请日:2014-02-21

    Applicant: Google Inc.

    CPC classification number: G10L15/063 G10L15/26

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying pronunciations. In one aspect, a method includes obtaining a first transcription for an utterance. A second transcription for the utterance is obtained. The second transcription is different from the first transcription. One or more feature scores are determined based on the first transcription and the second transcription. The one or more feature scores are input to a trained classifier. An output of the classifier is received. The output indicates which of the first transcription and the second transcription is more likely to be a correct transcription of the utterance.

    IDENTIFYING SUBSTITUTE PRONUNCIATIONS
    2.
    发明申请
    IDENTIFYING SUBSTITUTE PRONUNCIATIONS 有权
    识别替代宣传

    公开(公告)号:US20150170642A1

    公开(公告)日:2015-06-18

    申请号:US14109316

    申请日:2013-12-17

    Applicant: Google Inc.

    CPC classification number: G10L15/187 G10L15/005 G10L2015/025 G10L2015/227

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including selecting terms; obtaining an expected phonetic transcription of an idealized native speaker of a natural language speaking the terms; receiving audio data corresponding to a particular user speaking the terms in the natural language; obtaining, based on the audio data, an actual phonetic transcription of the particular user speaking the terms in the natural language; aligning the expected phonetic transcription of the idealized native speaker of the natural language with the actual phonetic transcription of the particular user; identifying, based on the aligning, a portion of the expected phonetic transcription that is different than a corresponding portion of the actual phonetic transcription; and based on identifying the portion of the expected phonetic transcription, designating the expected phonetic transcription as a substitute pronunciation for the corresponding portion of the actual phonetic transcription.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,包括选择术语; 用语言来获得理想化的母语者自然语言的预期语音转录; 接收对应于以自然语言表达术语的特定用户的音频数据; 基于音频数据获得以自然语言表达术语的特定用户的实际语音转录; 将理想化的自然语言的母语者的预期语音转录与特定用户的实际语音转录对齐; 基于对齐来识别不同于实际语音转录的相应部分的预期语音转录的一部分; 并且基于识别预期语音转录的部分,将预期的语音转录指定为实际语音转录的相应部分的替代发音。

    SPEECH RECOGNITION USING DOMAIN KNOWLEDGE
    3.
    发明申请
    SPEECH RECOGNITION USING DOMAIN KNOWLEDGE 有权
    使用领域知识的语音识别

    公开(公告)号:US20150012271A1

    公开(公告)日:2015-01-08

    申请号:US14048199

    申请日:2013-10-08

    Applicant: Google Inc.

    Abstract: In some implementations, data that indicates multiple candidate transcriptions for an utterance is received. For each of the candidate transcriptions, data relating to use of the candidate transcription as a search query is received, a score that is based on the received data is provided to a trained classifier, and a classifier output for the candidate transcription is received. One or more of the candidate transcriptions may be selected based on the classifier outputs.

    Abstract translation: 在一些实现中,接收指示用于话语的多个候选转录的数据。 对于每个候选转录,接收与使用候选转录作为搜索查询相关的数据,将基于接收到的数据的分数提供给训练有素的分类器,并且接收候选转录的分类器输出。 候选转录中的一个或多个可以基于分类器输出来选择。

    DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING
    5.
    发明申请
    DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING 有权
    数据驱动公开学习与CROWD采购

    公开(公告)号:US20150006178A1

    公开(公告)日:2015-01-01

    申请号:US13930495

    申请日:2013-06-28

    Applicant: Google Inc.

    CPC classification number: G10L15/18 G09B17/006 G10L13/08 G10L15/06

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于确定特定术语的发音。 方法,系统和装置包括获得与特定术语相对应的语音样本的动作,并获得特定术语的候选发音。 进一步的动作包括针对特定术语的每个候选发音和对应于特定术语的语音样本生成反映候选发音和音频样本之间的相似程度的分数。 附加动作包括聚合每个候选发音的分数,并且基于候选发音的聚合分数,将特定术语的一个或多个候选发音添加到发音词典。

    Personalized Speech Synthesis for Voice Actions
    7.
    发明申请
    Personalized Speech Synthesis for Voice Actions 审中-公开
    语音操作的个性化语音综合

    公开(公告)号:US20160307569A1

    公开(公告)日:2016-10-20

    申请号:US14686670

    申请日:2015-04-14

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining a template that defines (i) trigger criteria for presenting a notification type and (ii) content rules for determining content to include in a notification of the notification type. Additional actions include accessing enterprise resources of an enterprise, the enterprise resources including data describing entities related to the enterprise and relationships among the entities. Further actions include, accessing user information specific to a user and determining that the trigger criteria is satisfied by the enterprise resources and the user information. Additional actions include generating a particular notification of the notification type based at least on the content rules and providing the particular notification to the user.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于在企业系统中呈现通知。 一方面,一种方法包括获取定义(i)触发呈现通知类型的标准的模板的动作,以及(ii)用于确定内容的内容规则以包括在通知类型的通知中。 其他行动包括访问企业的企业资源,企业资源,包括描述与企业相关的实体的数据和实体之间的关系。 进一步的操作包括访问用户特有的用户信息,并确定触发条件由企业资源和用户信息来满足。 附加动作包括至少基于内容规则生成通知类型的特定通知,并向用户提供特定通知。

    PRONUNCIATION VERIFICATION
    8.
    发明申请
    PRONUNCIATION VERIFICATION 有权
    授权验证

    公开(公告)号:US20150161985A1

    公开(公告)日:2015-06-11

    申请号:US14186400

    申请日:2014-02-21

    Applicant: Google Inc.

    CPC classification number: G10L15/063 G10L15/26

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying pronunciations. In one aspect, a method includes obtaining a first transcription for an utterance. A second transcription for the utterance is obtained. The second transcription is different from the first transcription. One or more feature scores are determined based on the first transcription and the second transcription. The one or more feature scores are input to a trained classifier. An output of the classifier is received. The output indicates which of the first transcription and the second transcription is more likely to be a correct transcription of the utterance.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的用于验证发音的计算机程序。 一方面,一种方法包括获得用于发音的第一转录。 获得了用于说话的第二个转录。 第二次转录与第一次转录不同。 基于第一次转录和第二次转录确定一个或多个特征得分。 将一个或多个特征得分输入到训练有素的分类器。 接收分类器的输出。 输出表明第一次转录和第二次转录中哪一个更可能是正确的发音转录。

    LEARNING SEMANTIC PARSING
    9.
    发明申请

    公开(公告)号:US20170308519A1

    公开(公告)日:2017-10-26

    申请号:US13922438

    申请日:2013-06-20

    Applicant: Google Inc.

    CPC classification number: G06F16/33

    Abstract: A server accesses an initial query associated with a classification, the classification corresponding to a likely intent of the initial query. The server obtains a set of queries, wherein each query in the set of queries is identified as having resulted in one or more users selecting a resource that was selected by one or more users in response to submitting the initial query. The server then determines a metric for one or more queries in the set of queries, wherein the metric for each of the one or more queries in the set of queries is based on a similarity between the respective query and the initial query. Next, the server selects a subset of queries from the set of queries based on the metric for each selected query satisfying a threshold and associates the selected subset of queries with the classification of the initial query.

    Identifying substitute pronunciations

    公开(公告)号:US09747897B2

    公开(公告)日:2017-08-29

    申请号:US14109316

    申请日:2013-12-17

    Applicant: Google Inc.

    CPC classification number: G10L15/187 G10L15/005 G10L2015/025 G10L2015/227

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including selecting terms; obtaining an expected phonetic transcription of an idealized native speaker of a natural language speaking the terms; receiving audio data corresponding to a particular user speaking the terms in the natural language; obtaining, based on the audio data, an actual phonetic transcription of the particular user speaking the terms in the natural language; aligning the expected phonetic transcription of the idealized native speaker of the natural language with the actual phonetic transcription of the particular user; identifying, based on the aligning, a portion of the expected phonetic transcription that is different than a corresponding portion of the actual phonetic transcription; and based on identifying the portion of the expected phonetic transcription, designating the expected phonetic transcription as a substitute pronunciation for the corresponding portion of the actual phonetic transcription.

Patent Agency Ranking