Patent search ap:("Google Inc.") AND inv:"Fuchun Peng" Page 1

1.

发明授权
Verification of mappings between phoneme sequences and words 有权

公开(公告)号：US09837070B2

公开(公告)日：2017-12-05

申请号：US14186400

申请日：2014-02-21

Applicant: Google Inc.

Inventor： Fuchun Peng , Kanury Kanishka Rao , Francoise Beaufays

IPC: G10L15/06 , G10L15/26

CPC classification number: G10L15/063 , G10L15/26

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying pronunciations. In one aspect, a method includes obtaining a first transcription for an utterance. A second transcription for the utterance is obtained. The second transcription is different from the first transcription. One or more feature scores are determined based on the first transcription and the second transcription. The one or more feature scores are input to a trained classifier. An output of the classifier is received. The output indicates which of the first transcription and the second transcription is more likely to be a correct transcription of the utterance.

2.

发明申请
IDENTIFYING SUBSTITUTE PRONUNCIATIONS 有权
Title translation: 识别替代宣传

公开(公告)号：US20150170642A1

公开(公告)日：2015-06-18

申请号：US14109316

申请日：2013-12-17

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Pedro J. Moreno Mengibar , Brian Patrick Strope

IPC: G10L15/187 , G10L15/26

CPC classification number: G10L15/187 , G10L15/005 , G10L2015/025 , G10L2015/227

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including selecting terms; obtaining an expected phonetic transcription of an idealized native speaker of a natural language speaking the terms; receiving audio data corresponding to a particular user speaking the terms in the natural language; obtaining, based on the audio data, an actual phonetic transcription of the particular user speaking the terms in the natural language; aligning the expected phonetic transcription of the idealized native speaker of the natural language with the actual phonetic transcription of the particular user; identifying, based on the aligning, a portion of the expected phonetic transcription that is different than a corresponding portion of the actual phonetic transcription; and based on identifying the portion of the expected phonetic transcription, designating the expected phonetic transcription as a substitute pronunciation for the corresponding portion of the actual phonetic transcription.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，包括选择术语; 用语言来获得理想化的母语者自然语言的预期语音转录; 接收对应于以自然语言表达术语的特定用户的音频数据; 基于音频数据获得以自然语言表达术语的特定用户的实际语音转录; 将理想化的自然语言的母语者的预期语音转录与特定用户的实际语音转录对齐; 基于对齐来识别不同于实际语音转录的相应部分的预期语音转录的一部分; 并且基于识别预期语音转录的部分，将预期的语音转录指定为实际语音转录的相应部分的替代发音。

3.

发明申请
SPEECH RECOGNITION USING DOMAIN KNOWLEDGE 有权
Title translation: 使用领域知识的语音识别

公开(公告)号：US20150012271A1

公开(公告)日：2015-01-08

申请号：US14048199

申请日：2013-10-08

Applicant: Google Inc.

Inventor： Fuchun Peng , Ben Shahshahani , Howard Scott Roy

IPC: G10L15/26

CPC classification number: G10L15/08 , G10L15/00 , G10L15/18 , G10L15/1815 , G10L15/183 , G10L15/22

Abstract: In some implementations, data that indicates multiple candidate transcriptions for an utterance is received. For each of the candidate transcriptions, data relating to use of the candidate transcription as a search query is received, a score that is based on the received data is provided to a trained classifier, and a classifier output for the candidate transcription is received. One or more of the candidate transcriptions may be selected based on the classifier outputs.

Abstract translation: 在一些实现中，接收指示用于话语的多个候选转录的数据。对于每个候选转录，接收与使用候选转录作为搜索查询相关的数据，将基于接收到的数据的分数提供给训练有素的分类器，并且接收候选转录的分类器输出。候选转录中的一个或多个可以基于分类器输出来选择。

4.

发明授权
Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores 有权

公开(公告)号：US09741339B2

公开(公告)日：2017-08-22

申请号：US13930495

申请日：2013-06-28

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman

IPC: G10L15/00 , G09B5/00 , G10L15/14 , G10L15/18 , G10L13/08 , G10L15/06 , G09B17/00

CPC classification number: G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample, wherein the said score for the particular term is obtained by using a minimum of individual scores of phonemes comprising the term. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

5.

发明申请
DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING 有权
Title translation: 数据驱动公开学习与CROWD采购

公开(公告)号：US20150006178A1

公开(公告)日：2015-01-01

申请号：US13930495

申请日：2013-06-28

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman

IPC: G10L15/18

CPC classification number: G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于确定特定术语的发音。方法，系统和装置包括获得与特定术语相对应的语音样本的动作，并获得特定术语的候选发音。进一步的动作包括针对特定术语的每个候选发音和对应于特定术语的语音样本生成反映候选发音和音频样本之间的相似程度的分数。附加动作包括聚合每个候选发音的分数，并且基于候选发音的聚合分数，将特定术语的一个或多个候选发音添加到发音词典。

6.

发明授权
Speech recognition using domain knowledge 有权

公开(公告)号：US09646606B2

公开(公告)日：2017-05-09

申请号：US14048199

申请日：2013-10-08

Applicant: Google Inc.

Inventor： Fuchun Peng , Ben Shahshahani , Howard Scott Roy

IPC: G10L15/00 , G10L15/08 , G10L15/18 , G10L15/22 , G10L15/183

CPC classification number: G10L15/08 , G10L15/00 , G10L15/18 , G10L15/1815 , G10L15/183 , G10L15/22

Abstract: In some implementations, data that indicates multiple candidate transcriptions for an utterance is received. For each of the candidate transcriptions, data relating to use of the candidate transcription as a search query is received, a score that is based on the received data is provided to a trained classifier, and a classifier output for the candidate transcription is received. One or more of the candidate transcriptions may be selected based on the classifier outputs.

7.

发明申请
Personalized Speech Synthesis for Voice Actions 审中-公开
Title translation: 语音操作的个性化语音综合

公开(公告)号：US20160307569A1

公开(公告)日：2016-10-20

申请号：US14686670

申请日：2015-04-14

Applicant: Google Inc.

Inventor： Fuchun Peng , Jakob Nicolaus Foerster , Diego Melendo Casado , Fei Huang , Francoise Beaufays

IPC: G10L15/22 , G10L15/26

CPC classification number: G10L15/22 , G10L13/033 , G10L15/07 , G10L15/187 , G10L15/26 , G10L2015/221 , G10L2015/225

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining a template that defines (i) trigger criteria for presenting a notification type and (ii) content rules for determining content to include in a notification of the notification type. Additional actions include accessing enterprise resources of an enterprise, the enterprise resources including data describing entities related to the enterprise and relationships among the entities. Further actions include, accessing user information specific to a user and determining that the trigger criteria is satisfied by the enterprise resources and the user information. Additional actions include generating a particular notification of the notification type based at least on the content rules and providing the particular notification to the user.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于在企业系统中呈现通知。一方面，一种方法包括获取定义（i）触发呈现通知类型的标准的模板的动作，以及（ii）用于确定内容的内容规则以包括在通知类型的通知中。其他行动包括访问企业的企业资源，企业资源，包括描述与企业相关的实体的数据和实体之间的关系。进一步的操作包括访问用户特有的用户信息，并确定触发条件由企业资源和用户信息来满足。附加动作包括至少基于内容规则生成通知类型的特定通知，并向用户提供特定通知。

8.

发明申请
PRONUNCIATION VERIFICATION 有权
Title translation: 授权验证

公开(公告)号：US20150161985A1

公开(公告)日：2015-06-11

申请号：US14186400

申请日：2014-02-21

Applicant: Google Inc.

Inventor： Fuchun Peng , Kanury Kanishka Rao , Francoise Beaufays

IPC: G10L15/06 , G10L15/18 , G10L15/26

CPC classification number: G10L15/063 , G10L15/26

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying pronunciations. In one aspect, a method includes obtaining a first transcription for an utterance. A second transcription for the utterance is obtained. The second transcription is different from the first transcription. One or more feature scores are determined based on the first transcription and the second transcription. The one or more feature scores are input to a trained classifier. An output of the classifier is received. The output indicates which of the first transcription and the second transcription is more likely to be a correct transcription of the utterance.

Abstract translation: 方法，系统和装置，包括在计算机存储介质上编码的用于验证发音的计算机程序。一方面，一种方法包括获得用于发音的第一转录。获得了用于说话的第二个转录。第二次转录与第一次转录不同。基于第一次转录和第二次转录确定一个或多个特征得分。将一个或多个特征得分输入到训练有素的分类器。接收分类器的输出。输出表明第一次转录和第二次转录中哪一个更可能是正确的发音转录。

9.

发明申请
LEARNING SEMANTIC PARSING 审中-公开

公开(公告)号：US20170308519A1

公开(公告)日：2017-10-26

申请号：US13922438

申请日：2013-06-20

Applicant: Google Inc.

Inventor： Fuchun Peng , Ben Shahshahani , Howard Scott Roy

IPC: G06F17/27

CPC classification number: G06F16/33

Abstract: A server accesses an initial query associated with a classification, the classification corresponding to a likely intent of the initial query. The server obtains a set of queries, wherein each query in the set of queries is identified as having resulted in one or more users selecting a resource that was selected by one or more users in response to submitting the initial query. The server then determines a metric for one or more queries in the set of queries, wherein the metric for each of the one or more queries in the set of queries is based on a similarity between the respective query and the initial query. Next, the server selects a subset of queries from the set of queries based on the metric for each selected query satisfying a threshold and associates the selected subset of queries with the classification of the initial query.

10.

发明授权
Identifying substitute pronunciations 有权

公开(公告)号：US09747897B2

公开(公告)日：2017-08-29

申请号：US14109316

申请日：2013-12-17

Applicant: Google Inc.

Inventor： Fuchun Peng , Francoise Beaufays , Pedro J. Moreno Mengibar , Brian Patrick Strope

IPC: G10L15/28 , G10L15/187 , G10L15/00 , G10L15/22 , G10L15/02

CPC classification number: G10L15/187 , G10L15/005 , G10L2015/025 , G10L2015/227

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including selecting terms; obtaining an expected phonetic transcription of an idealized native speaker of a natural language speaking the terms; receiving audio data corresponding to a particular user speaking the terms in the natural language; obtaining, based on the audio data, an actual phonetic transcription of the particular user speaking the terms in the natural language; aligning the expected phonetic transcription of the idealized native speaker of the natural language with the actual phonetic transcription of the particular user; identifying, based on the aligning, a portion of the expected phonetic transcription that is different than a corresponding portion of the actual phonetic transcription; and based on identifying the portion of the expected phonetic transcription, designating the expected phonetic transcription as a substitute pronunciation for the corresponding portion of the actual phonetic transcription.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification