Patent search: ap:("Tencent Technology (Shenzhen) Company Limited") AND inv:"Jianxiong Ma" (page 1)

1.

Granted invention patent
Keyword detection for speech recognition (legal status: in force)

Publication No.: US09230541B2

Publication date: 2016-01-05

Application No.: US14567969

Filing date: 2014-12-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen

IPC classification: G10L15/08

CPC classification: G10L15/08, G10L15/083, G10L2015/088

Abstract: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

2.

Invention patent application
Keyword Detection For Speech Recognition (legal status: in force)

Publication No.: US20150095032A1

Publication date: 2015-04-02

Application No.: US14567969

Filing date: 2014-12-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen

IPC classification: G10L15/08

CPC classification: G10L15/08, G10L15/083, G10L2015/088

Abstract: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

3.

Granted invention patent
Method and computer system for performing audio search on a social networking platform (legal status: in force)

Publication No.: US09818432B2

Publication date: 2017-11-14

Application No.: US15176047

Filing date: 2016-06-07

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Jianxiong Ma, Li Lu

IPC classification: G10L15/26, G10L25/54, G06F17/30, G10L15/14, G10L21/10, G10L15/02, G10L15/08

CPC classification: G10L25/54, G06F17/30026, G10L15/14, G10L21/10, G10L2015/027, G10L2015/088

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. The method includes: while running a social networking application, receiving a first audio input from a user of the computer system, the first audio input including one or more search keywords; generating a first audio confusion network from the first audio input; determining whether the first audio confusion network matches at least one of one or more second audio confusion networks, wherein a respective second audio confusion network was generated from a corresponding second audio input associated with a chat session of which the user is a participant; and identifying a second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network, wherein the identified second audio input includes the one or more search keywords that are included in the first audio input.

4.

Granted invention patent
Systems and methods for speech recognition (legal status: in force)

Publication No.: US09558741B2

Publication date: 2017-01-31

Application No.: US14291138

Filing date: 2014-05-30

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu

IPC classification: G10L15/28, G10L15/08, G10L15/18, G10L15/183

CPC classification: G10L15/083, G10L15/1815, G10L15/183

Abstract: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.

5.

Granted invention patent
Language recognition based on vocabulary lists (legal status: in force)

Publication No.: US09336197B2

Publication date: 2016-05-10

Application No.: US14108224

Filing date: 2013-12-16

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen

IPC classification: G06F17/28, G06F17/27

CPC classification: G06F17/2735

Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

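Editorial illustration (not part of the patent record): the vocabulary-list procedure summarized in the abstract can be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the claimed implementation. The function names, the word-level tokenization, and the `min_ratio` threshold are hypothetical; the abstract only says the frequency must meet "predetermined occurrence criteria" and does not specify them.

```python
import re
from collections import Counter


def build_lists(first_vocab, second_vocab):
    """Merge two vocabulary lists and isolate the words used only in the first language."""
    first, second = set(first_vocab), set(second_vocab)
    comprehensive = first | second   # comprehensive vocabulary list
    first_only = first - second      # first vocabulary sub-list
    return comprehensive, first_only


def is_first_language(text, first_vocab, second_vocab, min_ratio=0.2):
    """Return True if expressions exclusive to the first language are frequent enough.

    Assumption: the occurrence criterion is a simple ratio of exclusive-word
    occurrences to all recognized-word occurrences (hypothetical choice).
    """
    comprehensive, first_only = build_lists(first_vocab, second_vocab)
    # Assumption: the text can be split into words with a simple regex.
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(t for t in tokens if t in comprehensive)
    total = sum(counts.values())
    if total == 0:
        return False  # no known expressions, so no decision for the first language
    exclusive = sum(n for word, n in counts.items() if word in first_only)
    return exclusive / total >= min_ratio
```

With two closely related vocabularies such as British and American English word lists, a text containing "lorry" and "colour" would count those as exclusive expressions, while shared words such as "the" only add to the total.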

6.

Granted invention patent
Method and computer system for performing audio search on a social networking platform (legal status: in force)

Publication No.: US10453477B2

Publication date: 2019-10-22

Application No.: US15728464

Filing date: 2017-10-09

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Jianxiong Ma, Li Lu

IPC classification: G10L15/08, G10L25/54, G06F16/432, G10L15/14, G10L21/10, G10L15/02

Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. While running a social networking application, a computer system receives a first audio input from a user of the computer system and then generates a first audio confusion network from the first audio input. After comparing the first audio confusion network with one or more second audio confusion networks, each corresponding to a second audio input associated with one of a plurality of participants of a chat session of the social networking application, the computer system identifies at least one second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network and displays a portion of the chat session including a visual icon representing the identified second audio input on a display of the computer system.

7.

Granted invention patent
Method and apparatus for performing speech keyword retrieval (legal status: in force)

Publication No.: US09355637B2

Publication date: 2016-05-31

Application No.: US14620000

Filing date: 2015-02-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong

IPC classification: G10L15/18, G10L15/28, G10L15/08, G10L15/32

CPC classification: G10L15/18, G10L15/08, G10L15/28, G10L15/32, G10L2015/088

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

8.

Granted invention patent
Method and apparatus for performing speech keyword retrieval (legal status: in force)

Publication No.: US09257118B2

Publication date: 2016-02-09

Application No.: US14620000

Filing date: 2015-02-11

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong

IPC classification: G10L15/18, G10L15/28, G10L15/08, G10L15/32

Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

9.

Invention patent application
Systems and Methods for Voice Identification (legal status: in force)

Publication No.: US20140350934A1

Publication date: 2014-11-27

Application No.: US14291138

Filing date: 2014-05-30

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu

IPC classification: G10L17/22

CPC classification: G10L15/083, G10L15/1815, G10L15/183

Abstract: Systems and methods are provided for voice identification. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as an identification result.

10.

Invention patent application
LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS (legal status: in force)

Publication No.: US20140207440A1

Publication date: 2014-07-24

Application No.: US14108224

Filing date: 2013-12-16

Applicant: Tencent Technology (Shenzhen) Company Limited

Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen

IPC classification: G06F17/28

CPC classification: G06F17/2735

Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

