-
Publication No.: US09230541B2
Publication date: 2016-01-05
Application No.: US14567969
Application date: 2014-12-11
Inventors: Lu Li, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen
IPC classification: G10L15/08
CPC classification: G10L15/08, G10L15/083, G10L2015/088
Abstract: This application discloses a method of recognizing a keyword in speech that includes a sequence of audio frames, the sequence including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.
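The penalty step described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the abstract does not say how the penalty factor is combined with the confidence score, so the sketch assumes log-domain scores with an additive penalty. The `Word` type, the `PENALTY` table, its values, and the acceptance threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Word:
    text: str
    language: str  # hypothetical language tag, e.g. "zh" or "en"
    score: float   # hypothetical log-domain score from the decoding network


# Hypothetical penalty table keyed by (language of candidate, language of option).
PENALTY = {("zh", "en"): -2.0, ("en", "zh"): -2.0}


def update_confidence(confidence, candidate, option, penalties=PENALTY):
    """Add the option's score, plus a penalty when the language changes."""
    updated = confidence + option.score
    if candidate.language != option.language:
        # Assumption: the penalty is additive in the log domain.
        updated += penalties.get((candidate.language, option.language), 0.0)
    return updated


def accepts(confidence, threshold=-10.0):
    """Hypothetical keyword determination criterion: a fixed score threshold."""
    return confidence >= threshold


candidate = Word("微信", "zh", -3.0)   # candidate keyword for the current frame
option = Word("login", "en", -4.0)     # word option for the subsequent frame
confidence = update_confidence(candidate.score, candidate, option)
print(confidence, accepts(confidence))  # -9.0 True
```

With a same-language word option no penalty is applied, so the score is simply the sum of the two word scores.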
-
Publication No.: US20150095032A1
Publication date: 2015-04-02
Application No.: US14567969
Application date: 2014-12-11
Inventors: Lu Li, Li Lu, Jianxiong Ma, Linghui Kong, Feng Rao, Shuai Yue, Xiang Zhang, Haibo Liu, Eryu Wang, Bo Chen
IPC classification: G10L15/08
CPC classification: G10L15/08, G10L15/083, G10L2015/088
Abstract: This application discloses a method of recognizing a keyword in speech that includes a sequence of audio frames, the sequence including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.
-
Publication No.: US09818432B2
Publication date: 2017-11-14
Application No.: US15176047
Application date: 2016-06-07
Inventors: Lu Li, Jianxiong Ma, Li Lu
CPC classification: G10L25/54, G06F17/30026, G10L15/14, G10L21/10, G10L2015/027, G10L2015/088
Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. The method includes: while running a social networking application, receiving a first audio input from a user of the computer system, the first audio input including one or more search keywords; generating a first audio confusion network from the first audio input; determining whether the first audio confusion network matches at least one of one or more second audio confusion networks, wherein a respective second audio confusion network was generated from a corresponding second audio input associated with a chat session of which the user is a participant; and identifying a second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network, wherein the identified second audio input includes the one or more search keywords that are included in the first audio input.
-
Publication No.: US09558741B2
Publication date: 2017-01-31
Application No.: US14291138
Application date: 2014-05-30
Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
IPC classification: G10L15/28, G10L15/08, G10L15/18, G10L15/183
CPC classification: G10L15/083, G10L15/1815, G10L15/183
Abstract: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.
-
Publication No.: US09336197B2
Publication date: 2016-05-10
Application No.: US14108224
Application date: 2013-12-16
Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen
CPC classification: G06F17/2735
Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.
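The vocabulary-based decision described in this abstract can be sketched as follows. This is a rough illustration under stated assumptions, not the patented method: the abstract does not define the "predetermined occurrence criteria", so the sketch assumes a simple ratio threshold (`min_ratio`), and the function names and toy vocabularies are hypothetical.

```python
from collections import Counter


def exclusive_sublist(first_vocab, second_vocab):
    """Words used in the first language but not in the second."""
    return set(first_vocab) - set(second_vocab)


def is_first_language(tokens, first_vocab, second_vocab, min_ratio=0.2):
    """Decide whether the content is in the first language.

    Assumption: the occurrence criterion is met when expressions from the
    first-language-only sub-list make up at least `min_ratio` of all
    expressions found in the comprehensive vocabulary list.
    """
    comprehensive = set(first_vocab) | set(second_vocab)
    first_only = exclusive_sublist(first_vocab, second_vocab)
    counts = Counter(t for t in tokens if t in comprehensive)
    total = sum(counts.values())
    if total == 0:
        return False
    first_only_total = sum(n for word, n in counts.items() if word in first_only)
    return first_only_total / total >= min_ratio


# Toy vocabularies for two similar "languages" (illustrative only).
first_vocab = {"colour", "lorry", "the", "is"}
second_vocab = {"color", "truck", "the", "is"}

print(is_first_language("the lorry is the colour".split(), first_vocab, second_vocab))
print(is_first_language("the truck is the color".split(), first_vocab, second_vocab))
```

Counting only the words exclusive to one language is what lets the approach separate similar languages, since their shared vocabulary carries no discriminating signal.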
-
Publication No.: US10453477B2
Publication date: 2019-10-22
Application No.: US15728464
Application date: 2017-10-09
Inventors: Lu Li, Jianxiong Ma, Li Lu
Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. While running a social networking application, a computer system receives a first audio input from a user of the computer system and then generates a first audio confusion network from the first audio input. After comparing the first audio confusion network with one or more second audio confusion networks, each corresponding to a second audio input associated with one of a plurality of participants of a chat session of the social networking application, the computer system identifies at least one second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network and displays a portion of the chat session including a visual icon representing the identified second audio input on a display of the computer system.
-
Publication No.: US09355637B2
Publication date: 2016-05-31
Application No.: US14620000
Application date: 2015-02-11
Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
CPC classification: G10L15/18, G10L15/08, G10L15/28, G10L15/32, G10L2015/088
Abstract: A method and an apparatus are provided for retrieving a keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using the recognition models in the model file one by one, and determines a recognition model based on a language matching rate; determines a decoding model corresponding to the recognition model; decodes the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary against the word recognition result, and outputs a matched keyword.
-
Publication No.: US09257118B2
Publication date: 2016-02-09
Application No.: US14620000
Application date: 2015-02-11
Inventors: Jianxiong Ma, Lu Li, Li Lu, Xiang Zhang, Shuai Yue, Feng Rao, Eryu Wang, Linghui Kong
Abstract: A method and an apparatus are provided for retrieving a keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using the recognition models in the model file one by one, and determines a recognition model based on a language matching rate; determines a decoding model corresponding to the recognition model; decodes the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary against the word recognition result, and outputs a matched keyword.
-
Publication No.: US20140350934A1
Publication date: 2014-11-27
Application No.: US14291138
Application date: 2014-05-30
Inventors: Lou Li, Li Lu, Xiang Zhang, Feng Rao, Shuai Yue, Bo Chen, Jianxiong Ma, Haibo Liu
IPC classification: G10L17/22
CPC classification: G10L15/083, G10L15/1815, G10L15/183
Abstract: Systems and methods are provided for voice identification. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as an identification result.
-
Publication No.: US20140207440A1
Publication date: 2014-07-24
Application No.: US14108224
Application date: 2013-12-16
Inventors: Lu Li, Qiang Cheng, Jianxiong Ma, Feng Rao, Duling Lu, Li Lu, Xiang Zhang, Bo Chen
IPC classification: G06F17/28
CPC classification: G06F17/2735
Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.