Keyword detection for speech recognition
    1.
    发明授权
    Keyword detection for speech recognition 有权
    语音识别的关键字检测

    公开(公告)号:US09230541B2

    公开(公告)日:2016-01-05

    申请号:US14567969

    申请日:2014-12-11

    IPC分类号: G10L15/08

    摘要: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

    摘要翻译: 本申请公开了一种实现的方法,其中识别语音中的关键字,其中包括进一步包括当前帧和后续帧的音频帧序列。 使用包括多种语言的关键词和填充词的解码网络为当前帧确定候选关键字,并且用于确定音频帧序列的置信度分数。 还基于解码网络为后续帧确定字选项,并且当候选关键词和词选项与两种不同类型的语言相关联时,至少基于惩罚来更新音频帧序列的置信度得分 与两种不同类型语言相关联的因素。 然后通过根据关键字确定标准评估更新的可信度得分,确定音频帧序列以包括候选关键词和词选项。

    Keyword Detection For Speech Recognition
    2.
    发明申请
    Keyword Detection For Speech Recognition 有权
    语音识别的关键字检测

    公开(公告)号:US20150095032A1

    公开(公告)日:2015-04-02

    申请号:US14567969

    申请日:2014-12-11

    IPC分类号: G10L15/08

    摘要: This application discloses a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

    摘要翻译: 本申请公开了一种实现的方法,其中识别语音中的关键字,其中包括进一步包括当前帧和后续帧的音频帧序列。 使用包括多种语言的关键词和填充词的解码网络为当前帧确定候选关键字,并且用于确定音频帧序列的置信度分数。 还基于解码网络为后续帧确定字选项,并且当候选关键词和词选项与两种不同类型的语言相关联时,至少基于惩罚来更新音频帧序列的置信度得分 与两种不同类型语言相关联的因素。 然后通过根据关键字确定标准评估更新的可信度得分,确定音频帧序列以包括候选关键词和词选项。

    Systems and methods for speech recognition
    4.
    发明授权
    Systems and methods for speech recognition 有权
    用于语音识别的系统和方法

    公开(公告)号:US09558741B2

    公开(公告)日:2017-01-31

    申请号:US14291138

    申请日:2014-05-30

    摘要: Systems and methods are provided for speech recognition. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as a speech recognition result.

    摘要翻译: 提供了语音识别的系统和方法。 例如,从获取的语音信号中提取音频特性; 至少基于与音频特征相关联的信息来识别音节混淆网络; 基于至少与音节混淆网络和预定语音字典相关联的信息生成单词格点; 并且在单词格中计算出最佳字符序列作为语音识别结果。

    Language recognition based on vocabulary lists
    5.
    发明授权
    Language recognition based on vocabulary lists 有权
    基于词汇表的语言识别

    公开(公告)号:US09336197B2

    公开(公告)日:2016-05-10

    申请号:US14108224

    申请日:2013-12-16

    IPC分类号: G06F17/28 G06F17/27

    CPC分类号: G06F17/2735

    摘要: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

    摘要翻译: 在计算机上实现一种方法来确定某些信息内容是以两种或多种类似语言中选择的特定语言来组合或编译的。 计算机将第一语言的第一词汇列表和第二语言的第二词汇列表集成到综合词汇列表中。 该集成包括根据第二词汇列表分析第一词汇列表以识别在第一语言中使用的第一词汇子列表,而不是第二语言。 然后,计算机在信息内容中识别包括在综合词汇列表中的多个表达式以及包括在第一词汇子列表中的表达式的子集。 在确定表达子集的总出现频率满足预定出现标准的情况下,计算机确定信息内容以第一语言组成。

    Method and computer system for performing audio search on a social networking platform

    公开(公告)号:US10453477B2

    公开(公告)日:2019-10-22

    申请号:US15728464

    申请日:2017-10-09

    发明人: Lu Li Jianxiong Ma Li Lu

    摘要: Methods and computer systems for audio search on a social networking platform are disclosed. While running a social networking application, a computer system receives a first audio input from a user of the computer system and then generates a first audio confusion network from the first audio input. After comparing the first audio confusion network with one or more second audio confusion networks, each corresponding to a second audio input associated with one of a plurality of participants of a chat session of the social networking application, the computer system identifies at least one second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network and displays a portion of the chat session including a visual icon representing the identified second audio input on a display of the computer system.

    Method and apparatus for performing speech keyword retrieval
    7.
    发明授权
    Method and apparatus for performing speech keyword retrieval 有权
    执行语音关键词检索的方法和装置

    公开(公告)号:US09355637B2

    公开(公告)日:2016-05-31

    申请号:US14620000

    申请日:2015-02-11

    摘要: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

    摘要翻译: 提供了一种用于检索关键字的方法和装置。 该装置在模型文件中配置至少两种类型的语言模型,其中每种类型的语言模型包括识别模型和相应的解码模型; 该设备从待处理语音数据中提取语音特征; 通过在模型文件中逐一使用识别模型对提取出的语音特征进行语言匹配,并根据语言匹配率确定识别模型; 并确定与识别模型相对应的解码模型; 通过使用所确定的解码模型来解码所提取的语音特征,并且在解码之后获得字识别结果; 并且将关键词字典中的关键字与单词识别结果进行匹配,并输出匹配的关键字。

    Method and apparatus for performing speech keyword retrieval

    公开(公告)号:US09257118B2

    公开(公告)日:2016-02-09

    申请号:US14620000

    申请日:2015-02-11

    摘要: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

    Systems and Methods for Voice Identification
    9.
    发明申请
    Systems and Methods for Voice Identification 有权
    语音识别系统与方法

    公开(公告)号:US20140350934A1

    公开(公告)日:2014-11-27

    申请号:US14291138

    申请日:2014-05-30

    IPC分类号: G10L17/22

    摘要: Systems and methods are provided for voice identification. For example, audio characteristics are extracted from acquired voice signals; a syllable confusion network is identified based on at least information associated with the audio characteristics; a word lattice is generated based on at least information associated with the syllable confusion network and a predetermined phonetic dictionary; and an optimal character sequence is calculated in the word lattice as an identification result.

    摘要翻译: 为语音识别提供了系统和方法。 例如,从获取的语音信号中提取音频特性; 至少基于与音频特征相关联的信息来识别音节混淆网络; 基于至少与音节混淆网络和预定语音字典相关联的信息生成单词格点; 并且在字格中计算最佳字符序列作为识别结果。

    LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS
    10.
    发明申请
    LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS 有权
    基于VOCABULARY LISTS的语言识别

    公开(公告)号:US20140207440A1

    公开(公告)日:2014-07-24

    申请号:US14108224

    申请日:2013-12-16

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2735

    摘要: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

    摘要翻译: 在计算机上实现一种方法来确定某些信息内容是以两种或多种类似语言中选择的特定语言来组合或编译的。 计算机将第一语言的第一词汇列表和第二语言的第二词汇列表集成到综合词汇列表中。 该集成包括根据第二词汇列表分析第一词汇列表以识别在第一语言中使用的第一词汇子列表,而不是第二语言。 然后,计算机在信息内容中识别包括在综合词汇列表中的多个表达式以及包括在第一词汇子列表中的表达式的子集。 在确定表达子集的总出现频率满足预定出现标准的情况下,计算机确定信息内容以第一语言组成。