SYSTEMS AND METHODS FOR ADDING PUNCTUATIONS
    1.
    发明申请
    SYSTEMS AND METHODS FOR ADDING PUNCTUATIONS 审中-公开
    用于增加攻击力的系统和方法

    公开(公告)号:WO2014187069A1

    公开(公告)日:2014-11-27

    申请号:PCT/CN2013/085347

    申请日:2013-10-16

    Abstract: Systems and methods are provided for adding punctuations. For example, one or more first feature units are identified in a voice file taken as a whole; the voice file is divided into multiple segments; one or more second feature units are identified in the voice file; a first aggregate weight of first punctuation states of the voice file and a second aggregate weight of second punctuation states of the voice file are determined, using a language model established based on word separation and third semantic features; a weighted calculation is performed to generate a third aggregate weight based on at least information associated with the first aggregate weight and the second aggregate weight; and one or more final punctuations are added to the voice file based on at least information associated with the third aggregate weight.

    Abstract translation: 提供了系统和方法来添加标点符号。 例如,一个或多个第一特征单元在作为整体而言的语音文件中被识别; 语音文件分为多个段; 在语音文件中识别一个或多个第二特征单元; 使用基于词分离和第三语义特征建立的语言模型来确定语音文件的第一标点状态的第一聚合权重和语音文件的第二标点状态的第二聚合权重; 基于至少与第一聚集权重和第二聚集权重相关联的信息来执行加权计算以产生第三聚集权重; 并且基于至少与第三聚合权重相关联的信息将一个或多个最终标点符号添加到语音文件。

    METHOD AND APPARATUS FOR PERFORMING SPEECH KEYWORD RETRIEVAL
    2.
    发明申请
    METHOD AND APPARATUS FOR PERFORMING SPEECH KEYWORD RETRIEVAL 审中-公开
    执行语音关键词检索的方法和装置

    公开(公告)号:WO2015024431A1

    公开(公告)日:2015-02-26

    申请号:PCT/CN2014/083531

    申请日:2014-08-01

    CPC classification number: G10L15/18 G10L15/08 G10L15/28 G10L15/32 G10L2015/088

    Abstract: A method and an apparatus are provided for retrieving keyword. The apparatus configures at least two types of language models in a model file, where each type of language model includes a recognition model and a corresponding decoding model; the apparatus extracts a speech feature from the to-be-processed speech data; performs language matching on the extracted speech feature by using recognition models in the model file one by one, and determines a recognition model based on a language matching rate; and determines a decoding model corresponding to the recognition model; decoding the extracted speech feature by using the determined decoding model, and obtains a word recognition result after the decoding; and matches a keyword in a keyword dictionary and the word recognition result, and outputs a matched keyword.

    Abstract translation: 提供了一种用于检索关键字的方法和装置。 该装置在模型文件中配置至少两种类型的语言模型,其中每种类型的语言模型包括识别模型和相应的解码模型; 该设备从待处理语音数据中提取语音特征; 通过在模型文件中逐一使用识别模型对提取出的语音特征进行语言匹配,并根据语言匹配率确定识别模型; 并确定与识别模型相对应的解码模型; 通过使用所确定的解码模型来解码所提取的语音特征,并且在解码之后获得字识别结果; 并且将关键词字典中的关键词与单词识别结果进行匹配,并输出匹配关键字。

    METHOD AND DEVICE FOR KEYWORD DETECTION
    3.
    发明申请
    METHOD AND DEVICE FOR KEYWORD DETECTION 审中-公开
    用于关键字检测的方法和装置

    公开(公告)号:WO2014117547A1

    公开(公告)日:2014-08-07

    申请号:PCT/CN2013/085905

    申请日:2013-10-24

    CPC classification number: G10L15/063 G10L15/08 G10L2015/088

    Abstract: An electronic device with one or more processors and memory trains an acoustic model with an international phonetic alphabet (IPA) phoneme mapping collection and audio samples in different languages, where the acoustic model includes: a foreground model; and a background model. The device generates a phone decoder based on the trained acoustic model. The device collects keyword audio samples, decodes the keyword audio samples with the phone decoder to generate phoneme sequence candidates, and selects a keyword phoneme sequence from the phoneme sequence candidates. After obtaining the keyword phoneme sequence, the device detects one or more keywords in an input audio signal with the trained acoustic model, including: matching phonemic keyword portions of the input audio signal with phonemes in the keyword phoneme sequence with the foreground model; and filtering out phonemic non-keyword portions of the input audio signal with the background model.

    Abstract translation: 具有一个或多个处理器和存储器的电子设备具有使用不同语言的国际语音字母(IPA)音素映射收集和音频样本的声学模型,其中声学模型包括:前景模型; 和背景模型。 该设备基于经过训练的声学模型生成电话解码器。 设备收集关键字音频样本,用手机解码器对关键词音频样本进行解码,以产生音素序列候选,并从音素序列候选中选择关键词音素序列。 在获得关键字音素序列之后,设备利用经训练的声学模型检测输入音频信号中的一个或多个关键词,包括:使用前景模型将关键词音素序列中的输入音频信号的音素关键字部分与音素相匹配; 并用背景模型滤出输入音频信号的音素非关键字部分。

    KEYWORD DETECTION FOR SPEECH RECOGNITION
    4.
    发明申请
    KEYWORD DETECTION FOR SPEECH RECOGNITION 审中-公开
    语音识别的关键词检测

    公开(公告)号:WO2015021844A1

    公开(公告)日:2015-02-19

    申请号:PCT/CN2014/082332

    申请日:2014-07-16

    CPC classification number: G10L15/08 G10L15/083 G10L2015/088

    Abstract: Disclosed is a method implemented of recognizing a keyword in a speech that includes a sequence of audio frames further including a current frame and a subsequent frame. A candidate keyword is determined for the current frame using a decoding network that includes keywords and filler words of multiple languages, and used to determine a confidence score for the audio frame sequence. A word option is also determined for the subsequent frame based on the decoding network, and when the candidate keyword and the word option are associated with two distinct types of languages, the confidence score of the audio frame sequence is updated at least based on a penalty factor associated with the two distinct types of languages. The audio frame sequence is then determined to include both the candidate keyword and the word option by evaluating the updated confidence score according to a keyword determination criterion.

    Abstract translation: 公开了一种在语音中识别关键字的方法,该方法包括进一步包括当前帧和后续帧的音频帧序列。 使用包括多种语言的关键词和填充词的解码网络为当前帧确定候选关键字,并且用于确定音频帧序列的置信度分数。 还基于解码网络为后续帧确定字选项,并且当候选关键词和词选项与两种不同类型的语言相关联时,至少基于惩罚来更新音频帧序列的置信度得分 与两种不同类型语言相关联的因素。 然后通过根据关键字确定标准评估更新的可信度得分,确定音频帧序列以包括候选关键词和词选项。

    METHOD AND SYSTEM OF ADDING PUNCTUATION AND ESTABLISHING LANGUAGE MODEL
    5.
    发明申请
    METHOD AND SYSTEM OF ADDING PUNCTUATION AND ESTABLISHING LANGUAGE MODEL 审中-公开
    方法和系统的添加和建立语言模型

    公开(公告)号:WO2014117553A1

    公开(公告)日:2014-08-07

    申请号:PCT/CN2013/086618

    申请日:2013-11-06

    CPC classification number: G10L15/04 G10L15/1815

    Abstract: A method of processing information content based on a language model is performed at a computer. The method includes the following steps: identifying a plurality of expressions in the information content that is queued to be processed; dividing the plurality of expressions into a plurality of characteristic units according to semantic features and predetermined characteristics associated with each of the plurality of characteristic units, each characteristic unit including a subset of the plurality of expressions and the predetermined characteristics at least including a respective integer number of expressions that are included in the characteristic unit; extracting, from the language model, a plurality of probabilities for a plurality of punctuation marks associated with each of the plurality of characteristic units; and in accordance with the extracted probabilities, associating a respective punctuation mark with each of the plurality of characteristic units included in the information content.

    Abstract translation: 在计算机上执行基于语言模型处理信息内容的方法。 该方法包括以下步骤:识别排队等待处理的信息内容中的多个表达; 根据与多个特征单元中的每一个相关联的语义特征和预定特征将多个表达式划分为多个特征单元,每个特征单元包括多个表达式的子集和预定特征,至少包括相应的整数 包含在特征单元中的表达式; 从所述语言模型中提取与所述多个特征单元中的每一个相关联的多个标点符号的多个概率; 并且根据所提取的概率,将相应的标点符号与包括在信息内容中的多个特征单元中的每一个相关联。

    METHOD AND COMPUTER SYSTEM FOR PERFORMING AUDIO SEARCH ON SOCIAL NETWORKING PLATFORM
    6.
    发明申请
    METHOD AND COMPUTER SYSTEM FOR PERFORMING AUDIO SEARCH ON SOCIAL NETWORKING PLATFORM 审中-公开
    用于在社交网络平台上进行音频搜索的方法和计算机系统

    公开(公告)号:WO2015106646A1

    公开(公告)日:2015-07-23

    申请号:PCT/CN2015/070227

    申请日:2015-01-06

    Abstract: Methods and computer systems for audio search on a social networking platform are disclosed. The method includes: while running a social networking application, receiving a first audio input from a user of the computer system, the first audio input including one or more search keywords; generating a first audio confusion network from the first audio input; determining whether the first audio confusion network matches at least one of one or more second audio confusion networks, wherein a respective second audio confusion network was generated from a corresponding second audio input associated with a chat session of which the user is a participant; and identifying a second audio input corresponding to the at least one second audio confusion network that matches the first audio confusion network, wherein the identified second audio input includes the one or more search keywords that are included in the first audio input.

    Abstract translation: 公开了用于在社交网络平台上进行音频搜索的方法和计算机系统。 该方法包括:在运行社交网络应用程序时,从计算机系统的用户接收第一音频输入,所述第一音频输入包括一个或多个搜索关键字; 从所述第一音频输入产生第一音频混淆网络; 确定所述第一音频混淆网络是否匹配一个或多个第二音频混淆网络中的至少一个,其中相应的第二音频混淆网络是从与所述用户是其参与者的聊天会话相关联的对应的第二音频输入生成的; 以及识别对应于与所述第一音频混淆网络匹配的所述至少一个第二音频混淆网络的第二音频输入,其中所识别的第二音频输入包括包括在所述第一音频输入中的所述一个或多个搜索关键字。

    SYSTEMS AND METHODS FOR AUDIO COMMAND RECOGNITION
    7.
    发明申请
    SYSTEMS AND METHODS FOR AUDIO COMMAND RECOGNITION 审中-公开
    用于音频命令识别的系统和方法

    公开(公告)号:WO2015081681A1

    公开(公告)日:2015-06-11

    申请号:PCT/CN2014/079766

    申请日:2014-06-12

    Abstract: A method, an electronic system and a non-transitory computer readable storage medium for recognizing audio commands in an electronic device are disclosed. The electronic device obtains audio data based on an audio signal provided by a user and extracts characteristic audio fingerprint features from the audio data. The electronic device further determines whether the corresponding audio signal is generated by an authorized user by comparing the characteristic audio fingerprint features with an audio fingerprint model for the authorized user and with a universal background model that represents user-independent audio fingerprint features, respectively. When the corresponding audio signal is generated by the authorized user of the electronic device, an audio command is extracted from the audio data, and an operation is performed according to the audio command.

    Abstract translation: 公开了一种用于在电子设备中识别音频命令的方法,电子系统和非暂时性计算机可读存储介质。 电子设备基于由用户提供的音频信号获得音频数据,并从音频数据中提取特征音频指纹特征。 电子设备还通过将特征音频指纹特征与用于授权用户的音频指纹模型进行比较,以及分别表示用户独立的音频指纹特征的通用背景模型来确定对应的音频信号是否由授权用户产生。 当由电子设备的授权用户产生相应的音频信号时,从音频数据中提取音频命令,并根据音频命令进行操作。

    METHOD AND APPARATUS FOR BUILDING A LANGUAGE MODEL
    8.
    发明申请
    METHOD AND APPARATUS FOR BUILDING A LANGUAGE MODEL 审中-公开
    用于建立语言模型的方法和装置

    公开(公告)号:WO2014190732A1

    公开(公告)日:2014-12-04

    申请号:PCT/CN2013/089588

    申请日:2013-12-16

    CPC classification number: G06F17/2775 G06F17/277 G10L15/063 G10L15/183

    Abstract: A method includes: acquiring data samples; performing categorized sentence mining in the acquired data samples to obtain categorized training samples for multiple categories; building a text classifier based on the categorized training samples; classifying the data samples using the text classifier to obtain a class vocabulary and a corpus for each category; mining the corpus for each category according to the class vocabulary for the category to obtain a respective set of high-frequency language templates; training on the templates for each category to obtain a template-based language model for the category; training on the corpus for each category to obtain a class-based language model for the category; training on the class vocabulary for each category to obtain a lexicon-based language model for the category; building a speech decoder according to an acoustic model, the class-based language model and the lexicon-based language model for any given field, and the data samples.

    Abstract translation: 一种方法包括:获取数据样本; 在获取的数据样本中执行分类句子挖掘以获得用于多个类别的分类训练样本; 基于分类训练样本构建文本分类器; 使用文本分类器对数据样本进行分类,以获得每个类别的类词汇和语料库; 根据类别的词汇量挖掘每个类别的语料库,以获得相应的一组高频语言模板; 对每个类别的模板进行培训,以获得该类别的基于模板的语言模型; 对每个类别的语料库进行训练,以获得该类别的基于类的语言模型; 对每个类别的课堂词汇进行培训,以获得该类别的基于词典的语言模型; 根据声学模型,基于类的语言模型和任何给定字段的基于词典的语言模型构建语音解码器,以及数据样本。

    METHOD AND DEVICE FOR ACOUSTIC LANGUAGE MODEL TRAINING
    9.
    发明申请
    METHOD AND DEVICE FOR ACOUSTIC LANGUAGE MODEL TRAINING 审中-公开
    用于语音语言模型训练的方法和装置

    公开(公告)号:WO2014117548A1

    公开(公告)日:2014-08-07

    申请号:PCT/CN2013/085948

    申请日:2013-10-25

    CPC classification number: G10L15/063 G10L15/05 G10L2015/0631

    Abstract: A method and a device for training an acoustic language model, include: conducting word segmentation for training samples in a training corpus using an initial language model containing no word class labels, to obtain initial word segmentation data containing no word class labels; performing word class replacement for the initial word segmentation data containing no word class labels, to obtain first word segmentation data containing word class labels; using the first word segmentation data containing word class labels to train a first language model containing word class labels; using the first language model containing word class labels to conduct word segmentation for the training samples in the training corpus, to obtain second word segmentation data containing word class labels; and in accordance with the second word segmentation data meeting one or more predetermined criteria, using the second word segmentation data containing word class labels to train the acoustic language model.

    Abstract translation: 一种用于训练声学语言模型的方法和装置,包括:使用不含词类标签的初始语言模型,在训练语料库中训练样本的词分割,以获得不包含词类标签的初始分词数据; 对不包含词类标签的初始分词数据执行单词类替换,以获得包含单词分类标签的第一分词数据; 使用包含词类标签的第一词分割数据来训练包含词类标签的第一语言模型; 使用包含词类标签的第一语言模型对训练语料库中的训练样本进行词分割,以获得包含词类标签的第二词分割数据; 并且根据满足一个或多个预定标准的第二字分割数据,使用包含词类标签的第二词分割数据来训练声学语言模型。

    LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS
    10.
    发明申请
    LANGUAGE RECOGNITION BASED ON VOCABULARY LISTS 审中-公开
    基于VOCABULARY LISTS的语言识别

    公开(公告)号:WO2014114117A1

    公开(公告)日:2014-07-31

    申请号:PCT/CN2013/085926

    申请日:2013-10-25

    CPC classification number: G06F17/2735 G06F17/2863

    Abstract: A method is implemented at a computer to determine that certain information content is composed or compiled in a specific language selected among two or more similar languages. The computer integrates a first vocabulary list of a first language and a second vocabulary list of a second language into a comprehensive vocabulary list. The integrating includes analyzing the first vocabulary list in view of the second vocabulary list to identify a first vocabulary sub-list that is used in the first language, but not in the second language. The computer then identifies, in the information content, a plurality of expressions that are included in the comprehensive vocabulary list, and a subset of expressions that are included in the first vocabulary sub-list. Upon a determination that a total frequency of occurrence of the subset of expressions meets predetermined occurrence criteria, the computer determines that the information content is composed in the first language.

    Abstract translation: 在计算机上实现一种方法,以确定某些信息内容是以两种或多种类似语言中选择的特定语言构成或编译的。 计算机将第一语言的第一词汇列表和第二语言的第二词汇列表集成到综合词汇列表中。 该集成包括根据第二词汇列表分析第一词汇列表以识别在第一语言中使用的第一词汇子列表,而不是第二语言。 然后,计算机在信息内容中识别包括在综合词汇列表中的多个表达式以及包括在第一词汇子列表中的表达式的子集。 在确定表达子集的总出现频率满足预定出现标准的情况下,计算机确定信息内容以第一语言组成。

Patent Agency Ranking