Speech recognition method, apparatus and navigation system
    21.
    发明申请
    Speech recognition method, apparatus and navigation system 审中-公开
    语音识别方法,装置和导航系统

    公开(公告)号:US20060100871A1

    公开(公告)日:2006-05-11

    申请号:US11253641

    申请日:2005-10-20

    IPC分类号: G10L15/04

    摘要: A speech recognition method and apparatus and a navigation system having the speech recognition apparatus are provided. The speech recognition method includes capturing speech as speech signal and extracting features from the speech signal, selecting candidates of a subword among subwords of the word based on the extracted features and displaying the candidate subwords for the subword, selecting candidates of a next subword following the subword based on the selected candidates of the subword and displaying the candidates of the next subword, and determining whether the user has selected one of the candidates of the next subword and, if not, selecting candidates of subwords following the next subword based on the series of subwords that have been previously selected by the user and displaying the selected candidates of the next subword.

    摘要翻译: 提供具有语音识别装置的语音识别方法和装置以及导航系统。 语音识别方法包括将语音作为语音信号进行采集,并从语音信号中提取特征,基于所提取的特征选择单词的子词中的子词的候选,并显示子词的候选词,选择下一个子词的候选 基于所选择的子词的候选者的子字,并显示下一个子词的候选,并且确定用户是否已经选择了下一个子词的候选中的一个,如果不是,则基于该系列选择下一个子词后的子词的候选 的以前由用户选择并显示下一个子词的选定候选者的子词。

    Multi-stage speech recognition apparatus and method
    24.
    发明授权
    Multi-stage speech recognition apparatus and method 有权
    多级语音识别装置及方法

    公开(公告)号:US08762142B2

    公开(公告)日:2014-06-24

    申请号:US11889665

    申请日:2007-08-15

    IPC分类号: G10L15/02 G10L15/16 G10L15/32

    CPC分类号: G10L15/32 G10L15/02 G10L15/16

    摘要: Provided are a multi-stage speech recognition apparatus and method. The multi-stage speech recognition apparatus includes a first speech recognition unit performing initial speech recognition on a feature vector, which is extracted from an input speech signal, and generating a plurality of candidate words; and a second speech recognition unit rescoring the candidate words, which are provided by the first speech recognition unit, using a temporal posterior feature vector extracted from the speech signal.

    摘要翻译: 提供了一种多级语音识别装置和方法。 多级语音识别装置包括:第一语音识别单元,对从输入语音信号提取的特征向量进行初始语音识别,生成多个候选词; 以及第二语音识别单元,使用从所述语音信号提取的时间后向特征向量,对由所述第一语音识别单元提供的候选词进行重新排序。

    Apparatus and method for detecting named entity
    25.
    发明授权
    Apparatus and method for detecting named entity 失效
    用于检测命名实体的装置和方法

    公开(公告)号:US08655646B2

    公开(公告)日:2014-02-18

    申请号:US11498050

    申请日:2006-08-03

    IPC分类号: G06F17/27 G06F17/30 G06F17/28

    CPC分类号: G06F17/278 G10L15/18

    摘要: An apparatus and method for detecting a named-entity. The apparatus includes a candidate-named-entity extraction module that detects a candidate-named-entity based on an initial learning example and feature information regarding morphemes constituting an inputted sentence, the candidate-named-entity extraction module providing a tagged sentence including the detected candidate-named-entity; a storage module that stores information regarding a named-entity dictionary and a rule; and a learning-example-regeneration module for finally determining whether the candidate-named-entity included in the provided sentence is a valid named-entity, based on the named-entity dictionary and the rule, the learning-example-regeneration module providing the sentence as a learning example, based on a determination result, so that a probability of candidate-named-entity detection is gradually updated.

    摘要翻译: 一种用于检测命名实体的装置和方法。 该装置包括候选名称实体提取模块,其基于初始学习示例和关于构成输入句子的语素的特征信息来检测候选名称实体,候选名称实体提取模块提供包括检测到的标记语句的标记语句 候选名称实体; 存储有关命名实体字典和规则的信息的存储模块; 以及基于命名实体字典和规则,最终确定包括在所提供的句子中的候选名称实体是否是有效命名实体的学习示例再生模块,所述学习示例再生模块提供 句子作为学习示例,基于确定结果,使得候选命名实体检测的概率逐渐更新。

    User adaptive speech recognition method and apparatus
    26.
    发明授权
    User adaptive speech recognition method and apparatus 有权
    用户自适应语音识别方法和装置

    公开(公告)号:US07996218B2

    公开(公告)日:2011-08-09

    申请号:US11354942

    申请日:2006-02-16

    IPC分类号: G10L15/00

    摘要: A user adaptive speech recognition method and apparatus is disclosed that controls user confirmation of a recognition candidate using a new threshold value adapted to a user. The user adaptive speech recognition method includes calculating a confidence score of a recognition candidate according to the result of speech recognition, setting a new threshold value adapted to the user based on a result of user confirmation of the recognition candidate and the confidence score of the recognition candidate, and outputting a corresponding recognition candidate as a result of the speech recognition if the calculated confidence score is higher than the new threshold value. Thus, the need for user confirmation of the result of speech recognition is reduced and the probability of speech recognition success is increased.

    摘要翻译: 公开了一种用户自适应语音识别方法和装置,其使用适合于用户的新阈值来控制用户对识别候选者的确认。 用户自适应语音识别方法包括根据语音识别结果计算识别候选者的置信度分数,根据识别候选者的用户确认结果和识别的置信度分数设定适合用户的新阈值 并且如果所计算的置信度分数高于新阈值,则作为语音识别的结果输出相应的识别候选。 因此,减少了对用户对语音识别结果的确认的需要,并提高了语音识别成功的概率。

    Method and apparatus for speech recognition using device usage pattern of user
    27.
    发明申请
    Method and apparatus for speech recognition using device usage pattern of user 有权
    用户使用设备使用模式进行语音识别的方法和装置

    公开(公告)号:US20080167871A1

    公开(公告)日:2008-07-10

    申请号:US11878595

    申请日:2007-07-25

    IPC分类号: G10L15/00

    CPC分类号: G10L15/22 G10L2015/227

    摘要: A method and apparatus for improving the performance of voice recognition in a mobile device are provided. The method of recognizing a voice includes: monitoring the usage pattern of a user of a device for inputting a voice; selecting predetermined words from among words stored in the device based on the result of monitoring, and storing the selected words; and recognizing a voice based on an acoustic model and predetermined words. In this way, a voice can be recognized by using prediction of whom the user mainly makes a call to. Also, by automatically modeling the device usage pattern of the user and applying the pattern to vocabulary for voice recognition based on probabilities, the performance of voice recognition, as actually felt by the user, can be enhanced.

    摘要翻译: 提供了一种用于提高移动设备中语音识别性能的方法和装置。 识别语音的方法包括:监视用于输入语音的设备的用户的使用模式; 基于监视结果从存储在设备中的字中选择预定字,并存储所选择的字; 以及基于声学模型和预定词语识别语音。 以这种方式,可以通过使用用户主要呼叫的人的预测来识别语音。 另外,通过对用户的设备使用模式进行自动建模并将该模式​​应用于基于概率的语音识别的词汇表,可以增强用户实际感觉到的语音识别的性能。

    System and method for speech synthesis using a smoothing filter
    28.
    发明授权
    System and method for speech synthesis using a smoothing filter 有权
    使用平滑滤波器进行语音合成的系统和方法

    公开(公告)号:US07277856B2

    公开(公告)日:2007-10-02

    申请号:US10284189

    申请日:2002-10-31

    IPC分类号: G10L13/00

    CPC分类号: G10L13/07

    摘要: A speech synthesis system for controlling a discontinuous distortion that occurs at the transition portion between concatenated phonemes which are speech units of a synthesized speech using a smoothing technique, comprising: a discontinuous distortion processing means adapted to predict a discontinuity at the transition portion between concatenated samples of phonemes used for a speech synthesis through a predetermined learning process, and control a discontinuity at the transition portion between the concatenated phonemes of the synthesized speech in such a fashion that it is smoothed adaptively to correspond to a degree of the predicted discontinuity. The smoothing filter smoothes the synthesized speech so that the discontinuity degree of synthesized speech follows the predicted discontinuity degree according to the filter coefficient (a) changed adaptively to correspond to a ratio of the predicted discontinuity degree to the real discontinuity degree. That is, since a discontinuity at a transition portion between concatenated phonemes of the synthesized speech (IN) is adaptively smoothed to follow that which occurs in the actually spoken sound, the synthesized speech (IN) can be approximated more closely to a real human voice.

    摘要翻译: 一种用于控制在作为使用平滑技术的合成语音的语音单元的级联音素之间的过渡部分处发生的不连续失真的语音合成系统,包括:不连续失真处理装置,用于预测级联样本之间的过渡部分处的不连续性 用于通过预定学习过程进行语音合成的音素,并且以合成语音的级联音素之间的转换部分以这样的方式控制不连续性,使得它被自适应地平滑地对应于预测的不连续性的程度。 平滑滤波器对合成语音进行平滑,使得合成语音的不连续度遵循根据预测不连续度与实际不连续度的比率自适应地改变的滤波器系数(a)的预测不连续度。 也就是说,由于在合成语音(IN)的级联音素之间的过渡部分处的不连续性被自适应地平滑以跟随发生在实际语音中的音频,所以合成语音(IN)可以更接近于真正的人类语音 。

    Apparatus and method for detecting voice activity period
    29.
    发明申请
    Apparatus and method for detecting voice activity period 有权
    检测语音活动期的装置和方法

    公开(公告)号:US20070073537A1

    公开(公告)日:2007-03-29

    申请号:US11472304

    申请日:2006-06-22

    IPC分类号: G10L15/20

    CPC分类号: G10L25/78

    摘要: An apparatus and method for detecting a voice activity period. The apparatus for detecting a voice activity period includes a domain conversion module that converts an input signal into a frequency domain signal in the unit of a frame obtained by dividing the input signal at predetermined intervals, a subtracted-spectrum-generation module that generates a spectral subtraction signal which is obtained by subtracting a predetermined noise spectrum from the converted frequency domain signal, a modeling module that applies the spectral subtraction signal to a predetermined probability distribution model, and a speech-detection module that determines whether a speech signal is present in a current frame through a probability distribution calculated by the modeling module.

    摘要翻译: 一种用于检测语音活动期的装置和方法。 用于检测语音活动期间的装置包括域转换模块,该域转换模块将输入信号转换成以预定间隔划分输入信号所获得的帧为单位的频域信号;产生频谱的减法频谱生成模块 通过从转换的频域信号中减去预定的噪声频谱获得的减法信号,将频谱减法信号应用于预定概率分布模型的建模模块,以及确定语音信号是否存在于语音信号中的语音检测模块 通过由建模模块计算的概率分布的当前帧。

    Speech recognition method and apparatus using lexicon group tree
    30.
    发明申请
    Speech recognition method and apparatus using lexicon group tree 有权
    使用词汇组树的语音识别方法和装置

    公开(公告)号:US20060173673A1

    公开(公告)日:2006-08-03

    申请号:US11342701

    申请日:2006-01-31

    IPC分类号: G06F17/27

    CPC分类号: G06F17/2765 G10L15/197

    摘要: A method and an apparatus for selecting a vocabulary closest to an input speech from among lexicons stored in memory, wherein a centroid lexicon representing lexicons belonging to a predetermined lexicon group is generated. Two lexicons, having a longest distance therebetween in the lexicon group, are selected using the centroid lexicon from the lexicon group, and a node indicating the lexicon group branches based on the two selected lexicons. A node having low group similarity is selected from among current terminal nodes, including branch nodes, and the above procedure is repeatedly performed on a lexicon group indicated by the selected node.

    摘要翻译: 一种用于从存储在存储器中的词典中选择最接近输入语音的词汇的方法和装置,其中生成表示属于预定词典组的词典的质心词典。 在词典组中具有最长距离的两个词典使用来自词典组的质心词典进行选择,并且指示词典组的节点基于两个选定的词典进行分支。 从包括分支节点的当前终端节点中选择具有低组相似性的节点,并且对由所选节点指示的词典组重复执行上述过程。