METHOD AND SYSTEM FOR GENERATING SEARCH NETWORK FOR VOICE RECOGNITION
    1.
    发明申请
    METHOD AND SYSTEM FOR GENERATING SEARCH NETWORK FOR VOICE RECOGNITION 审中-公开
    用于生成语音识别的搜索网络的方法和系统

    公开(公告)号:US20130138441A1

    公开(公告)日:2013-05-30

    申请号:US13585475

    申请日:2012-08-14

    IPC分类号: G10L15/04

    CPC分类号: G10L15/083 G10L15/187

    摘要: Disclosed is a method of generating a search network for voice recognition, the method including: generating a pronunciation transduction weighted finite state transducer by implementing a pronunciation transduction rule representing a phenomenon of pronunciation transduction between recognition units as a weighted finite state transducer; and composing the pronunciation transduction weighted finite state transducer and one or more weighted finite state transducers.

    摘要翻译: 本发明公开了一种生成用于语音识别的搜索网络的方法,该方法包括:通过实现表示作为加权有限状态换能器的识别单元之间的语音转换现象的语音转换规则,生成语音转导加权有限状态换能器; 并组合发音转导加权有限状态换能器和一个或多个加权有限状态换能器。

    APPARATUS AND METHOD FOR CREATING ACOUSTIC MODEL
    2.
    发明申请
    APPARATUS AND METHOD FOR CREATING ACOUSTIC MODEL 审中-公开
    用于创建声学模型的装置和方法

    公开(公告)号:US20120109650A1

    公开(公告)日:2012-05-03

    申请号:US13284095

    申请日:2011-10-28

    IPC分类号: G10L15/14

    CPC分类号: G10L15/144 G10L15/285

    摘要: Disclosed herein is an apparatus and method for creating an acoustic model. The apparatus includes a binary tree creation unit, an information creation unit, and a binary tree reduction unit. The binary tree creation unit creates a binary tree by repeatedly merging a plurality of Gaussian components for each Hidden Markov Model (HMM) state of an acoustic model based on a distance measure reflecting a variation in likelihood score. The information creation unit creates information about information about the largest size of the acoustic model in accordance with a platform including a speech recognizer. The binary tree reduction unit reduces the binary tree in accordance with the information about the largest size of the acoustic model.

    摘要翻译: 本文公开了一种用于创建声学模型的装置和方法。 该装置包括二叉树创建单元,信息创建单元和二进制树缩小单元。 二叉树创建单元通过基于反映可能性得分的变化的距离度量反复地合并声学模型的每个隐马尔可夫模型(HMM)状态的多个高斯分量来创建二叉树。 信息创建单元根据包括语音识别器的平台创建关于声学模型的最大尺寸的信息。 二叉树缩小单元根据关于声学模型的最大尺寸的信息减少二叉树。

    Automatic translation method and system based on corresponding sentence pattern
    4.
    发明授权
    Automatic translation method and system based on corresponding sentence pattern 失效
    基于相应句型的自动翻译方法和系统

    公开(公告)号:US08015016B2

    公开(公告)日:2011-09-06

    申请号:US11924102

    申请日:2007-10-25

    IPC分类号: G10L21/00 G06F17/27 G06F17/28

    摘要: Provided are an automatic speech translation system and a method for obtaining accurate translation performance with a simple structure. Because input and output sentences are written in different languages, automatic speech translation requires techniques for processing different languages. Repetition of text processing like morpheme analysis or sentence parsing in conventional automatic speech translation can complicate the overall translation process. Meanwhile, although input and output sentences are written in different languages, they have to have the same meaning and a corresponding sentence form and words. Accordingly, the corresponding words and sentence forms of the two languages can be expressed with a simple structure and utilized in the automatic speech translation process, thereby maintaining consistency during the process and avoiding unnecessary process repetition, which reduces errors and improves performance.

    摘要翻译: 提供了一种自动语音翻译系统和一种以简单的结构获得准确的翻译性能的方法。 由于输入和输出句子是用不同的语言编写的,所以自动语音翻译需要处理不同语言的技术。 在常规自动语音翻译中重复文本处理,如语素分析或句法解析可能会使整个翻译过程复杂化。 同时,虽然输入和输出句子是用不同的语言编写的,但它们必须具有相同的含义和相应的句子形式和单词。 因此,两种语言的对应单词和句子形式可以用简单的结构表示,并在自动语音翻译过程中使用,从而保持过程中的一致性,避免不必要的过程重复,从而减少错误并提高性能。

    METHOD AND APPARATUS FOR RECOGNIZING SPEECH
    5.
    发明申请
    METHOD AND APPARATUS FOR RECOGNIZING SPEECH 审中-公开
    用于识别语音的方法和装置

    公开(公告)号:US20090076817A1

    公开(公告)日:2009-03-19

    申请号:US12047634

    申请日:2008-03-13

    IPC分类号: G10L15/00

    CPC分类号: G10L15/187 G10L2015/025

    摘要: Provided are an apparatus and method for recognizing speech, in which reliability with respect to phoneme-recognized phoneme sequences is calculated and performance of speech recognition is enhanced using the calculated results. The method of recognizing speech includes the steps of: determining a boundary between phonemes included in character sequences that are phonetically input to detect each phoneme interval; calculating reliability according to a probability that a phoneme indicated by the detected phoneme interval corresponds to a phoneme included in a predefined phoneme model; calculating a phoneme alignment cost with respect to the character sequences based on the calculated reliability and a pre-trained and stored phoneme recognition probability distribution; and performing phoneme alignment based on the calculated phoneme alignment cost to perform speech recognition on the input character sequences. As a result, reliability with respect to the phoneme-recognized phoneme sequences can be calculated, and the performance of speech recognition can be enhanced using the calculated results.

    摘要翻译: 提供了一种用于识别语音的装置和方法,其中计算相对于音素识别的音素序列的可靠性,并且使用所计算的结果来增强语音识别的性能。 识别语音的方法包括以下步骤:确定语音输入的字符序列中包含的音素之间的边界,以检测每个音素间隔; 根据由检测到的音素间隔指示的音素对应于包含在预定音素模型中的音素的概率来计算可靠性; 基于计算的可靠性和预先训练和存储的音素识别概率分布来计算相对于字符序列的音素对准成本; 并且基于计算的音素对准成本执行音素对准以对输入的字符序列执行语音识别。 结果,可以计算相对于音素识别的音素序列的可靠性,并且可以使用计算结果来增强语音识别的性能。

    AUTOMATIC TRANSLATION METHOD AND SYSTEM BASED ON CORRESPONDING SENTENCE PATTERN
    6.
    发明申请
    AUTOMATIC TRANSLATION METHOD AND SYSTEM BASED ON CORRESPONDING SENTENCE PATTERN 失效
    自动翻译方法和系统基于相应的句型

    公开(公告)号:US20080109228A1

    公开(公告)日:2008-05-08

    申请号:US11924102

    申请日:2007-10-25

    IPC分类号: G10L11/00

    摘要: Provided are an automatic speech translation system and a method for obtaining accurate translation performance with a simple structure. Because input and output sentences are written in different languages, automatic speech translation requires techniques for processing different languages. Repetition of text processing like morpheme analysis or sentence parsing in conventional automatic speech translation can complicate the overall translation process. Meanwhile, although input and output sentences are written in different languages, they have to have the same meaning and a corresponding sentence form and words. Accordingly, the corresponding words and sentence forms of the two languages can be expressed with a simple structure and utilized in the automatic speech translation process, thereby maintaining consistency during the process and avoiding unnecessary process repetition, which reduces errors and improves performance.

    摘要翻译: 提供了一种自动语音翻译系统和一种以简单的结构获得准确的翻译性能的方法。 由于输入和输出句子是用不同的语言编写的,所以自动语音翻译需要处理不同语言的技术。 在常规自动语音翻译中重复文本处理,如语素分析或句法解析可能会使整个翻译过程复杂化。 同时,虽然输入和输出句子是用不同的语言编写的,但它们必须具有相同的含义和相应的句子形式和单词。 因此,两种语言的对应单词和句子形式可以用简单的结构表示,并在自动语音翻译过程中使用,从而保持过程中的一致性,避免不必要的过程重复,从而减少错误并提高性能。

    SPEECH UNDERSTANDING SYSTEM USING AN EXAMPLE-BASED SEMANTIC REPRESENTATION PATTERN
    7.
    发明申请
    SPEECH UNDERSTANDING SYSTEM USING AN EXAMPLE-BASED SEMANTIC REPRESENTATION PATTERN 有权
    使用基于示例的语义表示模式的语音理解系统

    公开(公告)号:US20110054883A1

    公开(公告)日:2011-03-03

    申请号:US12622060

    申请日:2009-11-19

    IPC分类号: G06F17/27

    摘要: A speech understanding apparatus includes: a speech recognition unit for recognizing an input speech to produce a speech recognition result; a sentence analysis unit for performing morpheme analysis on a sentence corresponding to the speech recognition result, extracting additional information, and performing syntax analysis; a hierarchy describing unit for describing hierarchy of the sentence; a class transformation unit for performing class transformation on the sentence; a semantic representation determination unit for marking optional expressions for the sentence, deleting meaningless expressions and the additional information, converting the sentence into its base form, and deleting morphemic tags or symbols to determine a semantic representation; a semantic representation retrieval unit for retrieving the determined semantic representation from an example-based semantic representation pattern database; and a retrieval result processing unit for selectively producing a retrieved semantic representation.

    摘要翻译: 语音理解装置包括:语音识别单元,用于识别输入语音以产生语音识别结果; 句子分析单元,用于对与语音识别结果相对应的句子执行语素分析,提取附加信息,并执行语法分析; 用于描述句子的层次结构的层次描述单元; 用于对句子进行类转换的类变换单元; 语义表示确定单元,用于标记句子的可选表达式,删除无意义表达式和附加信息,将该句子转换成其基本形式,以及删除语素标签或符号以确定语义表示; 语义表示检索单元,用于从基于示例的语义表示模式数据库中检索所确定的语义表示; 以及检索结果处理单元,用于选择性地产生检索的语义表示。

    Speech understanding system using an example-based semantic representation pattern
    8.
    发明授权
    Speech understanding system using an example-based semantic representation pattern 有权
    语言理解系统使用基于示例的语义表示模式

    公开(公告)号:US08370130B2

    公开(公告)日:2013-02-05

    申请号:US12622060

    申请日:2009-11-19

    IPC分类号: G06F17/27

    摘要: A speech understanding apparatus includes: a speech recognition unit for recognizing an input speech to produce a speech recognition result; a sentence analysis unit for performing morpheme analysis on a sentence corresponding to the speech recognition result, extracting additional information, and performing syntax analysis; a hierarchy describing unit for describing hierarchy of the sentence; a class transformation unit for performing class transformation on the sentence; a semantic representation determination unit for marking optional expressions for the sentence, deleting meaningless expressions and the additional information, converting the sentence into its base form, and deleting morphemic tags or symbols to determine a semantic representation; a semantic representation retrieval unit for retrieving the determined semantic representation from an example-based semantic representation pattern database; and a retrieval result processing unit for selectively producing a retrieved semantic representation.

    摘要翻译: 语音理解装置包括:语音识别单元,用于识别输入语音以产生语音识别结果; 句子分析单元,用于对与语音识别结果相对应的句子执行语素分析,提取附加信息,并执行语法分析; 用于描述句子的层次结构的层次描述单元; 用于对句子进行类转换的类变换单元; 语义表示确定单元,用于标记句子的可选表达式,删除无意义表达式和附加信息,将该句子转换成其基本形式,以及删除语素标签或符号以确定语义表示; 语义表示检索单元,用于从基于示例的语义表示模式数据库中检索所确定的语义表示; 以及检索结果处理单元,用于选择性地产生检索的语义表示。

    Method and apparatus for recognizing continuous speech using search space restriction based on phoneme recognition
    9.
    发明授权
    Method and apparatus for recognizing continuous speech using search space restriction based on phoneme recognition 有权
    基于音素识别的搜索空间限制识别连续语音的方法和装置

    公开(公告)号:US08032374B2

    公开(公告)日:2011-10-04

    申请号:US11950130

    申请日:2007-12-04

    IPC分类号: G10L15/04

    CPC分类号: G10L15/187 G10L2015/025

    摘要: Provided are an apparatus and method for recognizing continuous speech using search space restriction based on phoneme recognition. In the apparatus and method, a search space can be primarily reduced by restricting connection words to be shifted at a boundary between words based on the phoneme recognition result. In addition, the search space can be secondarily reduced by rapidly calculating a degree of similarity between the connection word to be shifted and the phoneme recognition result using a phoneme code and shifting the corresponding phonemes to only connection words having degrees of similarity equal to or higher than a predetermined reference value. Therefore, the speed and performance of the speech recognition process can be improved in various speech recognition services.

    摘要翻译: 提供了一种使用基于音素识别的搜索空间限制来识别连续语音的装置和方法。 在该装置和方法中,可以通过基于音素识别结果来限制在字之间的边界处被移位的连接字来主要减少搜索空间。 此外,通过使用音素码快速计算要移位的连接字和音素识别结果之间的相似度的程度,可以二次减小搜索空间,并将相应的音素移位到仅具有等于或更高相似度的相似度的连接词 比预定的参考值。 因此,可以在各种语音识别服务中提高语音识别处理的速度和性能。

    METHOD AND APPARATUS FOR RECOGNIZING CONTINUOUS SPEECH USING SEARCH SPACE RESTRICTION BASED ON PHONEME RECOGNITION
    10.
    发明申请
    METHOD AND APPARATUS FOR RECOGNIZING CONTINUOUS SPEECH USING SEARCH SPACE RESTRICTION BASED ON PHONEME RECOGNITION 有权
    使用基于语音识别的搜索空间限制来识别连续语音的方法和装置

    公开(公告)号:US20080133239A1

    公开(公告)日:2008-06-05

    申请号:US11950130

    申请日:2007-12-04

    IPC分类号: G10L15/04

    CPC分类号: G10L15/187 G10L2015/025

    摘要: Provided are an apparatus and method for recognizing continuous speech using search space restriction based on phoneme recognition. In the apparatus and method, a search space can be primarily reduced by restricting connection words to be shifted at a boundary between words based on the phoneme recognition result. In addition, the search space can be secondarily reduced by rapidly calculating a degree of similarity between the connection word to be shifted and the phoneme recognition result using a phoneme code and shifting the corresponding phonemes to only connection words having degrees of similarity equal to or higher than a predetermined reference value. Therefore, the speed and performance of the speech recognition process can be improved in various speech recognition services.

    摘要翻译: 提供了一种使用基于音素识别的搜索空间限制来识别连续语音的装置和方法。 在该装置和方法中,可以通过基于音素识别结果来限制在字之间的边界处被移位的连接字来主要减少搜索空间。 此外,通过使用音素码快速计算要移位的连接字和音素识别结果之间的相似度的程度,可以二次减小搜索空间,并将相应的音素移位到仅具有等于或更高相似度的相似度的连接词 比预定的参考值。 因此,可以在各种语音识别服务中提高语音识别处理的速度和性能。