Speech coding via speech recognition and synthesis based on pre-enrolled
phonetic tokens
    1.
    发明授权
    Speech coding via speech recognition and synthesis based on pre-enrolled phonetic tokens 失效
    基于预先录入的语音标记的语音识别和综合语音编码

    公开(公告)号:US6119086A

    公开(公告)日:2000-09-12

    申请号:US67863

    申请日:1998-04-28

    CPC分类号: G10L19/0018

    摘要: A speech coding system, responsive to an input speech signal provided by a system user, comprises: a speech coding portion including a speech recognition system responsive to the input speech signal and having a word vocabulary associated therewith, the speech recognition system recognizing the input speech signal in accordance with the vocabulary and generating phonetic tokens, such as at least one sequence of lefemes, representative of the input speech signal; a channel, responsive to the at least one sequence of lefemes, for transmitting and/or storing the at least one sequence of lefemes; and a speech synthesizing portion, responsive to the transmitted/stored sequence of lefemes, for generating a synthesized speech signal which is representative of the input speech signal provided by the system user using the at least one sequence of lefemes. The speech recognition system preferably generates acoustic parameters from the input speech signal which include voice characteristics of the system user. The speech coding system also preferably comprises a labeler which processes the input speech signal including words uttered by the system user which are not in the word vocabulary associated with the speech recognition system, the labeler generating phonetic tokens, such as at least one sequence of lefemes, optimally representative of the input speech signal. The sequence of lefemes from the labeler and the speech recognition portion are compared, for each speech segment, and the sequence most similar to the input speech is selected for transmission/storage. The speech synthesizing portion of the system preferably performs speech synthesis using pre-enrolled phonetic sub-units or tokens.

    摘要翻译: 响应于由系统用户提供的输入语音信号的语音编码系统包括:语音编码部分,包括响应于输入语音信号并具有与其相关联的词汇词汇的语音识别系统,语音识别系统识别输入语音 信号,并产生语音令牌,例如表示输入语音信号的至少一个左派序列; 响应于所述至少一个左列的序列的信道,用于发送和/或存储所述至少一个左派序列; 以及语音合成部分,响应于所发送/存储的莱佛斯序列,用于产生代表由系统用户使用至少一个左派序列提供的输入语音信号的合成语音信号。 语音识别系统优选地从包括系统用户的语音特征的输入语音信号生成声学参数。 语音编码系统还优选地包括标签器,其处理包括不在与语音识别系统相关联的词汇词汇中的由系统用户发出的单词的输入语音信号,产生语音令牌的标签器,例如至少一个lefemes序列 ,最佳地代表输入语音信号。 对于每个语音段,比较来自标签机和语音识别部分的左派序列,并且选择与输入语音最相似的序列用于传输/存储。 系统的语音合成部分优选地使用预先注册的语音子单元或令牌来执行语音合成。

    Apparatus and methods for rejecting confusible words during training associated with a speech recognition system
    2.
    发明授权
    Apparatus and methods for rejecting confusible words during training associated with a speech recognition system 有权
    用于在与语音识别系统相关的训练期间拒绝混淆词的装置和方法

    公开(公告)号:US06192337B1

    公开(公告)日:2001-02-20

    申请号:US09134259

    申请日:1998-08-14

    IPC分类号: G10L1506

    CPC分类号: G10L15/063

    摘要: A method of training at least one new word for addition to a vocabulary of a speech recognition engine containing existing words comprises the steps of: a user uttering the at least one new word; computing respective measures between the at least one newly uttered word and at least a portion of the existing vocabulary words, the respective measures indicative of acoustic similarity between the at least one word and the at least a portion of existing words; if no measure is within the threshold range, automatically adding the at least one newly uttered word to the vocabulary; and if at least one measure is within a threshold range, refraining from automatically adding the at least one newly uttered word to the vocabulary.

    摘要翻译: 训练至少一个新词以补充含有现有单词的语音识别引擎的词汇表的方法包括以下步骤:用户发出所述至少一个新单词; 计算所述至少一个新发出的字与所述现有词汇单词的至少一部分之间的相应度量,所述各个度量指示所述至少一个单词与所述至少一部分现有单词之间的声学​​相似性; 如果没有措施在阈值范围内,则自动将至少一个新发出的词添加到词汇; 并且如果至少一个测量值在阈值范围内,则避免将所述至少一个新发出的词自动添加到词汇表中。

    Apparatus and methods for identifying potential acoustic confusibility among words in a speech recognition system
    3.
    发明授权
    Apparatus and methods for identifying potential acoustic confusibility among words in a speech recognition system 有权
    用于识别语音识别系统中的单词之间潜在的声学混淆性的装置和方法

    公开(公告)号:US06185530B2

    公开(公告)日:2001-02-06

    申请号:US09134582

    申请日:1998-08-14

    IPC分类号: G10L506

    摘要: A method of determining potential acoustic confusion between at least one new word and at least a portion of existing words of a vocabulary of a speech recognition engine comprises the steps of: a user inputting the at least one new word; computing respective measures between the at least one new word and the at least a portion of existing vocabulary words, the respective measures indicative of acoustic similarity between the at least one word and the at least a portion of existing words; if at least one measure is within a threshold range, indicating results associated with the at least one measure and prompting the user to input an alternative word or additional information pertaining to the at least one new word; and if no measure is within the threshold range, adding the at least one new word to the vocabulary.

    摘要翻译: 确定至少一个新单词与语音识别引擎的词汇表的现有单词的至少一部分之间的潜在声音混淆的方法包括以下步骤:用户输入所述至少一个新单词; 计算所述至少一个新单词与所述现有词汇单词的所述至少一部分之间的相应度量,所述各个度量指示所述至少一个单词与所述至少一部分现有单词之间的声学​​相似度; 如果至少一个测量值在阈值范围内,指示与所述至少一个测量相关联的结果并且提示用户输入替代单词或与所述至少一个新单词有关的附加信息; 并且如果没有措施在阈值范围内,则将至少一个新词添加到词汇表。

    Apparatus and methods for identifying homophones among words in a speech recognition system
    5.
    发明授权
    Apparatus and methods for identifying homophones among words in a speech recognition system 有权
    用于在语音识别系统中识别单词之间的同音词的装置和方法

    公开(公告)号:US06269335B1

    公开(公告)日:2001-07-31

    申请号:US09134261

    申请日:1998-08-14

    IPC分类号: G10L2100

    CPC分类号: G10L15/22

    摘要: A method of identifying homophones of a word uttered by a user from at least a portion of existing words of a vocabulary of a speech recognition engine comprises the steps of: a user uttering the word; decoding the uttered word; computing respective measures between the decoded word and at least a portion of the other existing vocabulary words, the respective measures indicative of acoustic similarity between the word and the at least a portion of other existing words; if at least one measure is within a threshold range, indicating, to the user, results associated with the at least one measure, the results preferably including the decoded word and the other existing vocabulary word associated with the at least one measure; and the user preferably making a selection depending on the word the user intended to utter.

    摘要翻译: 从语音识别引擎的词汇表的现有单词的至少一部分中识别用户发出的单词的同音词的方法包括以下步骤:用户说出该单词; 解码发音字; 计算解码字与至少一部分其他现有词汇词之间的相应度量,所述各个度量指示词与其他现有词的至少一部分之间的声学​​相似性; 如果至少一个度量在阈值范围内,则向用户指示与至少一个度量相关联的结果,结果优选地包括与所述至少一个度量相关联的解码词和其他现有词汇单; 并且用户优选地根据用户想要发出的词进行选择。

    System and method for analysis and filtering of signals in a telecommunications network
    7.
    发明申请
    System and method for analysis and filtering of signals in a telecommunications network 失效
    用于电信网络信号分析和滤波的系统和方法

    公开(公告)号:US20050039070A1

    公开(公告)日:2005-02-17

    申请号:US10639883

    申请日:2003-08-13

    IPC分类号: H02H3/05 H04B3/20

    CPC分类号: H04B3/20

    摘要: A system and method for signal analysis in a network. The method includes attempting, by a first processor, to compute optimal coefficients for filtering a signal, determining that computing the optimal coefficients exceeds the computational capabilities of the first processor, notifying a second processor that computing the optimal coefficients exceeds the computational capabilities of the first processor, and computing, by the second processor, the optimal coefficients. The system and method account for limited computational resources allocated to certain processors in a telecommunications system.

    摘要翻译: 一种用于网络中信号分析的系统和方法。 该方法包括由第一处理器尝试计算用于对信号进行滤波的最佳系数,确定计算最佳系数超过第一处理器的计算能力,通知第二处理器计算最佳系数超过第一处理器的计算能力 处理器和由第二处理器计算最佳系数。 该系统和方法考虑到分配给电信系统中某些处理器的有限的计算资源。

    Systems and methods for processing audio using multiple speech technologies
    10.
    发明申请
    Systems and methods for processing audio using multiple speech technologies 有权
    使用多种语音技术处理音频的系统和方法

    公开(公告)号:US20070124360A1

    公开(公告)日:2007-05-31

    申请号:US11497995

    申请日:2006-08-02

    IPC分类号: G06F15/16

    摘要: An audio splitting system for sharing speech data associated with the same utterance between multiple speech technologies (consumers). In one aspect, the system comprises one or more queues for storing data, a plurality of consumers each sharing the data stored in the one or more queues and a scheduler for managing the storage of the data in the one or more queues and the consumption of the data in the one or more queues by each of the plurality of consumers. The consumers will register their data requirements and priority requests with the scheduler. The scheduler assigns each of the plurality of consumers to one or more of the queues based on the registered data requirements.

    摘要翻译: 一种音频分割系统,用于共享与多个语音技术(消费者)之间的相同话语相关联的语音数据。 在一个方面,该系统包括用于存储数据的一个或多个队列,每个共享存储在一个或多个队列中的数据的多个消费者和用于管理一个或多个队列中的数据的存储的调度器以及消费 多个消费者中的每一个的一个或多个队列中的数据。 消费者将通过调度器注册其数据要求和优先级请求。 调度器基于注册的数据要求将多个消费者中的每一个分配给一个或多个队列。