Tailored speaker-independent voice recognition system
    1.
    发明申请
    Tailored speaker-independent voice recognition system 有权
    量身定制的与扬声器无关的语音识别系统

    公开(公告)号:US20060085186A1

    公开(公告)日:2006-04-20

    申请号:US10967957

    申请日:2004-10-19

    申请人: Changxue Ma Yan Cheng

    发明人: Changxue Ma Yan Cheng

    IPC分类号: G10L15/08

    CPC分类号: G10L15/063 G10L2015/0631

    摘要: A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each transcription (373) having a probability factor (375) and an indicator (377) of whether the transcription is active. When a speech utterance is received (510), the voice recognition system determines (520, 530) the word signified by the speech utterance, evaluates (540) the speech utterance against the transcriptions of the correct word, updates (550) the probability factors for each transcription, and inactivates (570) any transcription that has an updated probability factor that is less than a threshold.

    摘要翻译: 定制的与扬声器无关的语音识别系统具有至少一个单词(371)的语音识别词典(360)。 该字(371)具有至少两个转录(373),每个转录(373)具有概率因子(375)和指示符(377)是否转录是活性的。 当接收到语音话语(510)时,语音识别系统确定(520,530)由语音发音表示的单词,根据正确单词的转录评估(540)语音发音,更新(550)概率因子 对于每个转录,并使(570)任何具有小于阈值的更新概率因子的转录失活。

    High quality speech reconstruction for a dialog method and system
    2.
    发明申请
    High quality speech reconstruction for a dialog method and system 审中-公开
    对话方法和系统的高质量语音重建

    公开(公告)号:US20070129946A1

    公开(公告)日:2007-06-07

    申请号:US11294964

    申请日:2005-12-06

    IPC分类号: G10L15/14

    摘要: An electronic device (400) for speech dialog includes functions that receive (405, 205) a speech phrase that includes an instantiated variable (315), generate pitch and voicing characteristics (330) of the instantiated variable, and performs voice recognition (410, 220) of the instantiated variable to determine a most likely set of recognition acoustic states (335). A trained map (358) is established (115) that maps recognition feature vectors derived from training speech (105) to synthesis feature vectors derived from the same training speech (110). Recognition feature vectors that represent the most likely set of recognition acoustic states for the recognized instantiated variable are converted to a most likely set of synthesis acoustic states (420) in accordance with the map. The electronic device may generate (421, 440, 445) a synthesized value of the instantiated variable using the most likely set of synthesis acoustic states and the pitch and voicing characteristics extracted from the instantiated variable.

    摘要翻译: 一种用于语音对话的电子设备(400)包括接收(405,205)包括实例变量(315)的语音短语的功能,产生所述实例化变量的音调和语音特征(330),并且执行语音识别(410, 220),以确定最可能的识别声学状态集合(335)。 建立训练图(358)(115),其将从训练语音(105)导出的识别特征向量映射到从相同训练语音(110)导出的合成特征向量。 表示识别的实例化变量的最可能的识别声学状态集合的识别特征向量根据该映射被转换成最可能的一组合成声学状态(420)。 电子设备可以使用最可能的合成声学状态集合和从实例变量提取的音高和发声特性来生成(421,440,445)所述实例化变量的合成值。

    Speech dialog method and system
    3.
    发明申请
    Speech dialog method and system 有权
    语音对话方法和系统

    公开(公告)号:US20060247921A1

    公开(公告)日:2006-11-02

    申请号:US11118670

    申请日:2005-04-29

    IPC分类号: G10L11/04

    摘要: An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs voice recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.

    摘要翻译: 一种用于语音对话的电子设备(300)包括接收(305,105)语音短语的功能,该语音短语包括包含实例化变量(215)的请求短语,产生(335,115)音调和语音特征(315) 并且执行所述实例化变量的语音识别(319,125)以确定最可能的一组声学状态(235)。 电子设备可以使用最可能的声学状态集合和实例化变量的音调和语音特征来生成(335,140)实例化变量的合成值。 电子设备可以使用已经被确定为唯一的先前输入的变量值的表,并且其中值与最可能的一组声学状态相关联,并且在接收每个值时确定的音高和发声特性 消除歧义(425,430)一个新接收的实例变量。

    Method and apparatus for generating a voice tag
    4.
    发明申请
    Method and apparatus for generating a voice tag 审中-公开
    用于生成语音标签的方法和装置

    公开(公告)号:US20060287867A1

    公开(公告)日:2006-12-21

    申请号:US11155944

    申请日:2005-06-17

    申请人: Yan Cheng Changxue Ma

    发明人: Yan Cheng Changxue Ma

    IPC分类号: G10L21/00

    摘要: A method and apparatus for generating a voice tag (140) includes a means (110) for combining (205) a plurality of utterances (106, 107, 108) into a combined utterance (111) and a means (120) for extraction (210) of the voice tag as a sequence of phonemes having a high likelihood of representing the combined utterance, using a set of stored phonemes (115) and the combined utterance.

    摘要翻译: 一种用于生成语音标签(140)的方法和装置包括:用于将多个话语(106,107,108)组合(205)到组合话语(111)中的装置(110)和用于提取的装置(120) 210)作为具有表示组合发音的高可能性的音素序列,使用一组存储的音素(115)和组合的话语。

    Method and system for interpreting verbal inputs in multimodal dialog system
    5.
    发明申请
    Method and system for interpreting verbal inputs in multimodal dialog system 有权
    在多模态对话系统中解释口头输入的方法和系统

    公开(公告)号:US20060229862A1

    公开(公告)日:2006-10-12

    申请号:US11100185

    申请日:2005-04-06

    IPC分类号: G06F17/28

    摘要: A method, a system and a computer program product for interpreting a verbal input in a multimodal dialog system are provided. The method includes assigning (302) a confidence value to at least one word generated by a verbal recognition component. The method further includes generating (304) a semantic unit confidence score for the verbal input. The generation of a semantic unit confidence score is based on the confidence value of at least one word and at least one semantic confidence operator.

    摘要翻译: 提供了一种用于在多模式对话系统中解释口头输入的方法,系统和计算机程序产品。 该方法包括将置信度值(302)分配(302)至由语言识别组件生成的至少一个词。 该方法还包括为语言输入生成(304)语义单位置信度得分。 语义单位置信度得分的产生基于至少一个单词和至少一个语义置信度运算符的置信度值。

    Voice quality control for high quality speech reconstruction
    6.
    发明申请
    Voice quality control for high quality speech reconstruction 审中-公开
    高质量语音重建的语音质量控制

    公开(公告)号:US20070129945A1

    公开(公告)日:2007-06-07

    申请号:US11294959

    申请日:2005-12-06

    IPC分类号: G10L15/04

    CPC分类号: G10L25/69 G10L15/26

    摘要: A method and apparatus are provided for reproducing a speech sequence of a user through a communication device of the user. The method includes the steps of detecting a speech sequence from the user through the communication device, recognizing a phoneme sequence within the detected speech sequence and forming a confidence level of each phoneme within the recognized phoneme sequence. The method further includes the steps of audibly reproducing the recognized phoneme sequence for the user through the communication device and gradually highlighting or degrading a voice quality of at least some phonemes of the recognized phoneme sequence based upon the formed confidence level of the at least some phonemes.

    摘要翻译: 提供了一种用于通过用户的通信设备再现用户的语音序列的方法和装置。 该方法包括以下步骤:通过通信设备检测来自用户的语音序列,识别检测到的语音序列内的音素序列,并形成识别的音素序列内每个音素的置信度。 该方法还包括以下步骤:通过通信设备可听地再现用户的识别音素序列,并基于所形成的至少一些音素的置信水平逐渐突出或降低所识别的音素序列的至少一些音素的语音质量 。

    Modifying a user account during an authentication process
    8.
    发明授权
    Modifying a user account during an authentication process 有权
    在身份验证过程中修改用户帐户

    公开(公告)号:US08671442B2

    公开(公告)日:2014-03-11

    申请号:US13176687

    申请日:2011-07-05

    CPC分类号: G06F21/31 G06F2221/2131

    摘要: Techniques are described for repairing some types of user account problems that interfere with granting a user access to a computer system and doing so during a process to authenticate the user in a way that does not require the user to re-enter authentication information or require the user to restart a communication session with the computer system. In response to a determination that a user's account has a problem during an authentication process, techniques are provided to enable a user to execute an appropriate process or processes to fix the user account, after which the authentication process continues. In this way, the correction to the user account may appear to be seamless to the user.

    摘要翻译: 描述了修复某些类型的用户帐户问题的技术,这些问题干扰授予用户对计算机系统的访问,并且在以不需要用户重新输入认证信息或要求用户身份的方式认证用户的过程中这样做 用户重新启动与计算机系统的通信会话。 响应于在认证过程中确定用户的帐户存在问题,提供了技术以使得用户能够执行适当的过程或处理来修复用户帐户,之后认证过程继续进行。 以这种方式,用户帐户的更正可能看起来与用户无缝。

    Pyrazolyl acrylonitrile compounds and uses thereof
    9.
    发明授权
    Pyrazolyl acrylonitrile compounds and uses thereof 有权
    吡唑基丙烯腈化合物及其用途

    公开(公告)号:US08455532B2

    公开(公告)日:2013-06-04

    申请号:US13265010

    申请日:2010-04-27

    IPC分类号: A01N43/56 C07D231/12

    摘要: A kind of pyrazolyl acrylniitrile compounds represented by the structures of formula I or stereoisomers thereof are disclosed in the present invention. Where in: R1 is selected from the group of substituents consisting of H, C1-C4 alkoxy C1-C2 alkyl, C3-C5 alkenyloxy C1-C2 alkyl, C3-C5 alknyloxy C1-C2 alkyl, C1-C4 alkylthio C1-C2 alkyl, C1-C5 alkyl carbonyl, C3-C8 cycloalkyl carbonyl, C1-C5 alkoxy carbonyl or C1-C5 alkylthio carbonyl; R2 is Cl or methyl; R3 is H, methyl, CN, NO2 or halogen. Or its stereoisomers.The Formula I compounds have high insecticidal activities or acaricidal activities, so they can be used as insecticide or acaricide.

    摘要翻译: 在本发明中公开了一种由式I结构或其立体异构体表示的吡唑基丙烯腈化合物。 其中:R1选自H,C1-C4烷氧基C1-C2烷基,C3-C5烯氧基C1-C2烷基,C3-C5烷氧基C1-C2烷基,C1-C4烷硫基C1-C2烷基 ,C 1 -C 5烷基羰基,C 3 -C 8环烷基羰基,C 1 -C 5烷氧基羰基或C 1 -C 5烷硫基羰基; R2是Cl或甲基; R3是H,甲基,CN,NO2或卤素。 或其立体异构体。 式I化合物具有高杀虫活性或杀螨活性,因此可用作杀虫剂或杀螨剂。

    Device and method for determining a user-desired mode of inputting speech
    10.
    发明申请
    Device and method for determining a user-desired mode of inputting speech 审中-公开
    用于确定用户期望的语音输入模式的装置和方法

    公开(公告)号:US20070129098A1

    公开(公告)日:2007-06-07

    申请号:US11295198

    申请日:2005-12-06

    IPC分类号: H04M1/00 H04B1/38

    CPC分类号: H04M1/7258 H04M2250/74

    摘要: A device and method of detecting a mode of inputting speech to a wireless device includes a processor (220) communicatively coupled to a user input (218), an audio input (206), a timer (211), and a speech processor (230). The processor (220) monitors the user input (218) and, upon detection of a first change in state of the user input (218), opens an input channel from the audio input (206) to the speech processor (230), monitors the timer (211) for an elapsed time, and monitors the user input (218) for a second change of state. Upon detection of the second change of state after a predetermined amount of time elapses, the input channel is closed. Upon detection of the second change of state before a predetermined amount of time elapses, the user input is monitored for a third change of state and upon detecting the third change of state, the input channel is closed.

    摘要翻译: 一种检测向无线设备输入语音的模式的设备和方法包括通信地耦合到用户输入(218),音频输入(206),定时器(211)和语音处理器(230)的处理器(220) )。 处理器(220)监视用户输入(218),并且在检测到用户输入(218)的状态的第一次改变时,将从音频输入(206)输入到声音处理器(230)的输入通道打开,监视器 定时器(211),并且监视用户输入(218)以进行第二次状态改变。 在经过预定时间量之后检测到第二状态变化,输入通道关闭。 当在经过预定时间量之前检测到第二状态变化时,监视用户输入第三状态变化,并且在检测到第三状态改变后,输入通道关闭。