Spoken language system
    1.
    发明申请
    Spoken language system 审中-公开
    口语系统

    公开(公告)号:US20050027523A1

    公开(公告)日:2005-02-03

    申请号:US10631256

    申请日:2003-07-31

    IPC分类号: G10L15/22 G10L19/14

    CPC分类号: G10L15/22

    摘要: A spoken language system (100) includes a recognition component (120) that generates (220) a recognized sequence of words from a sequence of received spoken words, and assigns (225) a confidence score to each word in the recognized sequence of words. A presentation component (140) of the spoken language system adjusts (240) nominal acoustical properties of words in a presentation (142) of the recognized sequence of words, the adjustment performed according to the confidence score of each word. The adjustments include adjustments to acoustical features and acoustical contexts of words and groups of words in the presented sequence of words. The presentation component presents (245) the adjusted sequence of words.

    摘要翻译: 口语系统(100)包括识别组件(120),其从接收到的语音单词的序列中生成(220)识别的单词序列,并且将所识别的单词序列中的每个单词赋予(225)置信度分数(225)。 口语系统的呈现组件(140)根据所识别的单词序列的呈现(142)来调整(240)单词的标称声学特性,根据每个单词的置信度进行调整。 调整包括对所提出的单词序列中的单词和单词组的声学特征和声学上下文的调整。 演示组件呈现(245)调整的单词序列。

    Content item retrieval based on a free text entry
    2.
    发明授权
    Content item retrieval based on a free text entry 失效
    基于自由文本输入的内容项检索

    公开(公告)号:US08041700B2

    公开(公告)日:2011-10-18

    申请号:US12419341

    申请日:2009-04-07

    申请人: Changxue Ma

    发明人: Changxue Ma

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30675

    摘要: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.

    摘要翻译: 本文提供了一种用于文本搜索数据库的方法和装置。 在操作期间,用户将输入一个字母到搜索引擎。 搜索引擎将根据最高得分字的字母和显示结果对单词进行分数。 再次收到另一封信,重复过程。 在将标题返回给用户的情况下,会发生将单词与标题相关联并对标题进行评分的其他步骤。 作为显示结果,向用户提供最高分的标题。

    METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY
    3.
    发明申请
    METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY 审中-公开
    用于订购查询结果的方法和装置

    公开(公告)号:US20110071826A1

    公开(公告)日:2011-03-24

    申请号:US12564968

    申请日:2009-09-23

    IPC分类号: G10L15/26 G06F17/30

    CPC分类号: G10L15/083 G06F16/3343

    摘要: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from the N-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings are sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.

    摘要翻译: 本文提供了一种用于排序查询结果的方法和装置。 在操作期间,接收到口语查询并将其转换为文本表示,例如单词格。 搜索字符串然后从单词格中创建。 例如,可以从单词格的N克(例如单字母和双字母)创建集合搜索字符串。 搜索字符串可以基于由语音识别系统分配给n-gram的置信度来排序和截断。 搜索字符串集合被发送到至少一个搜索引擎,并且获得搜索结果。 然后基于搜索结果和单词格之间的语义相似度重新排列或重新排序搜索结果。

    Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery
    4.
    发明申请
    Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery 有权
    使用Uniterm发现的语音搜索存储内容的方法和装置

    公开(公告)号:US20090210226A1

    公开(公告)日:2009-08-20

    申请号:US12032258

    申请日:2008-02-15

    申请人: Changxue Ma

    发明人: Changxue Ma

    IPC分类号: G10L15/08

    摘要: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.

    摘要翻译: 一种用于通过分配给各个内容的音频标签启用语音到语音搜索和排序内容检索的方法,系统和通信设备,该标签生成与语音查询的组件匹配的单位。 该方法包括存储内容并且将具有音频标签的内容中的至少一个标记。 该方法还包括接收语音查询以检索存储在设备上的内容。 当接收到语音查询时,该方法使用音频标签的单位完成语音到语音搜索,对由语音查询产生的音素潜在网格模型进行评分,以识别音频标签内的匹配项和对应的存储内容。 与所识别的音频标签相关联的检索到的内容,其具有在音素格子模型内得分的单位格式,其顺序与语音查询内的单元格结构的顺序相对应地输出。

    System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device
    6.
    发明申请
    System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device 有权
    将移动设备用作便携式字符输入外围设备的系统和方法

    公开(公告)号:US20090066541A1

    公开(公告)日:2009-03-12

    申请号:US11853912

    申请日:2007-09-12

    IPC分类号: G06F3/023

    CPC分类号: G06F3/0231 G06F3/0237

    摘要: A portable electronic communication device, designed for voice and data communication is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device. A character input recognition utility executes on the first device to provide the functions of: detecting an input on the touch screen input mechanism; generating an electronic representation of the input; establishing a communication link between the second communication transmitter and an identified second device; and forwarding the electronic representation of the character input to the communication transmitter for transmission to the identified second device.

    摘要翻译: 用于语音和数据通信的便携式电子通信设备被用作用于将输入到第一设备的触摸输入机构中的字符输入发送/提供给第二电子设备的外围输入设备。 第一设备具有模式切换实用程序,用于在第一标准通信模式和第二外围输入设备模式之间切换第一设备。 当第一设备处于第二外设输入设备模式时,第一设备作为第二设备的外围输入设备工作。 字符输入识别实用程序在第一设备上执行以提供以下功能:检测触摸屏输入机构上的输入; 生成输入的电子表示; 建立第二通信发射机和识别的第二设备之间的通信链路; 以及将所述字符输入的电子表示转发到所述通信发射机以便发送到所识别的第二设备。

    METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION
    7.
    发明申请
    METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION 审中-公开
    用于管理语音应用中的发音词典的方法和系统

    公开(公告)号:US20070239455A1

    公开(公告)日:2007-10-11

    申请号:US11278983

    申请日:2006-04-07

    IPC分类号: G10L13/08

    CPC分类号: G10L15/187 G10L13/08

    摘要: A voice toolkit (100) and a method (700) for managing pronunciation dictionaries are provided. The visual toolkit can include a user-interface (110) for entering in a text and a corresponding spoken utterance, a text-to-speech system (120) for synthesizing a pronunciation from the text, a talking speech recognizer (132) for generating pronunciations of the spoken utterance, and a voice processor (130) for validating at least one pronunciation. A developer can type a text of a word into the toolkit and listen to the pronunciation to determine whether the pronunciation is acceptable. If the pronunciation is incorrect the developer can speak the word for providing a spoken utterance having a correct pronunciation.

    摘要翻译: 提供了一种用于管理发音词典的语音工具包(100)和方法(700)。 视觉工具包可以包括用于输入文本的用户界面(110)和对应的说话话语,用于从文本合成发音的文本到语音系统(120),用于生成语音识别器(132) 讲话语音的发音,以及用于验证至少一个发音的语音处理器(130)。 开发人员可以在工具包中输入单词的文字,并听发音来确定发音是否可以接受。 如果发音不正确,开发人员可以说出提供具有正确发音的口语发音。

    Noise reduction on wireless headset input via dual channel calibration within mobile phone
    8.
    发明授权
    Noise reduction on wireless headset input via dual channel calibration within mobile phone 有权
    无线耳机输入通过手机双通道校准降噪

    公开(公告)号:US07983428B2

    公开(公告)日:2011-07-19

    申请号:US11746455

    申请日:2007-05-09

    申请人: Changxue Ma Chen Liu

    发明人: Changxue Ma Chen Liu

    IPC分类号: H04B15/00

    CPC分类号: H04M9/082 H04M1/6066

    摘要: A communication device includes: (1) a wireless adapter at which a wireless headset is communicatively connected to the communication device and at which is received a first acoustic input that includes a speech input and a first ambient noise input; (2) a microphone that receives a second acoustic input, which includes a second ambient noise input; and (3) a dual-channel adaptive noise canceller that utilizes the second ambient noise input to filter the first ambient noise input out of the first acoustic input to generate an acoustic output that primarily comprises the speech input.

    摘要翻译: 通信设备包括:(1)无线适配器,其中无线耳机通信地连接到通信设备,并且其中接收到包括语音输入和第一环境噪声输入的第一声输入; (2)麦克风,其接收包括第二环境噪声输入的第二声输入; 以及(3)双通道自适应噪声消除器,其利用所述第二环境噪声输入来对从所述第一声输入中输出的所述第一环境噪声进行滤波,以产生主要包括所述语音输入的声输出。

    SEARCH-BASED DYNAMIC VOICE ACTIVATION
    9.
    发明申请
    SEARCH-BASED DYNAMIC VOICE ACTIVATION 审中-公开
    基于搜索的动态语音激活

    公开(公告)号:US20090172546A1

    公开(公告)日:2009-07-02

    申请号:US12126077

    申请日:2008-05-23

    IPC分类号: G06F3/16

    摘要: A method, apparatus, and electronic device for voice navigation are disclosed. A voice input mechanism 310 may receive a verbal input from a user to a voice user interface program invisible to the user. A processor 104 may identify in a graphical user interface (GUI) a set of GUI items. The processor 104 may convert the set of GUI items to a set of voice searchable indices 400. The processor 104 may correlate a matching GUI item of the set of GUI items to a phonemic representation of the verbal input.

    摘要翻译: 公开了用于语音导航的方法,装置和电子装置。 语音输入机构310可以接收来自用户的语音输入到用户不可见的语音用户界面程序。 处理器104可以在图形用户界面(GUI)中识别一组GUI项目。 处理器104可以将该GUI项目集合转换成一组语音可搜索索引400.处理器104可以将该GUI项目集合中的匹配GUI项目与口头输入的音位表示相关联。

    METHOD AND APPARATUS FOR DETECTING AFFECTS IN SPEECH
    10.
    发明申请
    METHOD AND APPARATUS FOR DETECTING AFFECTS IN SPEECH 审中-公开
    用于检测语音影响的方法和装置

    公开(公告)号:US20070192097A1

    公开(公告)日:2007-08-16

    申请号:US11275350

    申请日:2006-02-14

    IPC分类号: G10L15/00

    CPC分类号: G10L25/48

    摘要: A method and apparatus for speaker independent real-time affect detection includes generating (205) a sequence of audio frames from a segment of speech, generating (210) a sequence of feature sets by generating a feature set for each frame, and applying (215) the sequence of feature sets to a sequential classifier to determine a most likely affect expressed in the segment of speech.

    摘要翻译: 一种用于独立于扬声器的实时影响检测的方法和装置,包括从语音段产生(205)音频帧序列(205),生成(210)特征集序列,生成每个帧的特征集,并应用 )特征集的序列到顺序分类器以确定在语音段中表达的最可能的影响。