SEARCH-BASED DYNAMIC VOICE ACTIVATION
    1.
    发明申请
    SEARCH-BASED DYNAMIC VOICE ACTIVATION 审中-公开
    基于搜索的动态语音激活

    公开(公告)号:US20090172546A1

    公开(公告)日:2009-07-02

    申请号:US12126077

    申请日:2008-05-23

    IPC分类号: G06F3/16

    摘要: A method, apparatus, and electronic device for voice navigation are disclosed. A voice input mechanism 310 may receive a verbal input from a user to a voice user interface program invisible to the user. A processor 104 may identify in a graphical user interface (GUI) a set of GUI items. The processor 104 may convert the set of GUI items to a set of voice searchable indices 400. The processor 104 may correlate a matching GUI item of the set of GUI items to a phonemic representation of the verbal input.

    摘要翻译: 公开了用于语音导航的方法,装置和电子装置。 语音输入机构310可以接收来自用户的语音输入到用户不可见的语音用户界面程序。 处理器104可以在图形用户界面(GUI)中识别一组GUI项目。 处理器104可以将该GUI项目集合转换成一组语音可搜索索引400.处理器104可以将该GUI项目集合中的匹配GUI项目与口头输入的音位表示相关联。

    Speech dialog method and system
    2.
    发明申请
    Speech dialog method and system 有权
    语音对话方法和系统

    公开(公告)号:US20060247921A1

    公开(公告)日:2006-11-02

    申请号:US11118670

    申请日:2005-04-29

    IPC分类号: G10L11/04

    摘要: An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs voice recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.

    摘要翻译: 一种用于语音对话的电子设备(300)包括接收(305,105)语音短语的功能,该语音短语包括包含实例化变量(215)的请求短语,产生(335,115)音调和语音特征(315) 并且执行所述实例化变量的语音识别(319,125)以确定最可能的一组声学状态(235)。 电子设备可以使用最可能的声学状态集合和实例化变量的音调和语音特征来生成(335,140)实例化变量的合成值。 电子设备可以使用已经被确定为唯一的先前输入的变量值的表,并且其中值与最可能的一组声学状态相关联,并且在接收每个值时确定的音高和发声特性 消除歧义(425,430)一个新接收的实例变量。

    Personal synergic filtering of multimodal inputs
    3.
    发明申请
    Personal synergic filtering of multimodal inputs 审中-公开
    个人协同过滤多模态输入

    公开(公告)号:US20070106506A1

    公开(公告)日:2007-05-10

    申请号:US11268113

    申请日:2005-11-07

    IPC分类号: G10L15/00

    摘要: A method and apparatus is provided for identifying an input sequence entered by a user of a communication unit. The method includes the steps of providing a database containing a plurality of partial sequences from the user of the communication unit, recognizing an identity of at least some information items of the input sequence entered by the user, comparing the recognized sequence of information items with the plurality of partial sequences within the database and selecting a partial sequence of the plurality of sequences within the database with a closest relative match to the recognized sequence as the input sequence intended by the user.

    摘要翻译: 提供了一种用于识别由通信单元的用户输入的输入序列的方法和装置。 该方法包括以下步骤:从通信单元的用户提供包含多个部分序列的数据库,识别用户输入的输入序列的至少一些信息项的标识,将识别的信息项序列与 数据库内的多个部分序列,并且以数据库中的多个序列的部分序列与所识别的序列的最接近的相对匹配来选择用户所期望的输入序列。

    Content item retrieval based on a free text entry
    4.
    发明授权
    Content item retrieval based on a free text entry 失效
    基于自由文本输入的内容项检索

    公开(公告)号:US08041700B2

    公开(公告)日:2011-10-18

    申请号:US12419341

    申请日:2009-04-07

    申请人: Changxue Ma

    发明人: Changxue Ma

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30675

    摘要: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.

    摘要翻译: 本文提供了一种用于文本搜索数据库的方法和装置。 在操作期间,用户将输入一个字母到搜索引擎。 搜索引擎将根据最高得分字的字母和显示结果对单词进行分数。 再次收到另一封信,重复过程。 在将标题返回给用户的情况下,会发生将单词与标题相关联并对标题进行评分的其他步骤。 作为显示结果,向用户提供最高分的标题。

    METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY
    5.
    发明申请
    METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY 审中-公开
    用于订购查询结果的方法和装置

    公开(公告)号:US20110071826A1

    公开(公告)日:2011-03-24

    申请号:US12564968

    申请日:2009-09-23

    IPC分类号: G10L15/26 G06F17/30

    CPC分类号: G10L15/083 G06F16/3343

    摘要: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from the N-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings are sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.

    摘要翻译: 本文提供了一种用于排序查询结果的方法和装置。 在操作期间,接收到口语查询并将其转换为文本表示,例如单词格。 搜索字符串然后从单词格中创建。 例如,可以从单词格的N克(例如单字母和双字母)创建集合搜索字符串。 搜索字符串可以基于由语音识别系统分配给n-gram的置信度来排序和截断。 搜索字符串集合被发送到至少一个搜索引擎,并且获得搜索结果。 然后基于搜索结果和单词格之间的语义相似度重新排列或重新排序搜索结果。

    Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery
    6.
    发明申请
    Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery 有权
    使用Uniterm发现的语音搜索存储内容的方法和装置

    公开(公告)号:US20090210226A1

    公开(公告)日:2009-08-20

    申请号:US12032258

    申请日:2008-02-15

    申请人: Changxue Ma

    发明人: Changxue Ma

    IPC分类号: G10L15/08

    摘要: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.

    摘要翻译: 一种用于通过分配给各个内容的音频标签启用语音到语音搜索和排序内容检索的方法,系统和通信设备,该标签生成与语音查询的组件匹配的单位。 该方法包括存储内容并且将具有音频标签的内容中的至少一个标记。 该方法还包括接收语音查询以检索存储在设备上的内容。 当接收到语音查询时,该方法使用音频标签的单位完成语音到语音搜索,对由语音查询产生的音素潜在网格模型进行评分,以识别音频标签内的匹配项和对应的存储内容。 与所识别的音频标签相关联的检索到的内容,其具有在音素格子模型内得分的单位格式,其顺序与语音查询内的单元格结构的顺序相对应地输出。

    System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device
    8.
    发明申请
    System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device 有权
    将移动设备用作便携式字符输入外围设备的系统和方法

    公开(公告)号:US20090066541A1

    公开(公告)日:2009-03-12

    申请号:US11853912

    申请日:2007-09-12

    IPC分类号: G06F3/023

    CPC分类号: G06F3/0231 G06F3/0237

    摘要: A portable electronic communication device, designed for voice and data communication is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device. A character input recognition utility executes on the first device to provide the functions of: detecting an input on the touch screen input mechanism; generating an electronic representation of the input; establishing a communication link between the second communication transmitter and an identified second device; and forwarding the electronic representation of the character input to the communication transmitter for transmission to the identified second device.

    摘要翻译: 用于语音和数据通信的便携式电子通信设备被用作用于将输入到第一设备的触摸输入机构中的字符输入发送/提供给第二电子设备的外围输入设备。 第一设备具有模式切换实用程序,用于在第一标准通信模式和第二外围输入设备模式之间切换第一设备。 当第一设备处于第二外设输入设备模式时,第一设备作为第二设备的外围输入设备工作。 字符输入识别实用程序在第一设备上执行以提供以下功能:检测触摸屏输入机构上的输入; 生成输入的电子表示; 建立第二通信发射机和识别的第二设备之间的通信链路; 以及将所述字符输入的电子表示转发到所述通信发射机以便发送到所识别的第二设备。

    METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION
    9.
    发明申请
    METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION 审中-公开
    用于管理语音应用中的发音词典的方法和系统

    公开(公告)号:US20070239455A1

    公开(公告)日:2007-10-11

    申请号:US11278983

    申请日:2006-04-07

    IPC分类号: G10L13/08

    CPC分类号: G10L15/187 G10L13/08

    摘要: A voice toolkit (100) and a method (700) for managing pronunciation dictionaries are provided. The visual toolkit can include a user-interface (110) for entering in a text and a corresponding spoken utterance, a text-to-speech system (120) for synthesizing a pronunciation from the text, a talking speech recognizer (132) for generating pronunciations of the spoken utterance, and a voice processor (130) for validating at least one pronunciation. A developer can type a text of a word into the toolkit and listen to the pronunciation to determine whether the pronunciation is acceptable. If the pronunciation is incorrect the developer can speak the word for providing a spoken utterance having a correct pronunciation.

    摘要翻译: 提供了一种用于管理发音词典的语音工具包(100)和方法(700)。 视觉工具包可以包括用于输入文本的用户界面(110)和对应的说话话语,用于从文本合成发音的文本到语音系统(120),用于生成语音识别器(132) 讲话语音的发音,以及用于验证至少一个发音的语音处理器(130)。 开发人员可以在工具包中输入单词的文字,并听发音来确定发音是否可以接受。 如果发音不正确,开发人员可以说出提供具有正确发音的口语发音。

    Noise reduction on wireless headset input via dual channel calibration within mobile phone
    10.
    发明授权
    Noise reduction on wireless headset input via dual channel calibration within mobile phone 有权
    无线耳机输入通过手机双通道校准降噪

    公开(公告)号:US07983428B2

    公开(公告)日:2011-07-19

    申请号:US11746455

    申请日:2007-05-09

    申请人: Changxue Ma Chen Liu

    发明人: Changxue Ma Chen Liu

    IPC分类号: H04B15/00

    CPC分类号: H04M9/082 H04M1/6066

    摘要: A communication device includes: (1) a wireless adapter at which a wireless headset is communicatively connected to the communication device and at which is received a first acoustic input that includes a speech input and a first ambient noise input; (2) a microphone that receives a second acoustic input, which includes a second ambient noise input; and (3) a dual-channel adaptive noise canceller that utilizes the second ambient noise input to filter the first ambient noise input out of the first acoustic input to generate an acoustic output that primarily comprises the speech input.

    摘要翻译: 通信设备包括:(1)无线适配器,其中无线耳机通信地连接到通信设备,并且其中接收到包括语音输入和第一环境噪声输入的第一声输入; (2)麦克风,其接收包括第二环境噪声输入的第二声输入; 以及(3)双通道自适应噪声消除器,其利用所述第二环境噪声输入来对从所述第一声输入中输出的所述第一环境噪声进行滤波,以产生主要包括所述语音输入的声输出。