Graphic user interface schemes for supporting speech recognition input systems
    1.
    发明授权
    Graphic user interface schemes for supporting speech recognition input systems 有权
    用于支持语音识别输入系统的图形用户界面方案

    公开(公告)号:US07742923B2

    公开(公告)日:2010-06-22

    申请号:US10950020

    申请日:2004-09-24

    IPC分类号: G10L21/00

    CPC分类号: G06F3/0481

    摘要: A numbering scheme is disclosed for implementation in the context of an application display. A user is able to select an item on the display by speaking a number corresponding to a desired control item. In some cases, the screen can include so many numbers that the user loses context and is unable to identify which number they want to select. For this reason, in one embodiment, a temporal switching mechanism is implemented wherein periodic switches (e.g., second-long intervals) occur between showing numbered items and showing a non-numbered screen. In one embodiment, an optional secondary confirmation step is implemented wherein the user sees only the item they just selected and has the chance to (a) learn the programmatic name of the item they selected and/or (b) either confirm and proceed with their selection, or cancel. In one embodiment, the optional secondary confirmation step is omitted if the user speaks a number followed by a predetermined command word.

    摘要翻译: 公开了用于在应用程序显示的上下文中实现的编号方案。 用户能够通过说出与期望的控制项目对应的号码来选择显示器上的项目。 在某些情况下,屏幕可以包含如此多的数字,用户丢失上下文,并且无法识别他们想要选择的号码。 为此,在一个实施例中,实现了时间切换机制,其中在显示编号的项目之间出现周期性的开关(例如,第二长的间隔),并显示非编号的屏幕。 在一个实施例中,实现可选的辅助确认步骤,其中用户仅看到他们刚刚选择的项目,并且有机会(a)学习他们选择的项目的编程名称和/或(b)确认并继续进行他们的 选择或取消。 在一个实施例中,如果用户说出一个后面是预定命令字的号码,则省略可选的辅助确认步骤。

    Microphone feedback and control
    2.
    发明授权
    Microphone feedback and control 有权
    麦克风反馈和控制

    公开(公告)号:US07643999B2

    公开(公告)日:2010-01-05

    申请号:US10996770

    申请日:2004-11-24

    IPC分类号: G10L21/06

    CPC分类号: G10L15/22 G06F3/0481

    摘要: A system and method for positioning a software User Interface (UI) window on a display screen is provided, wherein the method includes displaying the software UI window on the display screen and identifying at least one suitable location on the display screen responsive to an active target window area of a target application UI window. The method further includes determining whether the software UI window is disposed at the at least one suitable location on the display screen and if the software UI window is disposed in a location other than the at least one suitable location on the display screen, positioning the software UI window at the at least one suitable location on the display screen.

    摘要翻译: 提供了一种用于在显示屏幕上定位软件用户界面(UI)窗口的系统和方法,其中所述方法包括在显示屏幕上显示软件UI窗口,并且响应于活动目标来识别显示屏幕上的至少一个合适位置 目标应用程序UI窗口的窗口区域。 该方法还包括确定软件UI窗口是否被布置在显示屏幕上的至少一个适当位置,并且如果软件UI窗口被布置在除了显示屏幕上的至少一个适当位置之外的位置,则定位该软件 UI窗口在显示屏幕上的至少一个合适的位置。

    Recognizing multiple semantic items from single utterance
    6.
    发明授权
    Recognizing multiple semantic items from single utterance 有权
    从单一语音识别多个语义项

    公开(公告)号:US08725492B2

    公开(公告)日:2014-05-13

    申请号:US12042460

    申请日:2008-03-05

    IPC分类号: G06F17/28

    CPC分类号: G10L15/1815

    摘要: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.

    摘要翻译: 通过使用已经识别的语义项提供的约束重复地识别相同的话语,从单个话语中提取语义上不同的项目。 用于部分识别的话语的选择或校正的用户反馈可以以分层,多模式或单步的方式使用。 识别的准确性得到保留,而较少结构化和更自然的单个话语识别形式被允许使用。

    Recognizing multiple semantic items from single utterance
    7.
    发明申请
    Recognizing multiple semantic items from single utterance 有权
    从单一语音识别多个语义项

    公开(公告)号:US20090228270A1

    公开(公告)日:2009-09-10

    申请号:US12042460

    申请日:2008-03-05

    IPC分类号: G10L15/00

    CPC分类号: G10L15/1815

    摘要: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.

    摘要翻译: 通过使用已经识别的语义项提供的约束重复地识别相同的话语,从单个话语中提取语义上不同的项目。 用于部分识别的话语的选择或校正的用户反馈可以以分层,多模式或单步的方式使用。 识别的准确性得到保留,而较少结构化和更自然的单个话语识别形式被允许使用。

    Speech recognition disambiguation on mobile devices
    8.
    发明授权
    Speech recognition disambiguation on mobile devices 有权
    移动设备上的语音识别消歧

    公开(公告)号:US08224656B2

    公开(公告)日:2012-07-17

    申请号:US12049243

    申请日:2008-03-14

    IPC分类号: G10L21/06

    CPC分类号: G10L15/30

    摘要: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.

    摘要翻译: 一种方法,程序存储设备和移动设备提供语音消歧。 用于语音识别处理的音频由移动设备发送。 接收到表示与发送音频相匹配的候补的结果。 替代物显示在消歧对话屏幕中,用于对替代物进行更正。 使用消歧对话框屏幕对替代品进行更正,直到显示正确的结果。 选择正确的结果。 与所选择的正确结果相关联的内容与接收到表示被识别为匹配所发送的音频的替代的结果并行地接收。

    Speech Recognition Disambiguation on Mobile Devices
    9.
    发明申请
    Speech Recognition Disambiguation on Mobile Devices 有权
    移动设备语音识别消歧

    公开(公告)号:US20090234647A1

    公开(公告)日:2009-09-17

    申请号:US12049243

    申请日:2008-03-14

    IPC分类号: G10L15/26

    CPC分类号: G10L15/30

    摘要: A method, program storage device and mobile device provide speech disambiguation. Audio for speech recognition processing is transmitted by the mobile device. Results representing alternates identified to match the transmitted audio are received. The alternates are displayed in a disambiguation dialog screen for making corrections to the alternates. Corrections are made to the alternates using the disambiguation dialog screen until a correct result is displayed. The correct result is selected. Content associated with the selected correct result is received in parallel with the receiving of the results representing alternates identified to match the transmitted audio.

    摘要翻译: 一种方法,程序存储设备和移动设备提供语音消歧。 用于语音识别处理的音频由移动设备发送。 接收到表示与发送音频相匹配的候补的结果。 替代物显示在消歧对话屏幕中,用于对替代物进行更正。 使用消歧对话框屏幕对替代品进行更正,直到显示正确的结果。 选择正确的结果。 与所选择的正确结果相关联的内容与接收到表示被识别为匹配所发送的音频的替代的结果并行地接收。

    Incorporation of speech engine training into interactive user tutorial
    10.
    发明申请
    Incorporation of speech engine training into interactive user tutorial 审中-公开
    将言语引擎训练纳入交互式用户教程

    公开(公告)号:US20070055520A1

    公开(公告)日:2007-03-08

    申请号:US11265726

    申请日:2005-11-02

    IPC分类号: G10L15/04

    摘要: The present invention combines speech recognition tutorial training with speech recognizer voice training. The system prompts the user for speech data and simulates, with predefined screenshots, what happens when speech commands are received. At each step in the tutorial process, when the user is prompted for an input, the system is configured such that only a predefined set (which may be one) of user inputs will be recognized by the speech recognizer. When a successful recognition is being made, the speech data is used to train the speech recognition system.

    摘要翻译: 本发明将语音识别教程训练与语音识别器语音训练相结合。 系统提示用户语音数据,并用预定义的屏幕截图模拟接收到语音命令时会发生什么。 在教程过程的每个步骤中,当用户被提示输入时,系统被配置为使得只有预定义的用户输入(可以是一个)用户输入的集合将被语音识别器识别。 当成功识别时,语音数据用于训练语音识别系统。