Universal remote control adapted to receive voice input
    1.
    Granted invention patent
    Legal status: In force

    Publication No.: US06629077B1

    Publication date: 2003-09-30

    Application No.: US09721092

    Filing date: 2000-11-22

    IPC classification: G10L21/06

    CPC classification: G10L15/26 G08C2201/31

    Abstract: A universal remote control adapted to receive a voice input. The voice input is received by the remote control and compared to a plurality of voice command templates that are stored in the memory of the remote control. If the voice input matches one or more of the plurality of voice command templates, a valid voice input has been received by the remote control. Valid voice input may be a remote control command or keystroke data, input as an entire word or as individual characters. In response to a valid voice input, the remote control may transmit an operational command code and/or alphanumeric symbol code corresponding to keystroke data to a consumer electronic device.
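
The template comparison described in this abstract can be sketched roughly as follows. This is only an illustration, not the patented method: the feature vectors, the cosine-similarity measure, the 0.9 threshold, and all command and symbol codes are assumptions made up for the example.

```python
import math

# Hypothetical stored templates: name -> (feature vector, kind, code).
# The vectors and codes are invented for illustration only.
TEMPLATES = {
    "power": ([1.0, 0.0, 0.0], "command", 0x10),
    "volume up": ([0.0, 1.0, 0.0], "command", 0x11),
    "seven": ([0.0, 0.0, 1.0], "keystroke", ord("7")),
}


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_voice_input(features, threshold=0.9):
    """Return (name, kind, code) for the best template at or above threshold.

    Returns None when no template matches, i.e. the input is not valid.
    """
    best_name, best_score = None, threshold
    for name, (template, _kind, _code) in TEMPLATES.items():
        score = cosine(features, template)
        if score >= best_score:
            best_name, best_score = name, score
    if best_name is None:
        return None
    _, kind, code = TEMPLATES[best_name]
    return best_name, kind, code
```

A caller would transmit the returned code as an operational command when `kind` is `"command"` and as an alphanumeric symbol code when it is `"keystroke"`.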

    Human image dialogue device and a recording medium storing a human image dialogue device
    4.
    Granted invention patent
    Legal status: Lapsed

    Publication No.: US06434525B1

    Publication date: 2002-08-13

    Application No.: US09318806

    Filing date: 1999-05-26

    IPC classification: G10L21/06

    CPC classification: G10L21/06

    Abstract: A device is provided that generates the gestures and expressions of a human image on a computer without expending a great amount of labor. The words for the system response to the input of a user and the state of the dialogue are described in a dialogue flow memory unit. A dialogue flow analysis unit analyzes the spoken text of the flow and extracts the key words associated with a movement pattern by referring to a text movement association table, and the movement expression generation unit generates the movements corresponding to the movement pattern. In the generation of the movement, movement patterns determined in advance are selected according to the state of the dialogue written in the dialogue flow, and the movement pattern is determined or modified by the key words. In addition, in a text output control unit, words are displayed by switching between the display of a “conversation balloon” and the display of a “message board” according to the state of the dialogue written in the dialogue flow.
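
As a rough sketch of the lookup this abstract describes, the dialogue state can select a predetermined movement pattern and key words found in the spoken text can add to it. The state names, key words, and movement names below are hypothetical, and the tokenization is deliberately naive.

```python
# Hypothetical predetermined patterns, selected by dialogue state.
STATE_PATTERNS = {"greeting": "bow", "question": "tilt_head", "answer": "nod"}

# Hypothetical text movement association table: key word -> movement pattern.
TEXT_MOVEMENT_TABLE = {"hello": "wave", "sorry": "deep_bow", "this": "point"}


def generate_movements(state, spoken_text):
    """Return the movement patterns for one system response.

    The first entry comes from the dialogue state; later entries come from
    key words found in the spoken text, in the order they occur.
    """
    movements = [STATE_PATTERNS.get(state, "idle")]
    for raw in spoken_text.lower().split():
        word = raw.strip(".,!?")  # drop simple trailing punctuation
        pattern = TEXT_MOVEMENT_TABLE.get(word)
        if pattern is not None:
            movements.append(pattern)
    return movements
```

Keeping both tables as plain data means new gestures can be added without changing the generation code, which fits the abstract's stated aim of avoiding manual animation work.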

    Code image recording apparatus having a microphone, a loudspeaker and a printer
    5.
    Granted invention patent
    Legal status: Lapsed

    Publication No.: US06311160B1

    Publication date: 2001-10-30

    Application No.: US09164723

    Filing date: 1998-10-01

    Applicant: Shinichi Imade

    Inventor: Shinichi Imade

    IPC classification: G10L21/06

    CPC classification: G10L19/00

    Abstract: The operation mode control section sets an operation mode flag that authorizes the loudspeaker to replay the speech input through the microphone while the speech is being compressed and encoded into speech data by the compression/encoding section and then further processed by the encoding processing section. When an order is issued by the user to confirm the input speech by means of the replay operation section during the encoding operation, the speech output control section receives a permit signal for reproducing the speech through the loudspeaker after expanding the speech data by means of the speech data expansion processing section. The encoding operation proceeds concurrently during the speech reproducing operation.

    Mechanism for managing multiple speech applications
    6.
    Granted invention patent
    Legal status: In force

    Publication No.: US06192339B1

    Publication date: 2001-02-20

    Application No.: US09187571

    Filing date: 1998-11-04

    Applicant: Cory W. Cox

    Inventor: Cory W. Cox

    IPC classification: G10L21/06

    CPC classification: G10L15/32 G10L15/30

    Abstract: In one embodiment of the method and apparatus for managing multiple speech applications, a common development platform and a common environment are provided. The common environment interfaces with the speech applications, receives information from an application information storage and a plurality of speech input sources, allows the speech applications to execute simultaneously and transitions from one said speech application to another seamlessly. In addition, the speech applications are developed based on the common development platform. Thus, application developers may utilize the common development platform to design and implement the speech applications independently.
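
One way to picture the common environment is as a small dispatcher that keeps every registered speech application available and moves focus when the user names another application. The class below is a hypothetical sketch under that assumption; the class name, the switch-by-name rule, and the handler signature are not taken from the patent.

```python
class CommonEnvironment:
    """Toy dispatcher for several speech applications (illustrative only)."""

    def __init__(self):
        self.apps = {}      # application name -> handler callable
        self.active = None  # name of the application that currently has focus

    def register(self, name, handler):
        """Add an application; the first one registered gets focus."""
        self.apps[name] = handler
        if self.active is None:
            self.active = name

    def handle(self, utterance):
        """Switch focus if the utterance names an application, else forward it."""
        text = utterance.strip().lower()
        if text in self.apps:
            self.active = text
            return f"switched to {text}"
        if self.active is None:
            return "no application registered"
        return self.apps[self.active](text)
```

For example, after registering `mail` and `calendar` handlers, saying "calendar" moves focus and later utterances go to the calendar handler, while the mail handler stays registered.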

    Application server configured for dynamically generating web pages for voice enabled web applications
    7.
    Granted invention patent
    Legal status: In force

    Publication No.: US06766298B1

    Publication date: 2004-07-20

    Application No.: US09480485

    Filing date: 2000-01-11

    IPC classification: G10L21/06

    CPC classification: H04M3/4938

    Abstract: A unified web-based voice messaging system provides voice application control between a web browser and an application server via a Hypertext Transfer Protocol (HTTP) connection on an Internet Protocol (IP) network. The web browser receives an HTML page from the application server having an XML element that defines data for an audio operation to be performed by an executable audio resource. The application server executes the voice-enabled web application by runtime execution of extensible markup language (XML) documents that define the voice-enabled web application to be executed. The application server, in response to receiving a user request from a user, accesses a selected XML page that defines at least a part of the voice application to be executed for the user. The application server then parses the XML page, and executes the operation described by the XML page.
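
The parse-then-execute step at the end of this abstract might look like the sketch below. The XML vocabulary (`page`, `prompt`, `audio` with `op` and `src` attributes) and the in-memory `PAGES` store are assumptions for illustration; the patent's actual document format is not given here.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML pages defining parts of a voice application.
PAGES = {
    "menu": (
        "<page>"
        "<prompt>Main menu</prompt>"
        '<audio op="play" src="menu.wav"/>'
        "</page>"
    ),
}


def handle_request(page_name):
    """Parse the selected XML page and return the operations it describes."""
    root = ET.fromstring(PAGES[page_name])
    actions = []
    for element in root:
        if element.tag == "prompt":
            actions.append(("say", element.text))
        elif element.tag == "audio":
            actions.append((element.get("op"), element.get("src")))
    return actions
```

A real server would turn these operations into the HTML page returned to the browser; here they are simply returned as tuples so the flow is easy to follow.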

    System and process for voice-controlled information retrieval
    9.
    Granted invention patent
    Legal status: Lapsed

    Publication No.: US06636831B1

    Publication date: 2003-10-21

    Application No.: US09289784

    Filing date: 1999-04-09

    IPC classification: G10L21/06

    Abstract: A system and process for voice-controlled information retrieval. A conversation template is executed. The conversation template includes a script of tagged instructions including voice prompts and information content. A voice command identifying information content to be retrieved is processed. A remote method invocation is sent requesting the identified information content to an applet process associated with a Web browser. The information content is retrieved on the Web browser responsive to the remote method invocation.

    Voice-interactive docking station for a portable computing device
    10.
    Granted invention patent
    Legal status: In force

    Publication No.: US06539358B1

    Publication date: 2003-03-25

    Application No.: US09577860

    Filing date: 2000-05-24

    IPC classification: G10L21/06

    Abstract: A voice-interactive docking station is provided for use with a portable computing device. The portable computing device includes at least one information management application and a corresponding database for storing the data associated with the information management application. The docking station generally includes a speech input device for receiving speech input, a speech recognizer for translating the speech input into voice command data, and an interface application for interacting with the applications residing on the portable computing device. In particular, the interface application, in response to voice command data, accesses the data associated with the information management application residing on the portable computing device. The docking station may further include a text-to-speech synthesizer for converting output data from the interface application into speech output data, and an audio system for generating audio output from the speech output data.
