专利检索 ipc:"G10L2106" 第 1 页

1.

发明授权
Universal remote control adapted to receive voice input 有权
标题翻译：通用遥控器适用于接收语音输入

公开(公告)号：US06629077B1

公开(公告)日：2003-09-30

申请号：US09721092

申请日：2000-11-22

申请人： Paul D. Arling , Patrick H. Hayes

发明人： Paul D. Arling , Patrick H. Hayes

IPC分类号： G10L2106

CPC分类号： G10L15/26 , G08C2201/31

摘要： A universal remote control adapted to receive a voice input. The voice input is received by the remote control and compared to a plurality of voice command templates that are stored in the memory of the remote control. If the voice input matches one or more of the plurality of voice command templates, a valid voice input has been received by the remote control. Valid voice input may be a remote control command or keystroke data, input as an entire word or as individual characters. In response to a valid voice input, the remote control may transmit an operational command code and/or alphanumeric symbol code corresponding to keystroke data to a consumer electronic device.

摘要翻译： 适用于接收语音输入的通用遥控器。语音输入由遥控器接收并与存储在遥控器的存储器中的多个语音命令模板进行比较。如果语音输入匹配多个语音命令模板中的一个或多个，则遥控器已经接收到有效的语音输入。有效的语音输入可以是遥控命令或按键数据，作为整个单词输入或作为单个字符输入。响应于有效的语音输入，遥控器可以向消费者电子设备发送对应于击键数据的操作命令代码和/或字母数字符号代码。

2.

发明授权
Arrangement for defining and processing voice enabled web applications using extensible markup language documents 有权
标题翻译：使用可扩展标记语言文档定义和处理支持语音的Web应用程序的安排

公开(公告)号：US06490564B1

公开(公告)日：2002-12-03

申请号：US09501516

申请日：2000-02-09

申请人： Lewis Dean Dodrill , Geetha Ravishankar , Satish Joshi , Keith M. Basil , Ryan Alan Danner , Steven J. Martin , Swaminathan Ravishankar

发明人： Lewis Dean Dodrill , Geetha Ravishankar , Satish Joshi , Keith M. Basil , Ryan Alan Danner , Steven J. Martin , Swaminathan Ravishankar

IPC分类号： G10L2106

CPC分类号： H04L69/329 , G10L13/00 , G10L15/22 , G10L15/30 , G10L2015/228 , H04L29/06 , H04L51/04 , H04L67/28 , H04L67/2819 , H04L67/2842 , H04L67/34 , H04M3/4938 , H04M3/53 , H04M3/5307 , H04M3/533 , H04M7/12 , H04M2203/253 , H04M2203/4509

摘要： A unified web-based voice messaging system provides voice application control between a web browser and an application server via an hypertext transport protocol (HTTP) connection on an Internet Protocol (IP) network. The application server executes the voice-enabled web application by runtime execution of extensible markup language (XML) documents that define the voice-enabled web application to be executed. Each voice application operation can be defined as any one of a user interface operation, a logic operation, or a function operation. Each XML document includes XML tags that specify the user interface operation, the logic operation and/or the function operation to be performed within a corresponding voice application operation, the XML tags being based on prescribed rule sets that specify the executable functions to be performed by the application runtime environment.

摘要翻译： 统一的基于web的语音消息系统通过互联网协议（IP）网络上的超文本传输协议（HTTP）连接在web浏览器和应用服务器之间提供语音应用控制。应用服务器通过运行时执行可定义要执行的支持语音的Web应用程序的可扩展标记语言（XML）文档来执行支持语音的Web应用程序。每个语音应用操作可以被定义为用户界面操作，逻辑操作或功能操作中的任何一个。每个XML文档包括指定要在相应语音应用操作中执行的用户界面操作，逻辑操作和/或功能操作的XML标签，所述XML标签基于规定的规则集，所述规则规则集指定要由应用程序运行时环境。

3.

发明授权
Information recording medium, apparatus and method for performing after-recording on the recording medium 有权
标题翻译：用于在记录介质上执行后记录的信息记录介质，装置和方法

公开(公告)号：US06480828B1

公开(公告)日：2002-11-12

申请号：US09675156

申请日：2000-09-29

申请人： Tomoyuki Okada , Kaoru Murase , Noriko Sugimoto , Kazuhiro Tsuga

发明人： Tomoyuki Okada , Kaoru Murase , Noriko Sugimoto , Kazuhiro Tsuga

IPC分类号： G10L2106

CPC分类号： G11B27/34 , G11B20/10527 , G11B27/034 , G11B27/036 , G11B27/105 , G11B27/3027 , G11B27/329 , G11B27/36 , G11B2020/10592 , G11B2220/216 , G11B2220/2562 , G11B2220/2575 , H04N5/05 , H04N5/85 , H04N9/8042 , H04N9/8063

摘要： The invention provides an information recording medium, such as an optical disk, having a large capacity and being capable of performing read/write operations at high speeds. The recording medium includes an audio stream prepared for after-recording data, and a audio attribute information having a bit rate information to the recorded audio stream as a management information. A recorder according to the invention has a check unit for checking, in advance, the possibility of after-recording operation of the recorder to the audio stream to be after-recorded with reference to the bit rate information of the audio attribute information.

摘要翻译： 本发明提供了一种具有大容量并且能够以高速执行读/写操作的诸如光盘的信息记录介质。记录介质包括为后期记录数据准备的音频流和具有比特率信息的音频属性信息作为管理信息记录在记录音频流中。根据本发明的记录器具有一个检查单元，用于参考音频属性信息的比特率信息预先检查记录器对待记录的音频流的后记录操作的可能性。

4.

发明授权
Human image dialogue device and a recording medium storing a human image dialogue device 失效
标题翻译：人体图像对话装置和存储人体图像对话装置的记录介质

公开(公告)号：US06434525B1

公开(公告)日：2002-08-13

申请号：US09318806

申请日：1999-05-26

申请人： Izumi Nagisa , Dai Kusui

发明人： Izumi Nagisa , Dai Kusui

IPC分类号： G10L2106

CPC分类号： G10L21/06

摘要： A device is provided that generates the gestures and expressions of a human image on a computer without expending a great amount of labor. The words for the system response to the input of a user and the state of the dialogue are described in a dialogue flow memory unit, a dialogue flow analysis unit analyzes the spoken text of the flow, extracts the key words associated with a movement pattern by referring to a text movement association table, and the movement expression generation unit generates the movements corresponding to the movement pattern. In the generation of the movement, movement patterns determined in advance are selected according to the state of the dialogue written in the dialogue flow, and the movement pattern is determined or modified by the key words. In addition, in a text output control unit, words are displayed by switching between the display of a “conversation balloon” or the display of a “message board” according to the state of the dialogue written in the dialogue flow.

摘要翻译： 提供了一种在计算机上生成人物图像的手势和表达的装置，而不需要大量劳动。在对话流存储单元中描述用于系统对用户输入的响应的单词和对话状态，对话流分析单元分析流的语音文本，通过以下方式提取与移动模式相关联的关键词：参考文本移动关联表，并且运动表达式生成单元生成与运动模式相对应的运动。在运动的产生中，根据在对话流中写入的对话的状态来选择预先确定的移动模式，并且通过关键词确定或修改移动模式。此外，在文本输出控制单元中，根据写在对话流中的对话状态，通过切换“对话气球”的显示或“留言板”的显示来显示单词。

5.

发明授权
Code image recording apparatus having a microphone, a loudspeaker and a printer 失效
标题翻译：具有麦克风，扬声器和打印机的代码图像记录装置

公开(公告)号：US06311160B1

公开(公告)日：2001-10-30

申请号：US09164723

申请日：1998-10-01

申请人： Shinichi Imade

发明人： Shinichi Imade

IPC分类号： G10L2106

CPC分类号： G10L19/00

摘要： The operation mode control section sets an operation mode flag that authorizes the loudspeaker to replay the speech input through the microphone while the speech is being compressed and encoded into speech data by the compression/encoding section and then further processed by the encoding processing section. When an order is issued by the user to confirm the input speech by means of the replay operation section during the encoding operation, the speech output control section receives a permit signal for reproducing the speech through the loudspeaker after expanding the speech data by means of the speech data expansion processing section. The encoding operation proceeds concurrently during the speech reproducing operation.

摘要翻译： 操作模式控制部分设置一个操作模式标志，该操作模式标志通过压缩/编码部分将语音压缩并编码为语音数据，然后由编码处理部分进一步处理，来设置扬声器重播通过麦克风输入的语音。当在编码操作期间用户通过重播操作部分发出命令以确认输入语音时，语音输出控制部分在通过扬声器扩展语音数据之后通过扬声器接收用于再现语音的许可信号语音数据扩展处理部分。编码操作在语音再现操作期间同时进行。

6.

发明授权
Mechanism for managing multiple speech applications 有权
标题翻译：管理多个语音应用程序的机制

公开(公告)号：US06192339B1

公开(公告)日：2001-02-20

申请号：US09187571

申请日：1998-11-04

申请人： Cory W. Cox

发明人： Cory W. Cox

IPC分类号： G10L2106

CPC分类号： G10L15/32 , G10L15/30

摘要： In one embodiment of the method and apparatus for managing multiple speech applications, a common development platform and a common environment are provided. The common environment interfaces with the speech applications, receives information from an application information storage and a plurality of speech input sources, allows the speech applications to execute simultaneously and transitions from one said speech application to another seamlessly. In addition, the speech applications are developed based on the common development platform. Thus, application developers may utilize the common development platform to design and implement the speech applications independently.

摘要翻译： 在用于管理多个语音应用的方法和装置的一个实施例中，提供了共同的开发平台和公共环境。与语音应用的通用环境接口，从应用信息存储和多个语音输入源接收信息，允许语音应用同时执行，并从一个所述语音应用到另一个无缝地转换。此外，语音应用程序是基于共同的开发平台而开发的。因此，应用程序开发人员可以利用共同的开发平台来独立地设计和实现语音应用。

7.

发明授权
Application server configured for dynamically generating web pages for voice enabled web applications 有权
标题翻译：应用服务器配置为动态生成支持语音的Web应用程序的网页

公开(公告)号：US06766298B1

公开(公告)日：2004-07-20

申请号：US09480485

申请日：2000-01-11

申请人： Lewis Dean Dodrill , Geetha Ravishankar , Satish Joshi , Keith M. Basil , Ryan Alan Danner , James Richard Grove, Jr. , Steven J. Martin

发明人： Lewis Dean Dodrill , Geetha Ravishankar , Satish Joshi , Keith M. Basil , Ryan Alan Danner , James Richard Grove, Jr. , Steven J. Martin

IPC分类号： G10L2106

CPC分类号： H04M3/4938

摘要： A unified web-based voice messaging system provides voice application control between a web browser and an application server via an hypertext transport protocol (HTTP) connection on an Internet Protocol (IP) network. The web browser receives an HTML page from the application server having an XML element that defines data for an audio operation to be performed by an executable audio resource. The application server executes the voice-enabled web application by runtime execution of extensible markup language (XML) documents that define the voice-enabled web application to be executed. The application server, in response to receiving a user request from a user, accesses a selected XML page that defines at least a part of the voice application to be executed for the user. The application server then parses the XML page, and executes the operation describer by the XML page.

摘要翻译： 统一的基于web的语音消息系统通过互联网协议（IP）网络上的超文本传输协议（HTTP）连接在web浏览器和应用服务器之间提供语音应用控制。 Web浏览器从应用服务器接收具有XML元素的HTML页面，该XML元素定义要由可执行音频资源执行的音频操作的数据。应用服务器通过运行时执行可定义要执行的支持语音的Web应用程序的可扩展标记语言（XML）文档来执行支持语音的Web应用程序。响应于从用户接收用户请求，应用服务器访问定义要为用户执行的语音应用的至少一部分的所选择的XML页面。然后，应用服务器解析XML页面，并通过XML页面执行操作描述符。

8.

发明授权
Information processing apparatus, information processing method and program storage medium 失效
标题翻译：装置，方法和程序存储介质，涉及识别口头文本并将相应的标题附加到图片上

公开(公告)号：US06757657B1

公开(公告)日：2004-06-29

申请号：US09640596

申请日：2000-08-17

申请人： Kiyonobu Kojima , Yasuhiko Kato , Shuji Yonekura , Satoshi Fujimura , Takashi Sasai , Naoki Fujisawa , Junji Ooi

发明人： Kiyonobu Kojima , Yasuhiko Kato , Shuji Yonekura , Satoshi Fujimura , Takashi Sasai , Naoki Fujisawa , Junji Ooi

IPC分类号： G10L2106

CPC分类号： G06F1/1616 , G06F1/1679 , G06F1/1686 , G06F1/169 , G06F3/16 , G06F17/30265 , H04N1/00127 , H04N1/00212 , H04N1/00236 , H04N1/00241 , H04N1/00326 , H04N1/0035 , H04N1/32128 , H04N2201/3225 , H04N2201/3243 , H04N2201/3261 , H04N2201/3273 , H04N2201/3278

摘要： An information processing apparatus including an image-sensing controller controlling image-sensing so as to take a picture upon detection of execution of a first operation, a word generator recognizing speech upon detection of execution of a second operation and generating a word or a phrase corresponding to the recognized voice, and a portion associating the word or a phrase with the picture. Accordingly a word, a generated phrase or the like can be easily associated with an image-sensed still picture (with ease).

摘要翻译： 一种信息处理设备，包括：图像感测控制器，其在检测到第一操作的执行时控制图像感测以拍摄图像，字检测器在检测到第二操作的执行时识别语音，并且生成相应的单词或短语到识别的声音，以及将单词或短语与图片相关联的部分。因此，单词，生成的短语等可以容易地与图像感测的静止图像相关联。

9.

发明授权
System and process for voice-controlled information retrieval 失效
标题翻译：用于语音信息检索的系统和过程

公开(公告)号：US06636831B1

公开(公告)日：2003-10-21

申请号：US09289784

申请日：1999-04-09

申请人： Jack H. Profit, Jr. , N. Gregg Brown , Peter S. Mezey , Lianne M. Colombo

发明人： Jack H. Profit, Jr. , N. Gregg Brown , Peter S. Mezey , Lianne M. Colombo

IPC分类号： G10L2106

CPC分类号： H04M3/4938 , G10L15/193 , G10L15/26

摘要： A system and process for voice-controlled information retrieval. A conversation template is executed. The conversation template includes a script of tagged instructions including voice prompts and information content. A voice command identifying information content to be retrieved is processed. A remote method invocation is sent requesting the identified information content to an applet process associated with a Web browser. The information content is retrieved on the Web browser responsive to the remote method invocation.

摘要翻译： 用于语音信息检索的系统和过程。会话模板被执行。对话模板包括包括语音提示和信息内容的标记指令的脚本。处理识别要检索的信息内容的语音命令。发送远程方法调用，请求所识别的信息内容到与Web浏览器相关联的小程序进程。响应于远程方法调用，在Web浏览器上检索信息内容。

10.

发明授权
Voice-interactive docking station for a portable computing device 有权
标题翻译：用于便携式计算设备的语音交互坞站

公开(公告)号：US06539358B1

公开(公告)日：2003-03-25

申请号：US09577860

申请日：2000-05-24

申请人： Bradley S. Coon , Ronald K. Reger

发明人： Bradley S. Coon , Ronald K. Reger

IPC分类号： G10L2106

CPC分类号： B60R11/0247 , B60R11/0241 , B60R11/0252 , B60R2011/0085 , B60R2011/0087 , G06F1/1632 , G06F3/16 , G10L15/26

摘要： A voice-interactive docking station is provided for use with a portable computing device. The portable computing device includes at least one information management application and a corresponding database for storing the data associated with the information management application. The docking station generally includes a speech input device for receiving speech input, a speech recognizer for translating the speech input into voice command data, and an interface application for interacting with the applications residing on the portable computing device. In particular, the interface application, in response to voice command data, accesses the data associated with the information management application residing on the portable computing device. The docking station may further include a text-to-speech synthesizer for converting output data from the interface application into speech output data, and an audio system for generating audio output from the speech output data.

摘要翻译： 提供了一种与便携式计算设备一起使用的语音交互式坞站。便携式计算设备包括至少一个信息管理应用和用于存储与信息管理应用相关联的数据的相应数据库。对接站通常包括用于接收语音输入的语音输入设备，用于将语音输入转换成语音命令数据的语音识别器，以及用于与驻留在便携式计算设备上的应用进行交互的接口应用。特别地，接口应用程序响应于语音命令数据，访问与驻留在便携式计算设备上的信息管理应用相关联的数据。坞站还可以包括用于将来自接口应用的输出数据转换为语音输出数据的文本到语音合成器，以及用于从语音输出数据生成音频输出的音频系统。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类