Information viewing/listening system
    1.
    发明授权
    Information viewing/listening system 有权
    信息查看/收听系统

    公开(公告)号:US07657743B2

    公开(公告)日:2010-02-02

    申请号:US10751991

    申请日:2004-01-07

    IPC分类号: H04L9/32

    摘要: To enable use of content data recorded in an arbitrary player using a mobile terminal, such as a cellular phone, in response to an inquiry made by a mobile terminal to a player, the player transmits a response to the mobile terminal. When a user performs a predetermined operation on the mobile terminal, the mobile terminal creates a one-time password and transmits the one-time password and operation information concerning the user-performed operation to the player. The player transmits a terminal ID and the like to a service center. Upon reception of the information, the service center creates a one-time password and transmits the one-time password and an operation permission command to the player. The player compares the transmitted password with the password created by the mobile terminal. When the passwords are verified, the user operation instruction is made valid.

    摘要翻译: 为了能够使用诸如蜂窝电话的移动终端在任意播放器中记录的内容数据,响应于移动终端对玩家的查询,播放器向移动终端发送响应。 当用户在移动终端上执行预定的操作时,移动终端创建一次性密码,并向玩家发送关于用户执行的操作的一次密码和操作信息。 播放器将终端ID等发送到服务中心。 在接收到信息时,服务中心创建一次性密码,并向玩家发送一次性密码和操作许可命令。 播放器将发送的密码与移动终端创建的密码进行比较。 当验证密码时,用户操作指令有效。

    Bifurcated speaker specific and non-speaker specific speech recognition
method and apparatus
    2.
    发明授权
    Bifurcated speaker specific and non-speaker specific speech recognition method and apparatus 失效
    分岔扬声器专用和非扬声器特定语音识别方法和装置

    公开(公告)号:US6070139A

    公开(公告)日:2000-05-30

    申请号:US699874

    申请日:1996-08-20

    摘要: Bifurcated speaker specific and non-speaker specific method and apparatus is provided for enabling speech-based remote control and for recognizing the speech of an unspecified speaker at extremely high recognition rates regardless of the speaker's age, sex, or individual speech mannerisms. A device main unit is provided with a speech recognition processor for recognizing speech and taking an appropriate action, and with a user terminal containing specific speaker capture and/or preprocessing capabilities. The user terminal exchanges data with the speech recognition processor using radio transmission. The user terminal may be provided with a conversion rule generator that compares the speech of a user with previously compiled standard speech feature data and, based on this comparison result, generates a conversion rule for converting the speaker's speech feature parameters to corresponding standard speaker's feature information. The speech recognition processor, in turn, may reference the conversion rule developed in the user terminal and perform speech recognition based on the input speech feature parameters that have been converted above.

    摘要翻译: 提供分岔扬声器专用和非说话者的具体方法和装置,用于实现基于语音的远程控制并且以非常高的识别率识别未指定的说话者的语音,而不管说话者的年龄,性别或个人言语行为。 设备主单元设置有用于识别语音并采取适当动作的语音识别处理器,并且具有包含特定扬声器捕获和/或预处理能力的用户终端。 用户终端使用无线电传输与语音识别处理器交换数据。 用户终端可以设置有将用户的语音与先前编译的标准语音特征数据进行比较的转换规则生成器,并且基于该比较结果,生成用于将说话者的语音特征参数转换为对应的标准说话者的特征信息的转换规则 。 语音识别处理器又可以参考在用户终端中开发的转换规则,并且基于上面转换的输入语音特征参数来执行语音识别。

    Voice-activated interactive speech recognition device and method
    3.
    发明授权
    Voice-activated interactive speech recognition device and method 失效
    语音激活交互式语音识别装置及方法

    公开(公告)号:US5983186A

    公开(公告)日:1999-11-09

    申请号:US700181

    申请日:1996-08-20

    CPC分类号: G10L15/26 G10L2025/783

    摘要: Techniques for implementing adaptable voice activation operations for interactive speech recognition devices and instruments. Specifically, such speech recognition devices and instruments include an input sound signal power or volume detector in communication with a central CPU for bringing the CPU out of an initial sleep state upon detection of perceived voice exceeding a predetermined threshold volume level and is continuously perceived for at least a certain period of time. If both these conditions are satisfied, the CPU is transitioned into an active mode so that the perceived voice can be analyzed against a set of registered key words to determine if a "power on" command or similar instruction has been received. If so, the CPU maintains an active state in normal speech recognition processing ensues until a "power off" command is received. However, if the perceived and analyzed voice can not be recognized, it is deemed to be background noise and the minimum threshold is selectively updated to accommodate the volume level of the perceived but unrecognized voice. Other aspects include tailoring the volume level of the synthesized voice response according to the perceived volume level as detected by the input sound signal power detector, as well as modifying audible response volume in accordance with updated volume threshold levels.

    摘要翻译: 用于实现交互式语音识别设备和仪器的适应性语音激活操作的技术。 具体地说,这样的语音识别装置和仪器包括与中央CPU通信的输入声音信号功率或音量检测器,用于在检测到超过预定阈值音量水平的感知语音时使CPU退出初始睡眠状态,并且持续感觉到 至少一段时间。 如果满足这两个条件,则CPU转换到活动模式,使得可以针对一组注册的关键字分析感知的语音,以确定是否已经接收到“开机”命令或类似的指令。 如果是这样,则在正常语音识别处理中CPU保持活动状态,直到接收到“断电”命令。 然而,如果感知和分析的语音不能被识别,则将其视为背景噪声,并且选择性地更新最小阈值以适应感知但未被识别的语音的音量级别。 其他方面包括根据由输入声音信号功率检测器检测到的感知音量水平定制合成语音响应的音量级别,以及根据更新的音量阈值水平修改可听见的响应音量。

    Interactive voice recognition method and apparatus using
affirmative/negative content discrimination
    4.
    发明授权
    Interactive voice recognition method and apparatus using affirmative/negative content discrimination 失效
    使用肯定/否定内容歧视的交互式语音识别方法和装置

    公开(公告)号:US5899972A

    公开(公告)日:1999-05-04

    申请号:US536550

    申请日:1995-09-29

    CPC分类号: G10L15/22

    摘要: A technique for improving voice recognition in low-cost, speech interactive devices. This technique calls for implementing a affirmative/negative discrimination unit in parallel with a word detection unit to permit comprehension of spoken commands or messages issued by binary questions when no recognizable words are found. Preferably, affirmative/negative discrimination will include either spoken vowel analysis or negative language descriptor detection of the perceived message or command. Other facets include keyword identification within the perceived message or command, confidence match level comparison or correlation table compilation in order to increase recognition accuracy of word-based recognition, volume analysis, and inclusion of ambient environment information in generating responses to perceived messages or queries.

    摘要翻译: 一种用于在低成本语音交互设备中改善语音识别的技术。 该技术要求与单词检测单元并行地实现肯定/否定鉴别单元,以便在找不到可识别的单词时允许理解由二进制问题发出的口语命令或消息。 优选地,肯定/否定的判别将包括对所感知的消息或命令的口头元音分析或负面语言描述符检测。 其他方面包括所感知的消息或命令内的关键字识别,置信匹配级别比较或相关表编译,以便增加基于字的识别的识别精度,体积分析以及包含周围环境信息以产生对所感知的消息或查询的响应。

    Cartridge-based, interactive speech recognition device with
response-creation capability
    5.
    发明授权
    Cartridge-based, interactive speech recognition device with response-creation capability 失效
    具有响应创造能力的基于墨盒的交互式语音识别装置

    公开(公告)号:US5842168A

    公开(公告)日:1998-11-24

    申请号:US700175

    申请日:1996-08-20

    摘要: A technique for improving speech recognition in low-cost, speech interactive devices. This technique calls for selectively implementing a speaker-specific word enrollment and detection unit in parallel with a word detection unit to permit comprehension of spoken commands or messages when no recognizable words are found. Preferably, specific speaker detection will be based on the speaker's own personal list of words or expression. Other facets include complementing non-specific pre-registered word characteristic information with individual, speaker-specific verbal characteristics to improve recognition in cases where the speaker has unusual speech mannerisms or accent and response alteration in which speaker-specification registration functions are leveraged to provide access and permit changes to a predefined responses table according to user needs and tastes. Also disclosed is the externalization and modularization of non-specific speaker recognition, action and response information to enhance adaptability of the speech recognizer without sacrificing product cost competitiveness or overall device responsiveness.

    摘要翻译: 一种用于在低成本语音交互设备中改善语音识别的技术。 该技术要求与字检测单元并行地选择性地实现与扬声器相关的字注册和检测单元,以便在找不到可识别字时允许理解口语命令或消息。 优选地,具体的说话者检测将基于说话者自己的单词或表达的个人列表。 其他方面包括补充非特定的预先登记的单词特征信息,具有单独的具有说话者的语言特征,以在讲话者具有不寻常的语音方式或重音和响应改变的情况下改善识别,其中利用说话者说明书注册功能来提供访问 并允许根据用户需求和口味对预定义的响应表进行更改。 还公开了非特定说话人识别,动作和响应信息的外部化和模块化,以增强语音识别器的适应性,而不牺牲产品成本竞争力或整体设备响应性。

    Continuous speech recognition method and program medium with alternative choice selection to confirm individual words
    6.
    发明授权
    Continuous speech recognition method and program medium with alternative choice selection to confirm individual words 有权
    连续语音识别方法和程序介质,具有可选择选择,以确认单个单词

    公开(公告)号:US06564185B1

    公开(公告)日:2003-05-13

    申请号:US09370982

    申请日:1999-08-10

    IPC分类号: G10L1522

    CPC分类号: G10L15/22

    摘要: The invention relates to a method and apparatus for recognition processing of continuous words of a group which is structured by a plurality of words such that a recognition result of all of the words which structures the continuous words is effectively and accurately confirmed. All of the continuous words which have been input are recognition processed, the recognition result of all of the continuous words is output, a response from a speaker showing an affirmative/negative recognition result is input and recognition processed. If affirmative is determined, the recognition result at that time is confirmed for all of the continuous words. If negative is determined, for each word from a first to an nth (third in this case) which structures continuous words, the content showing affirmative/negative from the speaker is recognized, affirmative or negative is determined, and the recognition result at that time is confirmed as a recognition processing target word.

    摘要翻译: 本发明涉及一种用于识别处理由多个单词构成的组的连续单词的方法和装置,使得能够有效和准确地确认构成连续单词的所有单词的识别结果。 所输入的所有连续词都是识别处理的,所有连续词的识别结果都被输出,表示肯定/否定识别结果的说话者的响应被输入并进行识别处理。 如果确定肯定,则确认所有连续词的识别结果。 如果确定为否定,对于构成连续词的第一至第n(在这种情况下为第三)的每个单词,确定从说话者显示肯定/否定的内容,确定肯定或否定,并且当时的识别结果 被确认为识别处理对象字。

    Speech recognition method, speech recognition device, and recording medium on which is recorded a speech recognition processing program
    7.
    发明授权
    Speech recognition method, speech recognition device, and recording medium on which is recorded a speech recognition processing program 有权
    语音识别方法,语音识别装置和其上记录有语音识别处理程序的记录介质

    公开(公告)号:US06446039B1

    公开(公告)日:2002-09-03

    申请号:US09378997

    申请日:1999-08-23

    IPC分类号: G10L1528

    CPC分类号: G10L15/285 G10L15/06

    摘要: This invention concerns obtaining high recognition capability while there is a large limitation on memory capacity and processing ability of a CPU. When several words are selected as registration words among a plurality of recognizable words, a recognition target speaker speaks the respective registration words, registration word data for the respective registration words from the sound data is created and saved in a RAM. When the recognition target speaker speaks a registration word, sound is recognized using the registration word data, and when recognizable words other than the registration words are recognized, sound is recognized using specific speaker group sound model data. Furthermore, speaker learning processing is performed using the registration word data and the specific speaker group sound model data, and when recognizable words other than the registration words are recognized, sound is recognized using post-speaker learning data for speaker adaptation.

    摘要翻译: 本发明涉及在CPU的存储容量和处理能力存在很大限制的情况下获得高识别能力。 当在多个可识别字中选择多个字作为注册字时,识别目标说话者说出各自的注册字,从声音数据中创建并保存用于各声部数据的登记字数据。 当识别目标扬声器使用注册字时,使用注册字数据识别声音,并且当识别出除注册字之外的可识别字时,使用特定扬声器组声音模型数据识别声音。 此外,使用注册字数据和特定扬声器组声音模型数据进行说话者学习处理,并且当识别出除注册字之外的可识别词时,使用用于说话者适配的后讲话者学习数据来识别声音。

    Voice model learning data creation method and its apparatus
    8.
    发明授权
    Voice model learning data creation method and its apparatus 失效
    语音模型学习数据创建方法及其设备

    公开(公告)号:US06349281B1

    公开(公告)日:2002-02-19

    申请号:US09010799

    申请日:1998-01-22

    IPC分类号: G10L1506

    CPC分类号: G10L15/07

    摘要: A voice model learning data creation method and apparatus makes possible the creation of an inexpensive voice model in a short period of time when creating a voice model for a new word not in a preexisting database. Verbal data from several persons is selected from among the verbal data held in the database. This selected verbal data is referred to as standard speaker data, and is stored in a standard speaker data storage component. The remaining verbal data in the preexisting database is designated as learning speaker data, as is stored in a learning speaker data storage component. A data conversion function from the standard speaker data space to the learning speaker data space is derived. Then, the learning data for the new word is created by the data conversion function. Thus, the data which is obtained from the standard speaker speaking the new word is converted to the learning speaker data space.

    摘要翻译: 语音模型学习数据创建方法和装置使得在为不在预先存在的数据库中的新单词创建语音模型的短时间内创建便宜的语音模型成为可能。 从数据库中保存的语言数据中选出来自多个人的语言数据。 该选择的语言数据被称为标准扬声器数据,并存储在标准扬声器数据存储部件中。 预先存在的数据库中的剩余语言数据被指定为学习扬声器数据,如存储在学习扬声器数据存储部件中那样。 导出从标准扬声器数据空间到学习扬声器数据空间的数据转换功能。 然后,通过数据转换功能创建新单词的学习数据。 因此,从标准说话者说出的新单词获得的数据被转换为学习扬声器数据空间。

    Cartridge-based, interactive speech recognition method with a response
creation capability
    9.
    发明授权
    Cartridge-based, interactive speech recognition method with a response creation capability 有权
    基于墨盒的交互式语音识别方法,具有响应创造能力

    公开(公告)号:US5946658A

    公开(公告)日:1999-08-31

    申请号:US165512

    申请日:1998-10-02

    摘要: A technique for improving speech recognition in low-cost, speech interactive devices. This technique calls for selectively implementing a speaker-specific word enrollment and detection unit in parallel with a word detection unit to permit comprehension of spoken commands or messages when no recognizable words are found. Preferably, specific speaker detection will be based on the speaker's own personal list of words or expression. Other facets include complementing non-specific pre-registered word characteristic information with individual, speaker-specific verbal characteristics to improve recognition in cases where the speaker has unusual speech mannerisms or accent and response alteration in which speaker-specification registration functions are leveraged to provide access and permit changes to a predefined responses table according to user needs and tastes. Also disclosed is the externalization and modularization of non-specific speaker recognition, action and response information to enhance adaptability of the speech recognizer without sacrificing product cost competitiveness or overall device responsiveness.

    摘要翻译: 一种用于在低成本语音交互设备中改善语音识别的技术。 该技术要求与字检测单元并行地选择性地实现与扬声器特定的单词注册和检测单元,以便在找不到可识别的单词时允许理解口语命令或消息。 优选地,具体的说话者检测将基于说话者自己的单词或表达的个人列表。 其他方面包括补充非特定的预先登记的单词特征信息,具有单独的具有说话者的语言特征,以在讲话者具有不寻常的语音方式或重音和响应改变的情况下改善识别,其中利用说话者说明书注册功能来提供访问 并允许根据用户需求和口味对预定义的响应表进行更改。 还公开了非特定说话人识别,动作和响应信息的外部化和模块化,以增强语音识别器的适应性,而不牺牲产品成本竞争力或整体设备响应性。

    Information viewing/listening system, information player, and information provider
    10.
    发明授权
    Information viewing/listening system, information player, and information provider 有权
    信息查看/收听系统,信息播放器和信息提供者

    公开(公告)号:US07458101B2

    公开(公告)日:2008-11-25

    申请号:US10751957

    申请日:2004-01-07

    摘要: To impose a restriction on actual use, such as playback, of content data and to reduce or prevent unauthorized use of content data by a unit that is not registered with an information provider, in response to an operation request to play content data recorded in a user unit, the user unit transmits current time information, the unit ID, content information, and operation information to a service center and, using these pieces of information, creates a one-time password that is valid only for a predetermined period of time. On the basis of the information transmitted from the user unit and current time information obtained from a time keeping unit of the service center, the service center similarly creates a one-time password and an operation permission command and transmits them to the user unit. The user unit compares the two one-time passwords. When the two one-time passwords match each other, the user-requested operation (playback) is executed.

    摘要翻译: 响应于播放记录在信息提供者中的内容数据的操作请求,对实际使用(例如播放)限制内容数据的限制,并且减少或防止未由信息提供者注册的单元未经授权使用内容数据 用户单元,用户单元将当前时间信息,单元ID,内容信息和操作信息发送到服务中心,并且使用这些信息创建仅在预定时间段内有效的一次性密码。 基于从用户单元发送的信息和从服务中心的时间保持单元获得的当前时间信息,服务中心类似地创建一次密码和操作许可命令并将其发送到用户单元。 用户单元比较两个一次性密码。 当两个一次性密码彼此匹配时,执行用户请求的操作(播放)。