Sound special reproducing method and information reproducing apparatus
    1.
    发明授权
    Sound special reproducing method and information reproducing apparatus 失效
    声音专用再现方法和信息再生装置

    公开(公告)号:US06832193B1

    公开(公告)日:2004-12-14

    申请号:US09615313

    申请日:2000-07-13

    申请人: Kazuo Hoshi

    发明人: Kazuo Hoshi

    IPC分类号: G10L2100

    摘要: The present invention relates to a method and apparatus for making it easy to understand the contents of sound during special reproduction. Herein, the MPEG multiple separation circuit separates digital data read out from the optical disk into audio data and video data, the sound recognition text conversion circuit converts audio data decoded in the MPEG audio decoder into text data by sound recognition, and the on-screen character processor generates video signals in which the characters representing text data are displayed, being overlapped with reproduced images. In case of special reproduction such as double-speed reproduction, the characters representing text data are displayed, being overlapped with special reproduced images.

    摘要翻译: 本发明涉及在特殊再现期间容易理解声音内容的方法和装置。 这里,MPEG多重分离电路将从光盘读出的数字数据分离成音频数据和视频数据,声音识别文本转换电路通过声音识别将MPEG音频解码器中解码的音频数据转换为文本数据,并且屏幕上显示 字符处理器产生其中显示表示文本数据的字符与再现的图像重叠的视频信号。 在诸如双速再现的特殊再现的情况下,显示表示文本数据的字符,与特殊的再现图像重叠。

    Methodology for developing interactive systems
    2.
    发明授权
    Methodology for developing interactive systems 有权
    开发互动系统的方法

    公开(公告)号:US06823313B1

    公开(公告)日:2004-11-23

    申请号:US09417166

    申请日:1999-10-12

    IPC分类号: G10L2100

    CPC分类号: G10L15/22

    摘要: A method is provided for developing a computer-based dialogue interface for an automated or computerized system using input device technology. The dialogue interface is disposed between the automated system and an end user, with the dialogue interface receiving input from the end user and providing output to the end user in response to the input. In an illustrative embodiment, the method comprises the following steps. A system designer(s) defines a plurality of requirements applicable to the dialogue interface. The dialogue interface is then designed to meet these requirements. The automated system is simulated with at least a first person, and the end user is simulated with at least a second person. The dialogue interface is evaluated by facilitating an interaction between the first and the second persons through the dialogue interface. Based on the interaction between the first and the second persons, the dialogue interface is evaluated. Based on the evaluation of the dialogue interface, the dialogue interface is refined. After performing the above steps, the automated system is then developed based upon the dialogue interface.

    摘要翻译: 提供了一种用于使用输入设备技术开发用于自动化或计算机化系统的基于计算机的对话界面的方法。 对话界面设置在自动化系统和最终用户之间,对话界面接收终端用户的输入,并响应输入向终端用户提供输出。 在说明性实施例中,该方法包括以下步骤。 系统设计者定义适用于对话界面的多种要求。 然后设计对话界面以满足这些要求。 使用至少第一人员模拟自动化系统,并且至少使用第二人模拟最终用户。 通过对话界面促进第一和第二人之间的交互来评估对话界面。 基于第一人与第二人之间的相互作用,评估对话界面。 基于对话界面的评估,对话界面得到改进。 在执行上述步骤之后,基于对话界面开发自动化系统。

    Mellin-transform information extractor for vibration sources
    3.
    发明授权
    Mellin-transform information extractor for vibration sources 有权
    用于振动源的Mellin变换信息提取器

    公开(公告)号:US06675140B1

    公开(公告)日:2004-01-06

    申请号:US09493661

    申请日:2000-01-28

    IPC分类号: G10L2100

    CPC分类号: G06F17/14

    摘要: The signal processing method includes the steps of: wavelet-transforming an input signal in a computer; and extracting features of the signal by Mellin-transforming the output of the wavelet transform step in synchrony with the input signal in a computer.

    摘要翻译: 信号处理方法包括以下步骤:对计算机中的输入信号进行小波变换; 以及通过与计算机中的输入信号同步地对小波变换步骤的输出进行Mellin变换来提取信号的特征。

    Method and means of voice control of a computer, including its mouse and keyboard
    4.
    发明授权
    Method and means of voice control of a computer, including its mouse and keyboard 失效
    计算机语音控制的方法和手段,包括其鼠标和键盘

    公开(公告)号:US06668244B1

    公开(公告)日:2003-12-23

    申请号:US08683824

    申请日:1996-07-18

    IPC分类号: G10L2100

    摘要: New method and means for controlling the environment of disabled individuals through their voice, which includes the operation of lights or any number of appliances and a personal computer wherein the keyboard and the mouse are separately controlled by voice commands, without interference with normal application, (including dictation programs), operating within the computer. Effectively the voice control provides parallel mouse and keyboard commands with normal mouse and keyboard commands.

    摘要翻译: 用于通过声音控制残疾人的环境的新方法和手段,包括操作灯或任何数量的电器和个人计算机,其中键盘和鼠标由语音命令分开控制,而不会干扰正常应用( 包括听写程序),在计算机内运行。 有效地,语音控制提供具有普通鼠标和键盘命令的并行鼠标和键盘命令。

    Load-adjusted speech recogintion
    5.
    发明授权
    Load-adjusted speech recogintion 有权
    负载调整语音识别

    公开(公告)号:US06629075B1

    公开(公告)日:2003-09-30

    申请号:US09591161

    申请日:2000-06-09

    申请人: Johan Schalkwyk

    发明人: Johan Schalkwyk

    IPC分类号: G10L2100

    CPC分类号: G10L15/285

    摘要: A speech recognition system includes a user interface configured to provide signals indicative of a user's speech. A speech recognizer of the system includes a processor configured to use the signals from the user interface to perform speech recognition operations to attempt to recognize speech indicated by the signals. A control mechanism is coupled to the voice recognizer and is configured to affect processor usage for speech recognition operations in accordance with a loading of the processor.

    摘要翻译: 语音识别系统包括被配置为提供指示用户的语音的信号的用户界面。 该系统的语音识别器包括:处理器,被配置为使用来自用户接口的信号执行语音识别操作,以试图识别由该信号指示的语音。 控制机构耦合到语音识别器,并且被配置为根据处理器的加载来影响用于语音识别操作的处理器使用。

    Merging of speech interfaces from concurrent use of devices and applications
    6.
    发明授权
    Merging of speech interfaces from concurrent use of devices and applications 失效
    从并发使用设备和应用程序合并语音界面

    公开(公告)号:US06615177B1

    公开(公告)日:2003-09-02

    申请号:US09546768

    申请日:2000-04-11

    IPC分类号: G10L2100

    CPC分类号: G10L15/26 H04M2201/40

    摘要: According to the present invention network devices that can be controlled via a speech unit included in the network can send a device-document describing its functionality and its speech interface to said speech unit. The speech unit combines those documents to a general document that forms the basis to translate recognized user-commands into user-network-commands to control the connected network-devices. A device-document comprises at least the vocabulary and the commands associated therewith for the corresponding device. Furtheron, pronunciation, grammar for word sequences, rules for speech understanding and dialog can be contained in such documents as well as the same information for multiple languages or information for dynamic dialogs in speech understanding. It is possible that one device contains several documents and dynamically sends them to the speech unit in case they are needed. Furtheron, the present invention enables a device to change its functionality dynamically based on changing content, since a network device send its specifications regarding its speech capabilities to the speech unit while the speech unit is in use.

    摘要翻译: 根据本发明,可以通过网络中包括的语音单元进行控制的网络设备可以将描述其功能的设备文档及其语音接口发送到所述语音单元。 语音单元将这些文档组合到一般文档中,该文档构成将识别的用户命令转换为用户网络命令以控制所连接的网络设备的基础。 装置文件至少包括与对应装置相关联的词汇表和命令。 更重要的是,语言理解和对话语言的发音,语法,语音理解和对话的规则可以包含在这样的文档中,以及用于多种语言的相同信息或用于语音理解中的动态对话的信息。 一个设备可能包含多个文档,并在需要时动态地将它们发送到语音单元。 因此,本发明使得设备能够基于变化的内容动态地改变其功能,因为网络设备在语音单元正在使用时将其关于其语音能力的规范发送到语音单元。

    System and method for overlapping audio elements in a customized personal radio broadcast
    7.
    发明授权
    System and method for overlapping audio elements in a customized personal radio broadcast 有权
    用于在定制的个人无线电广播中重叠音频元素的系统和方法

    公开(公告)号:US06609096B1

    公开(公告)日:2003-08-19

    申请号:US09657256

    申请日:2000-09-07

    IPC分类号: G10L2100

    摘要: A method for overlapping stored audio elements in a system for providing a customized radio broadcast. The method includes the steps of dividing a first audio element into a plurality of audio element components; selecting one of said audio element components; decompressing the selected audio element component; selecting a second audio element; decompressing the second audio element; mixing the decompressed audio element component with the decompressed second audio element to form a mixed audio element component; and compressing the mixed audio element component to form a compressed overlapping audio element component. The compressed overlapping audio element component may replace the selected audio component. The first audio element may be a song, while the second audio element may be a DJ introduction. Accordingly, the compressed overlapping audio element may be broadcast followed by the remaining components of the song audio element.

    摘要翻译: 用于将存储的音频元素重叠在用于提供定制的无线电广播的系统中的方法。 该方法包括以下步骤:将第一音频元素划分成多个音频元素分量; 选择所述音频元素组件之一; 解压缩所选择的音频元素组件; 选择第二音频元素; 解压缩第二音频元素; 将解压缩的音频元素分量与解压缩的第二音频元素混合以形成混合音频元素分量; 以及压缩混合音频元素分量以形成压缩的重叠音频元素分量。 压缩的重叠音频元素组件可以替代所选择的音频分量。 第一音频元素可以是歌曲,而第二音频元素可以是DJ简介。 因此,压缩的重叠音频元素可以被广播,之后是歌曲音频元素的剩余组件。

    Method, system and product for modifying the dynamic range of encoded audio signals
    9.
    发明授权
    Method, system and product for modifying the dynamic range of encoded audio signals 失效
    用于修改编码音频信号的动态范围的方法,系统和产品

    公开(公告)号:US06516299B1

    公开(公告)日:2003-02-04

    申请号:US08771462

    申请日:1996-12-20

    申请人: Eliot M. Case

    发明人: Eliot M. Case

    IPC分类号: G10L2100

    摘要: A method, system and product for modifying the dynamic range of an encoded audio signal. The method includes receiving the encoded audio signal, the encoded audio signal having a first set of scale factors associated with a first dynamic range, and identifying a playback destination for the encoded audio signal, the playback destination having a second dynamic range. The method also includes mapping the first set of scale factors to a second set of scale factors associated with the second dynamic range, and replacing the first set of scale factors in the encoded audio signal with the second set of scale factors to create a modified encoded audio signal for decoding and reassembly at the playback destination. The system includes control logic for performing the method. The product includes a storage medium having computer readable programmed instructions for performing the method.

    摘要翻译: 一种用于修改编码音频信号的动态范围的方法,系统和产品。 该方法包括接收经编码的音频信号,编码的音频信号具有与第一动态范围相关联的第一组比例因子,以及识别编码的音频信号的回放目的地,所述回放目的地具有第二动态范围。 该方法还包括将第一组比例因子映射到与第二动态范围相关联的第二组缩放因子,并用第二组缩放因子替换编码音频信号中的第一组比例因子,以创建经修改的编码 用于在回放目的地解码和重组的音频信号。 该系统包括用于执行该方法的控制逻辑。 该产品包括具有用于执行该方法的计算机可读编程指令的存储介质。

    Scalable low resource dialog manager
    10.
    发明授权
    Scalable low resource dialog manager 有权
    可扩展的低资源对话管理器

    公开(公告)号:US06513009B1

    公开(公告)日:2003-01-28

    申请号:US09460961

    申请日:1999-12-14

    IPC分类号: G10L2100

    CPC分类号: G10L15/22

    摘要: A spoken language interface between a user and at least one application or system includes a dialog manager operatively coupled to the application or system, an audio input system, an audio output system, a speech decoding engine and a speech synthesizing engine; and at least one user interface data set operatively coupled to the dialog manager, the user interface data set representing spoken language interface elements and data recognizable by the application. The dialog manager enables connection between the input audio system and the speech decoding engine such that a spoken utterance provided by the user is provided from the input audio system to the speech decoding engine. The speech decoding engine decodes the spoken utterance to generate a decoded output which is returned to the dialog manager. The dialog manager uses the decoded output to search the user interface data set for a corresponding spoken language interface element and data which is returned to the dialog manager when found, and provides the spoken language interface element associated data to the application for processing in accordance therewith. The application, on processing that element, provides a reference to an interface element to be spoken. The dialog manager enables connection between the audio output system and the speech synthesizing engine such that the speech synthesizing engine which, accepting data from that element, generates a synthesized output that expresses that element, the audio output system audibly presenting the synthesized output to the user.

    摘要翻译: 用户与至少一个应用或系统之间的口语界面包括可操作地耦合到应用或系统的对话管理器,音频输入系统,音频输出系统,语音解码引擎和语音合成引擎; 以及可操作地耦合到对话管理器的至少一个用户界面数据集,表示语言界面元素的用户界面数据集和应用可识别的数据。 对话管理器使得输入音频系统和语音解码引擎之间能够连接,使得由用户提供的讲话话语从输入音频系统提供给语音解码引擎。 语音解码引擎对口语发音进行解码以产生被返回给对话管理器的解码输出。 对话管理器使用解码的输出来搜索用于相应的语言接口元素的用户界面数据集和在发现对话管理器时返回到对话管理器的数据,并且将口语语言接口元素相关联的数据提供给应用以进行处理 。 在处理该元素时,该应用程序提供了要使用的接口元素的引用。 该对话管理器使音频输出系统与语音合成引擎之间能够连接,从而使从该元件接收数据的语音合成引擎生成表示该元素的合成输出,音频输出系统向用户可听地呈现合成输出 。