METHOD FOR VOICE DATA TRANSMISSION
    1.
    发明申请
    METHOD FOR VOICE DATA TRANSMISSION 审中-公开
    方法传输语音数据

    公开(公告)号:WO99040568A1

    公开(公告)日:1999-08-12

    申请号:PCT/DE1998/003812

    申请日:1998-12-30

    CPC classification number: G10L15/07 G10L15/16 G10L19/0018

    Abstract: In order to transmit voice data, the voice data flow is broken down into phonemes. A code character is assigned to each phoneme in a selective language and/or speaker-specific phoneme catalog (PN1, PN2) and transmitted to a voice synthesis device (SS) located at the transmission target (SD2), whereby the amount of data to be transmitted is considerably reduced. The voice data flow is broken down into phonemes by a neuronal network (NN), which is trained to recognize the phonemes stored in the selective language and/or speaker-specific phoneme catalog (PN1, PN2). The flow of code characters received is once again converted into a sequence of phonemes and emitted by the voice synthesis device (SS).

    Abstract translation: 用于语音数据流的语音数据的传输被划分成音素和以可选择的语音和/或说话者特定音素目录(PN1,PN2)为每个音素相关联的代码符号在发送目的地(SD2)语音合成器(SS)发送此,从而将被发送 数据IA的量 大大降低。 语音数据流的分解成音素是由经过训练的音素被存储在识别所选择的语言和/或说话者特定音素目录(PN1,PN2)的神经网络(NN)中进行。 从语音合成装置(SS)接收到的代码符号流被转换回成音素和输出的序列。

    SIGNAL DECOMPOSITION METHOD FOR SPEECH CODING
    3.
    发明申请
    SIGNAL DECOMPOSITION METHOD FOR SPEECH CODING 审中-公开
    用于语音编码的信号分解方法

    公开(公告)号:WO99063521A1

    公开(公告)日:1999-12-09

    申请号:PCT/US1999/012427

    申请日:1999-06-03

    CPC classification number: G10L19/012

    Abstract: A signal including speech and background noise is encoded by first decomposing the signal into speech and noise components. A first speech encoding algorithm is then used to generate codebook indices for the speech component and a second algorithm is applied to generate codebook indices for the noise component. The speech encoding algorithm performs better since it faces clean speech, while a simple, very low bit rate algorithm may be used to encode the noise.

    Abstract translation: 通过首先将信号分解为语音和噪声分量来编码包括语音和背景噪声的信号。 然后使用第一语音编码算法来产生用于语音分量的码本索引,并且应用第二算法来生成噪声分量的码本索引。 语音编码算法表现得更好,因为它面对清晰的语音,而简单的,非常低的比特率算法可用于对噪声进行编码。

    PROCESSING SPEECH CODING PARAMETERS IN A TELECOMMUNICATION SYSTEM
    4.
    发明申请
    PROCESSING SPEECH CODING PARAMETERS IN A TELECOMMUNICATION SYSTEM 审中-公开
    在电信系统中处理语音编码参数

    公开(公告)号:WO9627183A3

    公开(公告)日:1996-10-10

    申请号:PCT/FI9600116

    申请日:1996-02-28

    Inventor: VAINIO JANNE

    Abstract: The present invention relates to processing speech coding parameters in a telecommunication system. The speech coding parameters of a speech frame, produced by a speech encoder, are divided into groups, i.e. so-called virtual channels, in which speech parameter error correction, channel coding and processing of error-free or erroneous speech parameters are performed independently. At the receiving end, the processing (505) of erroneous and error-free speech parameters can thus be controlled independently on each virtual transmission channel (502) according to the quality of each virtual transmission channel. The speech parameters of the high-quality virtual channels of a speech frame can thus be processed as error-free, replacing the speech coding parameters of the low-quality virtual channels only. The independently processed (505) speech parameters of the virtual channels are thus reassembled (507) into a speech frame, which is applied to decoding. Since part of the information of also erroneous speech frames is utilized, the use of speech information received from a transmission channel can be increased in speech decoding, which reduces for instance interruptions occurring in speech as compared with a situation where all speech frames erroneous even to a slight degree were discarded. The increased and more focused error indication also reduces the number of undetected errors and thus reduces significantly the worst audible disturbances.

    APPARATUS AND METHODS FOR MULTICHANNEL DIGITAL AUDIO CODING
    5.
    发明申请
    APPARATUS AND METHODS FOR MULTICHANNEL DIGITAL AUDIO CODING 审中-公开
    多通道数字音频编码的装置和方法

    公开(公告)号:WO2006030289A1

    公开(公告)日:2006-03-23

    申请号:PCT/IB2005/002724

    申请日:2005-09-14

    Inventor: YOU, Yuli

    CPC classification number: G10L19/025 G10L19/008 G10L19/032

    Abstract: A low bit rate digital audio coding system includes an encoder which assigns codebooks to groups of quantization indexes based on their local properties resulting in codebook application ranges that are independent of block quantization boundaries. The invention also incorporates a resolution filter bank, or a tri-mode resolution filter bank, which is selectively switchable between high and low frequency resolution modes or high, low and intermediate modes such as when detecting transient in a frame. The result is a multichannel audio signal having a significantly lower bit rate for efficient transmission or storage. The decoder is essentially an inverse of the structure and methods of the encoder, and results in a reproduced audio signal that cannot be audibly distinguished from the original signal.

    Abstract translation: 低比特率数字音频编码系统包括:编码器,其基于其本地属性将码本分配给量化索引组,导致独立于块量化边界的码本应用范围。 本发明还包括分辨率滤波器组或三模式分辨率滤波器组,其可以在高频和低频分辨率模式或高,低和中间模式之间选择性地切换,例如当检测到帧中的瞬态时。 结果是具有用于有效传输或存储的显着较低比特率的多声道音频信号。 解码器基本上是编码器的结构和方法的逆,并且导致不能与原始信号可听区分的再现音频信号。

    METHOD OF ACCESSING A DIAL-UP SERVICE
    6.
    发明申请
    METHOD OF ACCESSING A DIAL-UP SERVICE 审中-公开
    接入服务的方法

    公开(公告)号:WO98054695A1

    公开(公告)日:1998-12-03

    申请号:PCT/US1998/009705

    申请日:1998-05-14

    Abstract: A method of accessing a dial-up service involves the following steps: (a) dialing a service number (172); (b) speaking a number of digits to form a first utterance (174); (c) recognizing the digits using speaker independent speaker recognition (176); (d) when a user has used the dial-up service previously, verifying the user based on the first utterance using a speaker verification system (178); (e) when the user cannot be verified, requesting the user enter a personal identification number (182); and (f) when the personal identification number is valid (184), providing access the dial-up service (186).

    Abstract translation: 接入拨号服务的方法包括以下步骤:(a)拨打服务号码(172); (b)说出一些数字形成第一个发音(174); (c)使用讲话者独立的演讲人识别识别数字(176); (d)当用户以前使用过拨号服务时,使用扬声器验证系统(178)基于第一话语来验证用户; (e)当用户不能被验证时,请求用户输入个人识别号码(182); 和(f)当个人识别号码有效时(184),提供接入拨号服务(186)。

    TEXT-TO-SPEECH E-MAIL READER WITH MULTI-MODAL REPLY PROCESSOR
    8.
    发明申请
    TEXT-TO-SPEECH E-MAIL READER WITH MULTI-MODAL REPLY PROCESSOR 审中-公开
    具有多模式响应处理器的文本到语音电子邮件读取器

    公开(公告)号:WO00008832A1

    公开(公告)日:2000-02-17

    申请号:PCT/US1999/016860

    申请日:1999-07-26

    Abstract: A multi-user e-mail reader system (18) allows several users to access their e-mail accounts simultaneously and have the e-mail messages played back with speech synthesis. The user navigates through various functional states of the system (Fig. 4) using either touch-tone keypad commands or optionally voiced commands interpreted by a speech recognizer (60). Users can send reply e-mail messages without the use of a computer, by invoking the system's text processor (68). The text processor operates in conjunction with a keypad-to-ASCII conversion mechanism (64) that allows fully punctuated and properly addressed e-mail messages to be composed from the touch-tone phone. Digital audio sound file attachments may be recorded through the telephone handset and attached to an outgoing e-mail message. A system for storing canned messages (74) allows the user to quickly send pre-composed reply messages, either as stored or after editing using the text processor. The text processor uses a virtual cursor pointer (72) that may be indexed forward and backward at different granularities (78, 80), depending on whether the system is in play mode or record mode (76). The granularity can also be changed by the user.

    Abstract translation: 多用户电子邮件阅读器系统(18)允许多个用户同时访问他们的电子邮件帐户,并且通过语音合成播放电子邮件消息。 用户使用语音识别器(60)解释的触摸键盘命令或可选的浊音命令来浏览系统的各种功能状态(图4)。 用户可以通过调用系统的文本处理器(68)来发送回复电子邮件而不使用计算机。 文本处理器与键盘到ASCII转换机制(64)一起操作,其允许由按键式电话组成的完全标点和正确地寻址的电子邮件消息。 数字音频声音文件附件可以通过电话听筒记录并附加到外发电子邮件。 用于存储固定消息(74)的系统允许用户使用文本处理器快速发送预先编写的应答消息,如存储或编辑之后。 根据系统是处于播放模式还是记录模式(76),文本处理器使用虚拟光标指针(72),虚拟光标指针(72)可根据不同粒度(78,80)向前和向后进行索引。 粒度也可以由用户改变。

    METHOD AND DEVICE FOR MASKING ERRORS
    9.
    发明申请
    METHOD AND DEVICE FOR MASKING ERRORS 审中-公开
    方法和系统错误隐藏

    公开(公告)号:WO99063520A1

    公开(公告)日:1999-12-09

    申请号:PCT/EP1999/003765

    申请日:1999-05-31

    Abstract: In order to mask errors, the invention provides that binary representations of parameter values are precoded on the transmission side by a linear block code before transmission over a faulty channel. In addition, the redundant information which is added in such a way is not used on the reception side for detecting errors within the binary parameter representations, rather it is exploited in the course of a parameter estimation for improving the quality of the estimated parameter values.

    Abstract translation: 用于传输之前在有故障的信道参数值的错误隐藏二进制表示,发送端通过一个线性块码预编码,并且将这样添加的冗余信息没有在用于二进制参数表示内错误检测接收端使用,但作为用于提高所述估计的参数值的质量的参数估计的一部分利用。

    UNIVERSAL VOICE OPERATED COMMAND AND CONTROL ENGINE
    10.
    发明申请
    UNIVERSAL VOICE OPERATED COMMAND AND CONTROL ENGINE 审中-公开
    通用语音操作命令和控制引擎

    公开(公告)号:WO99005671A1

    公开(公告)日:1999-02-04

    申请号:PCT/US1998/015213

    申请日:1998-07-23

    Abstract: The present invention is a system for controlling graphical user interface by voice commands. The present invention constitutes a means for receiving issued voice commands from a standard voice recognition system (18), a means for monitoring the state of a target application (16), a means for determining active voice commands from the state of the target application (12), a means for determining whether issued voice command is an active voice command, a means for associating each active voice command with a block of script code data (14), a means for issuing the block of script code data associated with the issued voice command to the graphical user interface when the issued voice command is determined to be an active voice command.

    Abstract translation: 本发明是一种通过语音命令来控制图形用户界面的系统。 本发明构成了用于从标准语音识别系统(18)接收发出的语音命令的装置,用于监视目标应用程序(16)的状态的装置,用于从目标应用程序的状态确定主动语音命令的装置( 12),用于确定所发出的语音命令是否是主动语音命令的装置,用于将每个主动语音命令与脚本代码数据块相关联的装置(14),用于发出与发出的语音命令相关联的脚本代码数据块的装置 当发出的语音命令被确定为主动语音命令时,向图形用户界面发出语音命令。

Patent Agency Ranking