AUDIO FILE FORMAT CONVERSION
    71.
    发明申请
    AUDIO FILE FORMAT CONVERSION 审中-公开
    音频文件格式转换

    公开(公告)号:WO2005013491A3

    公开(公告)日:2005-03-24

    申请号:PCT/EP2004007744

    申请日:2004-07-13

    CPC classification number: G10L19/173

    Abstract: According to the invention, the manipulation of audio data can be simplified, such as, for example, with relation to the combination of individual audio channels to give multi-channel audio data streams, or for the general manipulation of an audio data stream, whereby a data block is modified (56) in an audio data stream (10), divided into data blocks (10a, 10b) with determining blocks (14, 16) and data block audio data (18) such as, for example, by inclusion in, addition to, or replacement of a part thereof, itself containing a length indicator which expresses a data amount or length of the block audio data, or a data amount or length of the data block, such as to give a second audio data stream with modified data blocks. Alternatively, an audio data stream (10) with pointers in determining blocks (14, 10), which point to the determining block audio data (44, 46), allocated to the determining blocks, but distributed in various data blocks, is converted into an audio stream, whereby the determining block audio data (44, 46) are combined to give coherent determining block audio data (48). The coherent determining block audio data (48) can be contained with the corresponding determining block (14, 16) in a self-contained channel element (52a).

    Abstract translation: 音频数据的处理可变得容易,例如 一般对于各个音频数据的组合流提供给多通道音频数据流或通过在音频数据流(10)为数据块处理的音频数据流(10A,10B)与判定块(14,16)和数据块的音频数据(18),被构造,一个 修改后的数据块(56),如 它们通过添加或添加或通过替换部分,以使相同长度指示符包含指示数据块的数据量或长度是音频数据或数据量或长度desDatenblocks以获得第二音频数据流与修改的数据块。 或者,它是具有指针的音频数据流(10)在形成于指派但分布在不同的数据块确定块的音频数据(44,46)被转移到的音频数据流的判定块判定块(14,10),其中,所述确定块的音频数据(44,46 )zuzusammenhängenden确定块的音频数据(48)进行了总结。 连续测定块的音频数据(48)然后可以用他们的确定块(14,16)一起可以包括在一个本身完成了信道构件(52A)。

    AUDIODATEIFORMATUMWANDLUNG
    72.
    发明申请
    AUDIODATEIFORMATUMWANDLUNG 审中-公开
    音频文件格式转换

    公开(公告)号:WO2005013491A2

    公开(公告)日:2005-02-10

    申请号:PCT/EP2004/007744

    申请日:2004-07-13

    IPC: H03M

    CPC classification number: G10L19/173

    Abstract: Die Handhabung mit Audiodaten kann erleichtert werden, wie z.B. im Hinblick auf die Zusammenfassung einzelner Audiodatenströme zu Mehrkanal-Audiodatenströmen oder die Handhabung eines Audiodatenstroms allgemein, indem in einem Audiodatenstrom (10), der in Datenblöcke (10a, 10b) mit Bestimmungsblock (14, 16) und Datenblockaudiodaten (18) gegliedert ist, ein Datenblock modifiziert (56) wird, wie z.B. durch Ergänzung bzw. Hinzufügung oder durch Ersetzung eines Teils desselben, damit derselbe eine Längenangabeenthält, die eine Datenmenge bzw. Länge der Datenblockaudiodaten oder eine Datenmenge bzw. Länge desDatenblocks angibt, um einen zweiten Audiodatenstrom mit modifizierten Datenblöcken zu erhalten. Oder es wird ein Audiodatenstrom (10) mit Zeigern in Bestimmungsblöcken (14, 10), die auf die den Bestimmungsblöcken zugeordneten aber in verschiedene Datenblöcke verteilten Bestimmungsblockaudiodaten (44, 46) zeigen, in einen Audiodatenstrom überführt, bei dem die Bestimmungsblockaudiodaten (44, 46) zuzusammenhängenden Bestimmungsblockaudiodaten (48) zusammengefasst sind. Die zusammenhängenden Bestimmungsblockaudiodaten (48) können dann zusammen mit ihrem Bestimmungsblock (14, 16) in einem in sich abgeschlossenem Kanalelement (52a) enthalten sein.

    Abstract translation: 对音频数据的处理可以很方便,例如, 关于个体Audiodatenstr&OUML的组合;我到多声道AudiodatenstrÖ男人或通过溶解,在音频数据流(10)成数据&OUML通常处理的音频数据流;块(10A,10B)与判定块(14,16)和数据块的音频数据( 18),数据块被修改(56),例如 通过补充或替换其一部分以包括指示数据块音频数据的数据量或数据块的数据量的长度指示到第二音频数据流 修改数据块。 或者,它是与在Bestimmungsbl&OUML PUSH指针的音频数据流(10)(14,10)形成在所述Bestimmungsbl&OUML但CKEN各种Datenbl&OUML相关联; CKE分布确定块的音频数据(44,46)显示,在一个音频数据流导航用途Berf导航用途 其中目的地块音频数据(44,46)被组合成连续的目的地块音频数据(48)。 然后,连续的目的地块音频数据(48)可以与其目的地块(14,16)一起被包括在独立的信道单元(52a)中,

    METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING
    73.
    发明申请
    METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING 审中-公开
    用于源控制的可变比特率宽带语音编码的方法和设备

    公开(公告)号:WO2004034379A3

    公开(公告)日:2004-12-23

    申请号:PCT/CA0301571

    申请日:2003-10-09

    Inventor: JELINEK MILAN

    CPC classification number: G10L19/24 G10L19/012 G10L19/173

    Abstract: Speech signal classification and encoding systems and methods are disclosed herein. The signal classification is done in three steps each of them discriminating a specific signal class. First, a voice activity detector (VAD) discriminates between active and inactive speech frames. If an inactive speech frame is detected (background noise signal) then the classification chain ends and the frame is encoded with comfort noise generation (CNG). If an active speech frame is detected, the frame is subjected to a second classifier dedicated to discriminate unvoiced frames. If the classifier classifies the frame as unvoiced speech signal, the classification chain ends, and the frame is encoded using a coding method optimized for unvoiced signals. Otherwise, the speech frame is passed through to the "stable voiced" classification module. If the frame is classified as stable voiced frame, then the frame is encoded using a coding method optimized for stable voiced signals. Otherwise, the frame is likely to contain a non-stationary speech segment such as a voiced onset or rapidly evolving voiced speech signal. In this case a general-purpose speech coder is used at a high bit rate for sustaining good subjective quality .

    Abstract translation: 在此公开了语音信号分类和编码系统和方法。 信号分类分三个步骤完成,每个步骤区分特定的信号类别。 首先,语音活动检测器(VAD)区分活动和非活动语音帧。 如果检测到不活动的语音帧(背景噪声信号),则分类链结束,并且该帧被编码以舒适噪声产生(CNG)。 如果检测到活动语音帧,则该帧经受专用于区分无声帧的第二分类器。 如果分类器将该帧分类为清音语音信号,则分类链结束,并且使用针对清音信号优化的编码方法对帧进行编码。 否则,语音帧被传递到“稳定浊音”分类模块。 如果帧被分类为稳定浊音帧,则使用针对稳定浊音信号优化的编码方法对帧进行编码。 否则,帧可能包含非平稳的语音片段,例如浊音起始或快速演变的浊音语音信号。 在这种情况下,通用语音编码器以高比特率使用,以维持良好的主观质量。

    600 BPS MIXED EXCITATION LINEAR PREDICTION TRANSCODING
    74.
    发明申请
    600 BPS MIXED EXCITATION LINEAR PREDICTION TRANSCODING 审中-公开
    600 BPS混合激励线性预测平移

    公开(公告)号:WO2004070541A2

    公开(公告)日:2004-08-19

    申请号:PCT/US2004/002421

    申请日:2004-01-29

    IPC: G06F

    CPC classification number: G10L19/087 G10L19/173

    Abstract: Vector quantization techniques reduce the effective bit rate to 600 bps while maintaining intelligible speech. Four frames of speech are combined into one frame. The system uses mixed excitation linear prediction speech model parameters to quantized the frame and achieve a fixed rate of 600 bps. The system allows voice communication over bandwidth constrained channels.

    Abstract translation: 矢量量化技术将有效比特率降低到600bps,同时保持可理解的语音。 四帧语音组合成一帧。 该系统采用混合激励线性预测语音模型参数对帧进行量化,达到固定速率600bps。 该系统允许通过带宽受限通道进行语音通信。

    METHOD AND APPARATUS FOR FAST CELP IF PARAMETER MAPPING
    75.
    发明申请
    METHOD AND APPARATUS FOR FAST CELP IF PARAMETER MAPPING 审中-公开
    如果参数映射快速CELP的方法和设备

    公开(公告)号:WO2004038924A1

    公开(公告)日:2004-05-06

    申请号:PCT/AU2003/001412

    申请日:2003-10-24

    CPC classification number: G10L19/173

    Abstract: An apparatus and method for mapping CELP parameters between a source codec and a destination codec. The apparatus includes an LSP mapping module, an adaptive codebook mapping module coupled to the LSP mapping module, and a fixed codebook mapping module coupled to the LSP mapping module and the adaptive codebook mapping module. The LSP mapping module includes an LP overflow module and an LSP parameter modification module. The adaptive codebook mapping module includes a first pitch gain codebook. The fixed codebook mapping module includes a first target processing module, a pulse search module, a fixed codebook gain estimation module, a pulse position searching module.

    Abstract translation: 用于在源编解码器和目标编解码器之间映射CELP参数的装置和方法。 该装置包括LSP映射模块,耦合到LSP映射模块的自适应码本映射模块以及耦合到LSP映射模块和自适应码本映射模块的固定码本映射模块。 LSP映射模块包括一个LP溢出模块和一个LSP参数修改模块。 自适应码本映射模块包括第一音调增益码本。 固定码本映射模块包括第一目标处理模块,脉冲搜索模块,固定码本增益估计模块,脉冲位置搜索模块。

    METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING
    76.
    发明申请
    METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING 审中-公开
    源控制可变比特率宽带语音编码的方法和设备

    公开(公告)号:WO2004034379A2

    公开(公告)日:2004-04-22

    申请号:PCT/CA2003/001571

    申请日:2003-10-09

    Inventor: JELINEK, Milan

    CPC classification number: G10L19/24 G10L19/012 G10L19/173

    Abstract: Speech signal classification and encoding systems and methods are disclosed herein. The signal classification is done in three steps each of them discriminating a specific signal class. First, a voice activity detector (VAD) discriminates between active and inactive speech frames. If an inactive speech frame is detected (background noise signal) then the classification chain ends and the frame is encoded with comfort noise generation (CNG). If an active speech frame is detected, the frame is subjected to a second classifier dedicated to discriminate unvoiced frames. If the classifier classifies the frame as unvoiced speech signal, the classification chain ends, and the frame is encoded using a coding method optimized for unvoiced signals. Otherwise, the speech frame is passed through to the "stable voiced" classification module. If the frame is classified as stable voiced frame, then the frame is encoded using a coding method optimized for stable voiced signals. Otherwise, the frame is likely to contain a non-stationary speech segment such as a voiced onset or rapidly evolving voiced speech signal. In this case a general-purpose speech coder is used at a high bit rate for sustaining good subjective quality .

    Abstract translation: 本文公开了语音信号分类和编码系统和方法。 信号分类通过三个步骤完成,每个步骤区分特定的信号类别。 首先,语音活动检测器(VAD)在有效和无效的语音帧之间进行区分。 如果检测到无效语音帧(背景噪声信号),则分类链结束,并且以舒适噪声产生(CNG)编码该帧。 如果检测到活动语音帧,则该帧经受专用于区分清音帧的第二分类器。 如果分类器将帧分类为无声语音信号,则分类链结束,并且使用针对无声信号优化的编码方法对帧进行编码。 否则,将语音帧传递到“稳定浊音”分类模块。 如果帧被分类为稳定的有声帧,则使用针对稳定浊音信号优化的编码方法对帧进行编码。 否则,该帧可能包含诸如有声开始或快速演进的有声语音信号之类的非平稳语音段。 在这种情况下,通用语音编码器以高比特率被使用以维持良好的主观质量。

    TRANSCODING OF SPEECH IN A PACKET NETWORK ENVIRONMENT
    77.
    发明申请
    TRANSCODING OF SPEECH IN A PACKET NETWORK ENVIRONMENT 审中-公开
    一个分组网络环境中的语音翻译

    公开(公告)号:WO2003098598A1

    公开(公告)日:2003-11-27

    申请号:PCT/US2003/006335

    申请日:2003-02-26

    CPC classification number: H04W88/181 G10L19/173

    Abstract: There is provided transcoding of speech in a packet network environment. A decoder configured to receive a first bit-stream encoded according to a first coding scheme. The decoder decodes the bit-stream according to the first coding scheme, generates a plurality of first speech samples, and extracts a plurality of first speech parameters, which may include spectral characteristics, energy, pitch and/or pitch gain. A converter then converts the plurality first speech samples and plurality of first speech parameters to a plurality of second speech samples and a plurality of second speech parameters for use according to a second coding scheme. The first and second coding schemes may be, for example, G.711, G.723.1, G.726 or G.729, and may be parametric or non-parametric. An encoder receives the plurality of second speech samples and plurality of second speech parameters and generates a second bit-stream according to the second coding scheme.

    Abstract translation: 在分组网络环境中提供语音转码。 解码器,被配置为接收根据第一编码方案编码的第一比特流。 解码器根据第一编码方案解码比特流,产生多个第一语音样本,并且提取多个第一语音参数,其可以包括频谱特性,能量,音调和/或音调增益。 然后,A转换器将多个第一语音样本和多个第一语音参数转换为多个第二语音样本和多个第二语音参数,以便根据第二编码方案使用。 第一和第二编码方案可以是例如G.711,G.723.1,G.726或G.729,并且可以是参数或非参数的。 编码器接收多个第二语音样本和多个第二语音参数,并根据第二编码方案生成第二比特流。

    SYSTEM AND METHOD FOR REDUCING DATA QUALITY DEGRADATION DUE TO ENCODING/DECODING
    78.
    发明申请
    SYSTEM AND METHOD FOR REDUCING DATA QUALITY DEGRADATION DUE TO ENCODING/DECODING 审中-公开
    用于减少编码/解码的数据质量降级的系统和方法

    公开(公告)号:WO2002099986A1

    公开(公告)日:2002-12-12

    申请号:PCT/US2002/017478

    申请日:2002-06-04

    CPC classification number: G10L19/005 G10L19/173

    Abstract: Transliteration architectures reduce the number of encoding/decoding steps required to transmit telephony data. The reduction of encoding/decoding steps improves the quality of the transmitted data due to the avoidance of the significant adverse effects on the data from encoding and decoding. The reduction is accomplished using a transliteration device or through bypassing the transliteration device. A universal vocoder is proposed that allows the vocoding element to encode or decode data according to any desired vocoder format. Network routing considerations allow optimal decisions on which vocoder formats to use. Network routing decisions can be bases on vocoder formats used.

    Abstract translation: 音译架构减少传输电话数据所需的编码/解码步骤数量。 由于避免了对编码和解码的数据的显着的不利影响,编码/解码步骤的减少提高了传输数据的质量。 使用音译设备或通过绕过音译设备实现缩小。 提出了一种通用声码器,其允许声码元件根据任何期望的声码器格式对数据进行编码或解码。 网络路由考虑允许使用哪些声码器格式的最佳决策。 网络路由决定可以基于使用的声码器格式。

    METHOD AND APPARATUS FOR INTEROPERABILITY BETWEEN VOICE TRANSMISSION SYSTEMS DURING SPEECH INACTIVITY
    79.
    发明申请
    METHOD AND APPARATUS FOR INTEROPERABILITY BETWEEN VOICE TRANSMISSION SYSTEMS DURING SPEECH INACTIVITY 审中-公开
    在语音不活动期间语音传输系统之间的互操作性的方法和装置

    公开(公告)号:WO2002065458A2

    公开(公告)日:2002-08-22

    申请号:PCT/US2002/003013

    申请日:2002-01-30

    CPC classification number: G10L19/173

    Abstract: The disclosed embodiments provide a method and apparatus for interoperability between CTX and DTX communications systems during transmissions of silence or background noise. Continuous eight rate encoded noise frames are translated to discontinuous SID frames for transmission to DTX systems (402-410). Discontinuous SID frames are translated to continuous eight rate encoded noise frames for decoding by a CTX system (602-606). Applications of CTX to DTX interoperability comprise CDMA and GSM interoperability (narrowband voice transmission systems), CDMA next generation vocoder (The Selectable Mode Vocoder) interoperability with the new ITU-T 4 kbps vocoder operating in DTX-mode for Voice Over IP applications, future voice transmission systems that have a common speech encoder/decoder but operate in differing CTX or DTX modes during speech non-activity, and CDMA wideband voice transmission system interoperability with other wideband voice transmission systems with common wideband vocoders but with different modes of operation (DTX or CTX) during voice non-activity).

    Abstract translation: 所公开的实施例提供了在静音或背景噪声的传输期间CTX和DTX通信系统之间的互操作性的方法和装置。 连续的八个速率编码的噪声帧被转换为不连续的SID帧以传输到DTX系统(402-410)。 不连续的SID帧被转换为连续的八个速率编码的噪声帧,用于由CTX系统(602-606)进行解码。 CTX到DTX互操作性的应用包括CDMA和GSM互操作性(窄带语音传输系统),CDMA下一代声码器(可选模式声码器)与用于IP语音应用的DTX模式下运行的新ITU-T 4 kbps声码器的互操作性,未来 语音传输系统具有通用语音编码器/解码器,但在语音非活动期间以不同的CTX或DTX模式工作,以及CDMA宽带语音传输系统与具有普通宽带声码器但具有不同操作模式(DTX)的其他宽带语音传输系统的互操作性 或CTX)在语音非活动期间)。

    CELP TRANSCODING
    80.
    发明申请
    CELP TRANSCODING 审中-公开

    公开(公告)号:WO0048170A9

    公开(公告)日:2001-09-07

    申请号:PCT/US0003855

    申请日:2000-02-14

    Applicant: QUALCOMM INC

    Inventor: DEJACO ANDREW P

    CPC classification number: G10L19/12 G10L19/173

    Abstract: A method and apparatus for CELP-based to CELP-based vocoder packet translation. The apparatus includes a formant parameter translator and an excitation parameter translator. The formant parameter translator includes a model order converter and a time base converter. The method includes the steps of translating the formant filter coefficients of the input packet from the input CELP format to the output CELP format and translating the pitch and codebook parameters of the input speech packet from the input CELP format to the output CELP format. The step of translating the formant filter coefficients includes the steps of converting the model order of the formant filter coefficients from the model order of the input CELP format to the model order of the output CELP format and converting the time base of the resulting coefficients from the input CELP format time base to the output CELP format time base.

    Abstract translation: 一种用于基于CELP的基于CELP的声码器分组转换的方法和装置。 该装置包括共振峰参数转换器和激励参数转换器。 共振峰参数转换器包括模型顺序转换器和时基转换器。 该方法包括以下步骤:将输入分组的共振峰滤波器系数从输入CELP格式转换为输出CELP格式,并将输入语音分组的音调和码本参数从输入CELP格式转换为输出CELP格式。 翻译共振峰滤波器系数的步骤包括将共振峰滤波器系数的模型顺序从输入CELP格式的模型顺序转换为输出CELP格式的模型阶数的步骤,并将所得系数的时基从 输入CELP格式时基输出CELP格式时基。

Patent Agency Ranking