Patent search cpc:"G10L19/173" Page 1

1.

发明申请
サーバ装置、およびサーバ装置の情報処理方法、並びにプログラム审中-公开
Title translation: 服务器设备和服务器信息处理方法和程序

公开(公告)号：WO2016009863A1

公开(公告)日：2016-01-21

申请号：PCT/JP2015/069380

申请日：2015-07-06

Applicant: ソニー株式会社

Inventor： 徳永　竜二 , 福地　弘行

IPC: H04N21/2343 , H04N21/233

CPC classification number: H04L65/601 , G06F3/165 , G10L19/167 , G10L19/173 , H04L65/4076 , H04L65/4084 , H04L65/602 , H04L65/80 , H04L67/10 , H04N21/233 , H04N21/2335 , H04N21/4852

Abstract: 　本技術は、再生装置に特別な構成を付加すること無く配信されるコンテンツを、再生装置の音声出力機能に最適化して再生できるようにするサーバ装置、およびサーバ装置の情報処理方法、並びにプログラムに関する。測定用音声を再生装置であるテレビジョン受像機で出力させる。携帯端末は、再生装置より出力された音声を集音して、サーバ装置に送信する。サーバ装置は、音声に基づいて、再生装置毎の音声出力による性能と機能に応じた調整情報を記憶し、コンテンツを再生する際に、調整情報に基づいて、コンテンツデータを調整して再生装置に配信する。本技術は、コンテンツ配信システムに適用することができる。

Abstract translation: 本发明的技术涉及：能够针对再现装置的音频输出功能对内容进行广播而不向再现装置添加特殊配置的服务器装置，并由其进行再现; 服务器设备信息处理方法; 和一个程序。使作为再现装置的电视接收机输出测量用音频。便携式终端从再现设备收集音频输出，并将其发送到服务器设备。基于音频，服务器装置存储与再现装置的音频输出的再现装置的性能和功能对应的调整信息，并且在再现内容时，服务器装置基于该调整信息来调整内容数据的调整信息并将数据广播到再现设备。本发明的技术可以应用于内容广播系统。

2.

发明申请
TRANSITION FROM A TRANSFORM CODING/DECODING TO A PREDICTIVE CODING/DECODING 审中-公开
Title translation: 从变换编码/解码到预测编码/解码的过渡

公开(公告)号：WO2015071613A2

公开(公告)日：2015-05-21

申请号：PCT/FR2014052923

申请日：2014-11-14

Applicant: ORANGE

Inventor： FAURE JULIEN , RAGOT STÉPHANE

IPC: G10L19/20

CPC classification number: G10L19/0212 , G10L19/022 , G10L19/04 , G10L19/173 , G10L19/20 , G10L19/26

Abstract: The invention pertains to a method of decoding a digital audio signal, comprising the steps of decoding (E602) according to an inverse transform decoding of a previous frame of samples of the digital signal, which frame is received and coded according to a transform coding, decoding (E608) according to a predictive decoding of a current frame of samples of the digital signal, which frame is received and coded according to a predictive coding. The predictive decoding of the current frame is a transition predictive decoding which does not use any adaptive dictionary arising from the previous frame and the method furthermore comprises a step of reinitialization (E606) of at least one state of the predictive decoding to a predetermined default value, an add-overlap step (E609) which combines a signal segment synthesized by predictive decoding of the current frame and a signal segment synthesized by inverse transform decoding, corresponding to a stored segment of the decoding of the previous frame. The invention pertains correlatively to a method of coding comprising a reinitialization of at least one state of the predictive coding to a predetermined default value. It pertains to a coder and decoder implementing the respective methods.

Abstract translation: 本发明涉及一种对数字音频信号进行解码的方法，包括以下步骤：根据数字信号的先前帧的逆变换解码来解码（E602），该帧根据变换编码被接收和编码，根据对数字信号的当前采样帧的预测解码的解码（E608），该帧根据预测编码被接收和编码。当前帧的预测解码是不使用从前一帧产生的任何自适应字典的转换预测解码，并且该方法还包括将预测解码的至少一种状态重新初始化（E606）到预定默认值的步骤加法重叠步骤（E609），其组合通过对当前帧的预测解码合成的信号段和通过逆变换解码合成的信号段，对应于存储的前一帧的解码段。本发明与包括将预测编码的至少一个状态重新初始化为预定默认值的编码方法相关。它涉及实现相应方法的编码器和解码器。

3.

发明申请
REDUCED COMPLEXITY CONVERTER SNR CALCULATION 审中-公开
Title translation: 降低复杂度转换器SNR计算

公开(公告)号：WO2014072260A3

公开(公告)日：2014-07-10

申请号：PCT/EP2013072961

申请日：2013-11-04

Applicant: DOLBY INT AB , DOLBY LAB LICENSING CORP

Inventor： SCHUG MICHAEL , WILLIAMS PHILLIP

IPC: G10L19/032 , G10L19/16

CPC classification number: G10L19/008 , G10L19/02 , G10L19/032 , G10L19/173

Abstract: The present document relates to audio encoding / decoding. In particular, the present document relates to a method and system for reducing the complexity of a bit allocation process used in the context of audio encoding / decoding. An audio encoder (300) configured to encode an audio signal according to a first audio codec system is described. The audio encoder (300) comprises a transform unit (302) configured to determine a set of spectral coefficients (312) based on the audio signal. Furthermore, the encoder (300) comprises a floating-point encoding unit (304) configured to determine a set of scale factors and a set of scaled values (314), based on the set of spectral coefficients (312); and to encode the set of scale factors to yield a set of encoded scale factors (313). In addition, the encoder (300) comprises a bit allocation and quantization unit (305, 306) configured to determine a total number of available bits for quantizing the set of scaled values (314), based on a first target data-rate and based on the number of bits used for the set of encoded scale factors (313); to determine a first control parameter (315) indicative of an allocation of the total number of available bits for quantizing the scaled values of the set of scaled values (314); and to quantize the set of scaled values (314) in accordance to the first control parameter (315) to yield a set of quantized scaled values (317). Furthermore, the encoder (300) comprises a transcoding simulation unit (320) configured to determine a second control parameter (321) based on the first control parameter (315); wherein the second control parameter (321) enables a transcoder to convert the first bitstream into a second bitstream at a second target data-rate; wherein the second bitstream accords to a second audio codec system different from the first audio codec system; and wherein the first bitstream comprises the second control parameter.

Abstract translation: 本文件涉及音频编码/解码。特别地，本文件涉及用于降低在音频编码/解码的上下文中使用的比特分配过程的复杂度的方法和系统。描述了被配置为根据第一音频编解码器系统对音频信号进行编码的音频编码器（300）。音频编码器（300）包括被配置为基于音频信号确定一组频谱系数（312）的变换单元（302）。此外，编码器（300）包括浮点编码单元（304），其被配置为基于频谱系数集合（312）确定一组比例因子和一组缩放值（314）; 并编码一组比例因子以产生一组经编码的比例因子（313）。另外，编码器（300）包括位分配和量化单元（305,306），其被配置为基于第一目标数据速率和基于所述量化单元确定用于量化所述一组缩放值（314）的可用比特的总数关于用于编码比例因子集合的位数（313）; 以确定指示用于量化所述一组缩放值（314）的所述缩放值的可用比特的总数的分配的第一控制参数（315）。并且根据第一控制参数（315）量化缩放值集合（314）以产生一组量化缩放值（317）。此外，编码器（300）包括被配置为基于第一控制参数（315）确定第二控制参数（321）的代码转换模拟单元（320）。其中所述第二控制参数（321）使得代码转换器能够以第二目标数据速率将所述第一比特流转换为第二比特流; 其中所述第二位流符合与所述第一音频编解码器系统不同的第二音频编解码器系统; 并且其中所述第一比特流包括所述第二控制参数。

4.

发明申请
REDUCED COMPLEXITY CONVERTER SNR CALCULATION 审中-公开
Title translation: 降低复杂度转换器SNR计算

公开(公告)号：WO2014072260A2

公开(公告)日：2014-05-15

申请号：PCT/EP2013/072961

申请日：2013-11-04

Applicant: DOLBY INTERNATIONAL AB , DOLBY LABORATORIES LICENSING CORPORATION

Inventor： SCHUG, Michael , WILLIAMS, Phillip

IPC: G10L19/03

CPC classification number: G10L19/008 , G10L19/02 , G10L19/032 , G10L19/173

Abstract: The present document relates to audio encoding / decoding. In particular, the present document relates to a method and system for reducing the complexity of a bit allocation process used in the context of audio encoding / decoding. An audio encoder (300) configured to encode an audio signal according to a first audio codec system is described. The audio encoder (300) comprises a transform unit (302) configured to determine a set of spectral coefficients (312) based on the audio signal. Furthermore, the encoder (300) comprises a floating-point encoding unit (304) configured to determine a set of scale factors and a set of scaled values (314), based on the set of spectral coefficients (312); and to encode the set of scale factors to yield a set of encoded scale factors (313). In addition, the encoder (300) comprises a bit allocation and quantization unit (305, 306) configured to determine a total number of available bits for quantizing the set of scaled values (314), based on a first target data-rate and based on the number of bits used for the set of encoded scale factors (313); to determine a first control parameter (315) indicative of an allocation of the total number of available bits for quantizing the scaled values of the set of scaled values (314); and to quantize the set of scaled values (314) in accordance to the first control parameter (315) to yield a set of quantized scaled values (317). Furthermore, the encoder (300) comprises a transcoding simulation unit (320) configured to determine a second control parameter (321) based on the first control parameter (315); wherein the second control parameter (321) enables a transcoder to convert the first bitstream into a second bitstream at a second target data-rate; wherein the second bitstream accords to a second audio codec system different from the first audio codec system; and wherein the first bitstream comprises the second control parameter.

Abstract translation: 本文件涉及音频编码/解码。特别地，本文件涉及用于降低在音频编码/解码的上下文中使用的比特分配处理的复杂度的方法和系统。描述了被配置为根据第一音频编解码器系统对音频信号进行编码的音频编码器（300）。音频编码器（300）包括被配置为基于音频信号确定一组频谱系数（312）的变换单元（302）。此外，编码器（300）包括浮点编码单元（304），该浮点编码单元（304）被配置为基于该组频谱系数（312）确定一组缩放因子和一组缩放值（314）。并对该组比例因子进行编码以产生一组编码比例因子（313）。另外，编码器（300）包括比特分配和量化单元（305,306），该比特分配和量化单元被配置为基于第一目标数据速率来确定用于量化该组缩放值（314）的可用比特的总数用于该组编码比例因子的比特数（313）; 确定指示可用比特总数的分配的第一控制参数（315），用于量化该组缩放值（314）的缩放值; 并且根据第一控制参数（315）量化该组缩放值（314）以产生一组量化缩放值（317）。此外，编码器（300）包括被配置为基于第一控制参数（315）确定第二控制参数（321）的转码模拟单元（320）; 其中所述第二控制参数（321）使代码转换器能够以第二目标数据速率将所述第一比特流转换为第二比特流; 其中所述第二比特流符合不同于所述第一音频编解码器系统的第二音频编解码器系统; 并且其中第一比特流包括第二控制参数。

5.

发明申请
PROCESSING AUDIO OBJECTS IN PRINCIPAL AND SUPPLEMENTARY ENCODED AUDIO SIGNALS 审中-公开
Title translation: 在主要和补充编码的音频信号中处理音频对象

公开(公告)号：WO2014035864A1

公开(公告)日：2014-03-06

申请号：PCT/US2013/056576

申请日：2013-08-26

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor： HOOKS, Spencer , SANCHEZ, Freddie

IPC: G10L19/008 , G10L19/16 , H04H20/89 , H04M3/56 , G10L19/20 , H04S3/00

CPC classification number: G10L19/008 , G10L19/167 , G10L19/173 , H04H20/89 , H04H60/04 , H04S1/005 , H04S1/007 , H04S3/004 , H04S3/008 , H04S2420/03

Abstract: Methods and apparatuses are disclosed that can combine audio content from two encoded input signals into a new encoded output signal without requiring a decode or re-encode of audio content in either encoded input signal. Encoded data representing audio content and spatial location of audio objects in two different input encoded signals are combined to generate an encoded output signal that has encoded data representing audio objects from both of the input encoded signals.

Abstract translation: 公开了可以将来自两个编码的输入信号的音频内容组合成新的编码输出信号的方法和装置，而不需要在编码的输入信号中对音频内容进行解码或重新编码。表示音频内容的编码数据和在两个不同的输入编码信号中的音频对象的空间位置被组合以产生编码的输出信号，其编码表示来自两个输入编码信号的音频对象的数据。

6.

发明申请
ENCODING AND REPRODUCTION OF THREE DIMENSIONAL AUDIO SOUNDTRACKS 审中-公开
Title translation: 编码和复制三维音频声音

公开(公告)号：WO2012125855A1

公开(公告)日：2012-09-20

申请号：PCT/US2012/029277

申请日：2012-03-15

Applicant: DTS, INC. , JOT, Jean-Marc , FEJZO, Zoran , JOHNSTON, James, D.

Inventor： JOT, Jean-Marc , FEJZO, Zoran , JOHNSTON, James, D.

IPC: G01L19/00

CPC classification number: G10L19/008 , G10L19/173 , G10L19/20 , H04S3/004 , H04S3/008 , H04S2400/01 , H04S2400/03 , H04S2420/01

Abstract: The present invention provides a novel end-to-end solution for creating, encoding, transmitting, decoding and reproducing spatial audio soundtracks. The provided soundtrack encoding format is compatible with legacy surround- sound encoding formats, so that soundtracks encoded in the new format may be decoded and reproduced on legacy playback equipment with no loss of quality compared to legacy formats.

Abstract translation: 本发明提供了用于创建，编码，传输，解码和再现空间音频声轨的新型端到端解决方案。所提供的音轨编码格式与传统的环绕声编码格式兼容，使得以旧格式编码的音轨可以在传统播放设备上被解码和再现，与传统格式相比，不损失质量。

7.

发明申请
PSYCHOACOUSTIC FILTER DESIGN FOR RATIONAL RESAMPLERS 审中-公开
Title translation: PSYCHOACOUSTIC过滤器设计用于RATIONAL RESAMPLERS

公开(公告)号：WO2012076689A1

公开(公告)日：2012-06-14

申请号：PCT/EP2011/072311

申请日：2011-12-09

Applicant: DOLBY INTERNATIONAL AB , SCHILDBACH, Wolfgang

Inventor： SCHILDBACH, Wolfgang

IPC: G10L19/14 , H03H17/06

CPC classification number: G10L21/0232 , G06T5/00 , G10L19/032 , G10L19/173 , G10L19/26 , H03H17/0685

Abstract: The present document relates to the design of anti-aliasing and/or anti-imaging filters for resamplers using rational resampling factors. In particular, the present document relates to a method for designing such filters having a reduced number of filter coefficients or an increased perceptual performance, as well as to the filters designed using such method. A method for designing a filter (102) configured to reduce imaging and/or aliasing of an output audio signal (113) at an output sampling rate (fs out ) is described. The output audio signal (113) is a resampled version of an input audio signal (110) at an input sampling rate (fs in ). The ratio of the output sampling rate (fs out ) and the input sampling rate (fs in ) is a rational number N/M. The filter (102) operates at an upsampled sampling rate which equals N times the input sampling rate (fs in ). The method comprises the steps of selecting an allowed deviation of the frequency response (531, 532) of the filter (102) within a stop band of the filter (102) based on a perceptual frequency response indicative of an auditory spectral sensitivity; wherein the allowed deviation indicates a deviation of the frequency response (531, 532) of the filter (102) from a predetermined attenuation within the stop band; and of determining coefficients of the filter (102) such that the frequency response (531, 532) of the filter (102) is fitted to the allowed deviation of the frequency response (531, 532).

Abstract translation: 本文件涉及使用合理的重采样因子的重采样器的抗混叠和/或抗成像滤波器的设计。特别地，本文件涉及一种用于设计具有减少的滤波器系数数量或增加的感知性能的滤波器的方法，以及使用这种方法设计的滤波器。描述了一种用于设计滤波器（102）的方法，其被配置为以输出采样率（fsout）来减少输出音频信号（113）的成像和/或混叠。输出音频信号（113）是输入采样率（fsin）的输入音频信号（110）的再采样版本。输出采样率（fsout）和输入采样率（fsin）的比值是有理数N / M。滤波器（102）以等于输入采样率（fsin）的N倍的上采样率进行操作。该方法包括以下步骤：基于指示听觉光谱灵敏度的感知频率响应，在滤波器（102）的阻带内选择滤波器（102）的频率响应（531,532）的允许偏差; 其中所述允许偏差指示所述滤波器（102）的频率响应（531,532）与所述阻带内的预定衰减的偏差; 以及确定所述滤波器（102）的系数，使得所述滤波器（102）的频率响应（531,532）适应于所述频率响应（531,532）的允许偏差。

8.

发明申请
MULTI-STAGING RECURSIVE AUDIO FRAME-BASED RESAMPLING AND TIME MAPPING 审中-公开
Title translation: 多层次的基于音频帧的帧和时间映射

公开(公告)号：WO2010074849A1

公开(公告)日：2010-07-01

申请号：PCT/US2009/065093

申请日：2009-11-19

Applicant: TANDBERG TELEVISION INC. , WANG, Zhicheng, Lancelot , JIANG, Jianguang

Inventor： WANG, Zhicheng, Lancelot , JIANG, Jianguang

IPC: G10L19/14 , H03H17/06

CPC classification number: G10L19/173 , H03H17/0294 , H03H17/0685

Abstract: A multi-stage recursive sample rate converter ("SRC") typically embodied as digital signal processor provides for an efficient structure for converting digital audio samples at one frequency, such as 48 kHz, to another frequency, such as 44.1 kHz. A parameter codebook comprising memory stores parameters used at a plurality of stages by the SRC. For each stage, a controller coordinates the SRC to use the appropriate set of parameters from the codebook, process an input audio sample stream, and store the intermediate results in a buffer. The controller then causes the intermediate results to be processed again as input to the SRC in a subsequent stage of processing using a different set of parameters. The process is repeated until all stages are completed, and the final results arc the output digital audio data stream at the desired sampling rate.

Abstract translation: 通常实现为数字信号处理器的多级递归采样率转换器（“SRC”）提供了将一个频率（如48 kHz）的数字音频采样转换为另一个频率（如44.1 kHz）的有效结构。包括存储器的参数码本存储由SRC在多个级中使用的参数。对于每个阶段，控制器协调SRC以使用来自码本的适当的参数集，处理输入音频样本流，并将中间结果存储在缓冲器中。然后，控制器使得使用不同的参数集在后续处理阶段中再次处理中间结果作为SRC的输入。重复该过程直到所有阶段完成，并且最终结果以期望的采样率以输出的数字音频数据流为准。

9.

发明申请
一种转码方法和装置审中-公开

公开(公告)号：WO2009132544A1

公开(公告)日：2009-11-05

申请号：PCT/CN2009/071133

申请日：2009-04-01

Applicant: 华为技术有限公司 , 鲍长春 , 徐昊 , 唐繁荣 , 胡翔宇

Inventor： 鲍长春 , 徐昊 , 唐繁荣 , 胡翔宇

IPC: G10L19/14 , G10L19/04 , H03M7/30

CPC classification number: G10L19/173

10.

发明申请
MULTIPLE STREAM DECODER 审中-公开
Title translation: 多流道解码器

公开(公告)号：WO2008118834A1

公开(公告)日：2008-10-02

申请号：PCT/US2008/057974

申请日：2008-03-24

Applicant: HARRIS CORPORATION

Inventor： CHAMBERLAIN, Mark, W.

IPC: G10L19/14 , G10L19/00

CPC classification number: G10L19/173 , G10L19/008

Abstract: A method is provided for decoding data streams in a voice communication system. The method includes: receiving two or more data streams having voice data encoded therein; decoding each data stream into a set of speech coding parameters; forming a set of combined speech coding parameters by combining the sets of decoded speech coding parameters, where speech coding parameters of a given type are combined with speech coding parameters of the same type; and inputting the set of combined speech coding parameters into a speech synthesizer.

Abstract translation: 提供了一种用于对语音通信系统中的数据流进行解码的方法。该方法包括：接收其中编码有语音数据的两个或多个数据流; 将每个数据流解码成一组语音编码参数; 通过组合解码语音编码参数的集合来形成一组组合语音编码参数，其中给定类型的语音编码参数与相同类型的语音编码参数组合; 以及将该组合的语音编码参数输入到语音合成器中。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification