专利检索 cpc:"G10L19/20" 第 6 页

51.

发明授权
Method, apparatus, and computer program product for categorical spatial analysis-synthesis on spectrum of multichannel audio signals 有权
标题翻译：用于分类空间分析的方法，装置和计算机程序产品 - 多声道音频信号的频谱合成

公开(公告)号：US09420375B2

公开(公告)日：2016-08-16

申请号：US14039357

申请日：2013-09-27

申请人： Nokia Technologies Oy

发明人： Pushkar Prasad Patwardhan , Ravi Shenoy

IPC分类号： H04R5/04 , G10L19/008 , G10L19/02 , G10L19/20 , H04S7/00

CPC分类号： H04R5/04 , G10L19/008 , G10L19/02 , G10L19/20 , H04S7/30 , H04S2400/01 , H04S2420/01

摘要： A method, apparatus and computer program product are therefore provided according to an example embodiment of the present invention in order to perform categorical analysis and synthesis of a multichannel signal to synthesize binaural signals and extract, separate, and manipulate components within the audio scene of the multichannel signal that were captured through multichannel audio means. In the context of a method, a multichannel signal is received. The method may include computing the spectrum for the multichannel signal, determining tonality of bands within the spectrum, and generating a band structure for the spectrum. The method may also include performing spatial analysis of the bands, performing source filtering using the bands, performing synthesis on the filtered band components, and generating an output signal. A corresponding apparatus and a computer program product are also provided.

摘要翻译： 因此，根据本发明的示例性实施例，提供了一种方法，装置和计算机程序产品，以便执行多通道信号的分类分析和合成，以合成双耳信号并提取，分离和操纵音频场景内的组件通过多声道音频装置捕获的多声道信号。在方法的上下文中，接收多声道信号。该方法可以包括计算多信道信号的频谱，确定频谱内的频带的音调，以及产生频谱的频带结构。该方法还可以包括执行频带的空间分析，使用频带执行源滤波，对经滤波的频带分量执行合成，以及产生输出信号。还提供了相应的装置和计算机程序产品。

52.

发明申请
CODING AND DECODING OF SPECTRAL PEAK POSITIONS 有权
标题翻译：光谱位置的编码和解码

公开(公告)号：US20160225378A1

公开(公告)日：2016-08-04

申请号：US14402406

申请日：2014-10-10

申请人： Telefonaktiebolaget L M Ericsson (publ)

发明人： Volodya Grancharov , Sigurdur SVERRISSON

IPC分类号： G10L19/02 , G10L19/00 , G10L19/22

CPC分类号： G10L19/02 , G10L19/0017 , G10L19/20 , G10L19/22 , H03M7/40 , H03M7/4031

摘要： A coder and decoder, and methods therein, are provided for coding and decoding of spectral peak positions in audio coding. According to a first aspect, an audio signal segment coding method is provided for coding of spectral peak positions. The method comprises determining which one out of two lossless spectral peak position coding schemes that requires the least number of bits to code the spectral peak positions of an audio signal segment; and selecting the spectral peak position coding scheme that requires the least number of bits to code the spectral peak positions of the audio signal segment. A first one of the two lossless spectral peak position coding schemes is suitable for periodic or semi-periodic spectral peak position distributions; and a second one of two lossless spectral peak position coding schemes is suitable for sparse spectral peak position distributions.

摘要翻译： 提供编码器和解码器及其方法，用于对音频编码中的频谱峰位置进行编码和解码。根据第一方面，提供了用于对频谱峰位置进行编码的音频信号段编码方法。该方法包括确定需要最少位数的两个无损频谱峰值位置编码方案中的哪一个编码音频信号段的频谱峰值位置; 以及选择需要最少位数来编码音频信号段的频谱峰值位置的频谱峰值位置编码方案。两个无损光谱峰值位置编码方案中的第一个适用于周期或半周期光谱峰位置分布; 并且两个无损光谱峰值位置编码方案中的第二个适用于稀疏光谱峰位置分布。

53.

发明申请
METHOD AND APPARATUS FOR PROCESSING AN AUDIO SIGNAL 有权

公开(公告)号：US20160217801A1

公开(公告)日：2016-07-28

申请号：US15089918

申请日：2016-04-04

申请人： LG Electronics Inc. , CHUNGBUK NATIONAL UNIVERSITY INDUSTRY-ACADEMIC COOPERATION FOUNDATION

发明人： Gyuhyeok JEONG , Daehwan KIM , Ingyu KANG , Lagyoung KIM , Kibong HONG , Zhigang PIAO , Insung LEE , Jongha LIM , Sanghyeon MOON , Byungsuk LEE , Hyejeong JEON

IPC分类号： G10L19/038 , G10L19/002 , G10L19/02

CPC分类号： G10L19/038 , G10L19/002 , G10L19/0204 , G10L19/0212 , G10L19/028 , G10L19/20 , G10L19/22 , G10L21/038

摘要： The present invention relates to a method for processing an audio signal, comprising: a step of performing a frequency conversion process on an audio signal to obtain a plurality of frequency transform coefficients; a step of selecting either a general mode or a non-general mode, on the basis of a pulse ratio, for the frequency transform coefficients having a high frequency band from among the plurality of frequency transform coefficients; and a step of performing, if the non-general mode is selected, the following steps: extracting a predetermined number of pulses from the frequency transform coefficients having the high frequency band, and generating pulse information; generating an original noise signal from the frequency transform coefficients having the high frequency band, excluding the pulses; generating a reference noise signal using the frequency transform coefficient having a low frequency band from among the plurality of frequency transform coefficients; and generating noise position information and noise energy information using the original noise signal and the reference noise signal.

54.

发明申请
Adaptive Audio Content Generation 有权
标题翻译：自适应音频内容生成

公开(公告)号：US20160150343A1

公开(公告)日：2016-05-26

申请号：US14900117

申请日：2014-06-17

申请人： DOLBY LABORATORIES LICENSING CORPORATION

发明人： Jun WANG , Lie LU , Mingqing HU , Dirk Jeroen BREEBAART , Nicolas R. TSINGOS

IPC分类号： H04S7/00 , G10L19/008 , G10L19/02

CPC分类号： H04S7/30 , G10L19/008 , G10L19/0204 , G10L19/20 , G10L21/0272 , H04S3/002 , H04S5/005 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/07

摘要： Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. Corresponding system and computer program product are also disclosed.

摘要翻译： 本发明的实施例涉及自适应音频内容生成。具体地，提供了一种用于产生自适应音频内容的方法。所述方法包括从基于频道的源音频内容中提取至少一个音频对象，以及至少部分地基于所述至少一个音频对象生成所述自适应音频内容。还公开了相应的系统和计算机程序产品。

55.

发明授权
Pitch filter for audio signals 有权
标题翻译：音频信号的滤波器

公开(公告)号：US09343077B2

公开(公告)日：2016-05-17

申请号：US14936408

申请日：2015-11-09

申请人： DOLBY INTERNATIONAL AB

发明人： Barbara Resch , Kristofer Kjörling , Lars Villemoes

IPC分类号： G10L19/00 , G10L19/26 , G10L19/12 , G10L19/125 , G10L21/003 , G10L19/02 , G10L19/107 , G10L19/20

CPC分类号： G10L19/26 , G10L19/02 , G10L19/0212 , G10L19/032 , G10L19/09 , G10L19/107 , G10L19/12 , G10L19/125 , G10L19/20 , G10L19/22 , G10L19/265 , G10L21/003 , G10L21/007 , G10L21/013

摘要： In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.

摘要翻译： 在一些实施例中，公开了用于滤波从音频比特流产生的初步音频信号的音调滤波器。音调滤波器具有从以下之一中选择的操作模式：（i）使用滤波信息对初步音频信号进行滤波以获得滤波的音频信号的有源模式，以及（ii）禁用音调滤波器的非活动模式。在具有从至少两个不同编码模式中选择的编码模式的音频编码器或音频解码器中产生初步音频信号，并且在编码中操作时，音调滤波器能够选择性地在活动模式或非活动模式下操作模式基于控制信息。

56.

发明申请
PITCH FILTER FOR AUDIO SIGNALS 有权

公开(公告)号：US20160086616A1

公开(公告)日：2016-03-24

申请号：US14936408

申请日：2015-11-09

申请人： DOLBY INTERNATIONAL AB

发明人： Barbara RESCH , Kristofer KJÖRLING , Lars VILLEMOES

IPC分类号： G10L19/26 , G10L21/003

CPC分类号： G10L19/26 , G10L19/02 , G10L19/0212 , G10L19/032 , G10L19/09 , G10L19/107 , G10L19/12 , G10L19/125 , G10L19/20 , G10L19/22 , G10L19/265 , G10L21/003 , G10L21/007 , G10L21/013

摘要： In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.

57.

发明授权
Adaptive gain reduction for encoding a speech signal 有权
标题翻译：用于对语音信号进行编码的自适应增益减小

公开(公告)号：US09269365B2

公开(公告)日：2016-02-23

申请号：US12218242

申请日：2008-07-11

申请人： Huan-Yu Su , Yang Gao

发明人： Huan-Yu Su , Yang Gao

IPC分类号： G10L19/00 , G10L19/20 , G10L19/09 , G10L19/18 , G10L25/90

CPC分类号： G10L19/12 , G10L19/0204 , G10L19/09 , G10L19/18 , G10L19/20 , G10L25/90 , G10L2019/0002 , G10L2019/0016

摘要： There is provided a method of encoding an input speech signal. The method comprises identifying a fixed codebook vector from a fixed codebook; identifying an adaptive codebook vector from a adaptive codebook; calculating an adaptive codebook gain; reducing the adaptive codebook gain by an amount; optimally selecting a fixed codebook gain based on the adaptive codebook gain while both the fixed codebook vector and the adaptive codebook vector remain fixed; and converting the input speech signal into an encoded speech using the fixed codebook gain, the adaptive codebook gain, the fixed codebook vector and the adaptive codebook vector. The amount of reducing the adaptive codebook gain may be varied.

摘要翻译： 提供了一种对输入语音信号进行编码的方法。该方法包括从固定码本识别固定码本向量; 从自适应码本识别自适应码本向量; 计算自适应码本增益; 将自适应码本增益减少一定量; 在固定码本矢量和自适应码本矢量保持固定的同时，基于自适应码本增益最优选择固定码本增益; 以及使用固定码本增益，自适应码本增益，固定码本矢量和自适应码本矢量将输入语音信号转换为编码语音。降低自适应码本增益的量可以变化。

58.

发明申请
Method for Predicting High Frequency Band Signal, Encoding Device, and Decoding Device 有权
标题翻译：预测高频带信号，编码装置和解码装置的方法

公开(公告)号：US20150332699A1

公开(公告)日：2015-11-19

申请号：US14808145

申请日：2015-07-24

申请人： Huawei Technologies Co., Ltd.

发明人： Zexin Liu , Lei Miao , Fengyan Qi

IPC分类号： G10L19/20 , G10L21/038

CPC分类号： G10L19/20 , G10L19/04 , G10L21/02 , G10L21/038

摘要： A method includes obtaining a signal type of an audio signal and a low frequency band signal of the audio signal, where the audio signal includes the low frequency band signal and a high frequency band signal; obtaining a frequency envelope of the high frequency band signal according to the signal type; predicting an excitation signal of the high frequency band signal according to the low frequency band signal; and restoring the high frequency band signal according to the frequency envelope of the high frequency band signal and the excitation signal of the high frequency band signal. By using the technical solutions of the embodiments of the present invention, an error existing between a high frequency band signal obtained by prediction and an actual high frequency band signal can be effectively reduced, and an accuracy rate of the predicted high frequency band signal can be increased.

摘要翻译： 一种方法包括获得音频信号的信号类型和音频信号的低频带信号，其中音频信号包括低频带信号和高频带信号; 根据信号类型获得高频信号的频率包络; 根据低频带信号预测高频信号的激励信号; 以及根据高频带信号的频率包络和高频带信号的激励信号恢复高频带信号。通过使用本发明的实施例的技术方案，可以有效地减少通过预测获得的高频带信号与实际的高频带信号之间存在的误差，并且预测的高频带信号的精度率可以是增加。

59.

发明申请
CLOSED LOOP QUANTIZATION OF HIGHER ORDER AMBISONIC COEFFICIENTS 有权
标题翻译：闭环式定量更高级别的健康系数

公开(公告)号：US20150332681A1

公开(公告)日：2015-11-19

申请号：US14712638

申请日：2015-05-14

申请人： QUALCOMM Incorporated

发明人： Moo Young Kim , Nils Günther Peters , Dipanjan Sen

IPC分类号： G10L19/008 , H04S3/02 , H04S5/00 , G10L19/038

CPC分类号： G10L19/008 , G10L19/032 , G10L19/038 , G10L19/20 , H04S3/02 , H04S5/005 , H04S2400/01 , H04S2420/11

摘要： In general, techniques are described for closed loop quantization of HOA coefficients that provide a three-dimensional representation of the sound field. An audio encoding device may perform closed loop quantization of an audio object based at least in part on a result of performing quantization of directional information associated with the audio object. An audio decoding device may obtain an audio object that has been closed loop quantized based at least in part on a result of performing quantization of directional information associated with the audio object, and may dequantize the audio object.

摘要翻译： 通常，描述了提供声场的三维表示的HOA系数的闭环量化的技术。音频编码装置可以至少部分地基于与音频对象相关联的方向信息的量化的结果来执行音频对象的闭环量化。音频解码装置可以至少部分地基于与音频对象相关联的方向信息的量化的结果来获得已被闭环量化的音频对象，并且可以对音频对象进行去量化。

60.

发明授权
Adaptive codebook gain control for speech coding 有权

公开(公告)号：US09190066B2

公开(公告)日：2015-11-17

申请号：US12321934

申请日：2009-01-26

申请人： Yang Gao

发明人： Yang Gao

IPC分类号： G10L19/09 , G10L19/20 , G10L19/18 , G10L25/90

CPC分类号： G10L19/12 , G10L19/0204 , G10L19/09 , G10L19/18 , G10L19/20 , G10L25/90 , G10L2019/0002 , G10L2019/0016

摘要： In accordance with one aspect of the invention, a selector supports the selection of a first encoding scheme or the second encoding scheme based upon the detection or absence of the triggering characteristic in the interval of the input speech signal. The first encoding scheme has a pitch pre-processing procedure for processing the input speech signal to form a revised speech signal biased toward an ideal voiced and stationary characteristic. The pre-processing procedure allows the encoder to fully capture the benefits of a bandwidth-efficient, long-term predictive procedure for a greater amount of speech components of an input speech signal than would otherwise be possible. In accordance with another aspect of the invention, the second encoding scheme entails a long-term prediction mode for encoding the pitch on a sub-frame by sub-frame basis. The long-term prediction mode is tailored to where the generally periodic component of the speech is generally not stationary or less than completely periodic and requires greater frequency of updates from the adaptive codebook to achieve a desired perceptual quality of the reproduced speech under a long-term predictive procedure.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类