专利检索 ipc:G10L21/04 第 1 页

1.

发明申请
HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM 有权

公开(公告)号：US20230027660A1

公开(公告)日：2023-01-26

申请号：US17954179

申请日：2022-09-27

申请人： DOLBY INTERNATIONAL AB

发明人： Per EKSTRAND , Lars VILLEMOES

IPC分类号： G10L19/022 , G10L19/24 , G10L21/038 , G10L21/04 , G10L19/02

摘要： The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

2.

发明授权
Harmonic transposition in an audio coding method and system 有权

公开(公告)号：US11562755B2

公开(公告)日：2023-01-24

申请号：US17409592

申请日：2021-08-23

申请人： Dolby International AB

发明人： Per Ekstrand , Lars Villemoes

IPC分类号： G10L19/22 , G10L21/038 , G10L21/04 , G10L19/24 , G10L19/02 , G10L19/022

摘要： The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

3.

发明授权
Apparatus and method for processing an input audio signal using cascaded filterbanks 有权

公开(公告)号：US11495236B2

公开(公告)日：2022-11-08

申请号：US16878313

申请日：2020-05-19

申请人： Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V. , Dolby International AB

发明人： Lars Villemoes , Per Ekstrand , Sascha Disch , Frederik Nagel , Stephan Wilde

IPC分类号： G10L19/008 , G10L21/038 , G10L21/04 , G10L19/02

摘要： An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.

4.

发明申请
CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION 有权

公开(公告)号：US20220293113A1

公开(公告)日：2022-09-15

申请号：US17829733

申请日：2022-06-01

申请人： DOLBY INTERNATIONAL AB

发明人： Lars Villemoes

IPC分类号： G10L19/02 , G10L21/038 , G10L21/04 , H03G3/00 , H03G3/30 , G10L19/025 , G10L19/26

摘要： The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency QΩ+rΩ0 is generated on the basis of existing components at Ω and Ω+Ω0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

5.

发明申请
METHOD AND APPARATUS IN AUDIO PROCESSING 有权

公开(公告)号：US20220270626A1

公开(公告)日：2022-08-25

申请号：US17450015

申请日：2021-10-05

申请人： TENCENT AMERICA LLC

发明人： Jun TIAN , Xiaozhong XU , Shan LIU

IPC分类号： G10L21/003 , G10L21/04

摘要： Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus of audio coding includes processing circuitry. The processing circuitry decodes, from a coded bitstream, information indicative of an adjusted speech signal and a loudness adjustment to the adjusted speech signal. The adjusted speech signal is indicated in an association with multiple speech signals in a scene of an immersive media application. The processing circuitry determines a plurality of loudness adjustments to sound signals including the multiple speech signals in the scene based the plurality of loudness adjustment to the adjusted speech signal, and generates the sound signals in the scene based on the loudness adjustments to the sound signals.

6.

发明申请
Data Driven Radio Enhancement 有权

公开(公告)号：US20210151069A1

公开(公告)日：2021-05-20

申请号：US17158873

申请日：2021-01-26

申请人： BabbleLabs LLC

发明人： Samer Hijazi , Kamil Krzysztof Wojcicki , Dror Maydan , Christopher Rowen

IPC分类号： G10L21/0364 , H04L1/08 , H04L1/00 , G06N3/08 , G10L21/04 , G06N3/04 , G10L25/30

摘要： Systems and methods are disclosed for data driven radio enhancement. For example, methods may include demodulating a radio signal to obtain a demodulated audio signal; determining a window of audio samples based on the demodulated audio signal; applying an audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the audio enhancement network includes a machine learning network that has been trained using demodulated audio signals derived from radio signals; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.

7.

发明授权
Optimized scale factor for frequency band extension in an audio frequency signal decoder 有权

公开(公告)号：US10943594B2

公开(公告)日：2021-03-09

申请号：US16546898

申请日：2019-08-21

申请人： KONINKLIJKE PHILIPS N.V.

发明人： Magdalena Kaniewska , Stephane Ragot

IPC分类号： G10L21/00 , G10L19/00 , G10L21/04 , G10L19/087 , G10L25/72 , G10L19/24 , G10L21/038

摘要： A method and device are provided for determining an optimized scale factor to be applied to an excitation signal or a filter during a process for frequency band extension of an audio frequency signal. The band extension process includes decoding or extracting, in a first frequency band, an excitation signal and parameters of the first frequency band including coefficients of a linear prediction filter, generating an excitation signal extending over at least one second frequency band, filtering using a linear prediction filter for the second frequency band. The determination method includes determining an additional linear prediction filter, of a lower order than that of the linear prediction filter of the first frequency band, the coefficients of the additional filter being obtained from the parameters decoded or extracted from the first frequency and calculating the optimized scale factor as a function of at least the coefficients of the additional filter.

8.

发明授权
Linear prediction analysis device, method, program, and storage medium 有权

公开(公告)号：US10909996B2

公开(公告)日：2021-02-02

申请号：US14905158

申请日：2014-07-16

申请人： NIPPON TELEGRAPH AND TELEPHONE CORPORATION

发明人： Yutaka Kamamoto , Takehiro Moriya , Noboru Harada

IPC分类号： G10L19/06 , G10L25/12 , G10L19/02 , G10L19/032 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/27

摘要： An autocorrelation calculation unit 21 calculates an autocorrelation RO(i) from an input signal. A prediction coefficient calculation unit 23 performs linear prediction analysis by using a modified autocorrelation R′O(i) obtained by multiplying a coefficient wO(i) by the autocorrelation RO(i). It is assumed here, for each order i of some orders i at least, that the coefficient wO(i) corresponding to the order i is in a monotonically increasing relationship with an increase in a value that is negatively correlated with a fundamental frequency of the input signal of the current frame or a past frame.

9.

发明授权
Audio processing method and audio processing device for expanding or compressing audio signals 有权

公开(公告)号：US10891966B2

公开(公告)日：2021-01-12

申请号：US16135818

申请日：2018-09-19

申请人： Yamaha Corporation

发明人： Akira Maezawa

IPC分类号： G10L21/04 , G10L21/01 , G10L25/51 , G10L25/03 , G10L25/06

摘要： An audio processing device includes a feature extraction unit and signal generating unit. The feature extraction unit is configured to extract a feature quantity of a first audio signal for each of a plurality of periods. The signal generating unit is configured to for generate a second audio signal by time axis expanding/compressing either a section of the first audio signal in which the feature quantity is steadily maintained for a period time, or a section of the first audio signal in which a fluctuation of the feature quantity is repeated and excluding from the time axis expanding/compressing a section of the first audio signal in which a fluctuation of the feature quantity is not similar to that of other sections of the first audio signal.

10.

发明申请
STEREO UNFOLD WITH PSYCHOACOUSTIC GROUPING PHENOMENON 审中-公开

公开(公告)号：US20200304929A1

公开(公告)日：2020-09-24

申请号：US16605009

申请日：2018-03-23

申请人： OMNIO SOUND LIMITED

发明人： Bernt BÖHMER

IPC分类号： H04S1/00 , H04S7/00 , G10L21/04 , G10L19/022

摘要： The Stereo Unfold Technology solves the inherent problems in the stereo reproduction by utilizing modern DSP technology to extract information from the Left (L) and Right (R) stereo channels to create a number of new channels that feeds into processing algorithms. The Stereo Unfold Technology operates by sending the ordinary stereo information in the customary way towards the listener to establish the perceived location of performers in the sound field with great accuracy and then projects delayed and frequency shaped extracted signals forward as well as in other directions to provide additional psychoacoustically based clues to the ear and brain. The additional clues generate the sensation of increased detail and transparency as well as establishing the three dimensional properties of the sound sources and the acoustic environment in which they are performing. The Stereo Unfold Technology manages to create a real believable three-dimensional soundstage populated with three-dimensional sound sources generating sound in a continuous real sounding acoustic environment.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类