-
公开(公告)号:US20230027660A1
公开(公告)日:2023-01-26
申请号:US17954179
申请日:2022-09-27
发明人: Per EKSTRAND , Lars VILLEMOES
IPC分类号: G10L19/022 , G10L19/24 , G10L21/038 , G10L21/04 , G10L19/02
摘要: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
-
公开(公告)号:US11562755B2
公开(公告)日:2023-01-24
申请号:US17409592
申请日:2021-08-23
发明人: Per Ekstrand , Lars Villemoes
IPC分类号: G10L19/22 , G10L21/038 , G10L21/04 , G10L19/24 , G10L19/02 , G10L19/022
摘要: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
-
公开(公告)号:US11495236B2
公开(公告)日:2022-11-08
申请号:US16878313
申请日:2020-05-19
发明人: Lars Villemoes , Per Ekstrand , Sascha Disch , Frederik Nagel , Stephan Wilde
IPC分类号: G10L19/008 , G10L21/038 , G10L21/04 , G10L19/02
摘要: An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.
-
公开(公告)号:US20220293113A1
公开(公告)日:2022-09-15
申请号:US17829733
申请日:2022-06-01
发明人: Lars Villemoes
IPC分类号: G10L19/02 , G10L21/038 , G10L21/04 , H03G3/00 , H03G3/30 , G10L19/025 , G10L19/26
摘要: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency QΩ+rΩ0 is generated on the basis of existing components at Ω and Ω+Ω0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.
-
公开(公告)号:US20220270626A1
公开(公告)日:2022-08-25
申请号:US17450015
申请日:2021-10-05
申请人: TENCENT AMERICA LLC
发明人: Jun TIAN , Xiaozhong XU , Shan LIU
IPC分类号: G10L21/003 , G10L21/04
摘要: Aspects of the disclosure provide methods and apparatuses for audio processing. In some examples, an apparatus of audio coding includes processing circuitry. The processing circuitry decodes, from a coded bitstream, information indicative of an adjusted speech signal and a loudness adjustment to the adjusted speech signal. The adjusted speech signal is indicated in an association with multiple speech signals in a scene of an immersive media application. The processing circuitry determines a plurality of loudness adjustments to sound signals including the multiple speech signals in the scene based the plurality of loudness adjustment to the adjusted speech signal, and generates the sound signals in the scene based on the loudness adjustments to the sound signals.
-
公开(公告)号:US20210151069A1
公开(公告)日:2021-05-20
申请号:US17158873
申请日:2021-01-26
申请人: BabbleLabs LLC
摘要: Systems and methods are disclosed for data driven radio enhancement. For example, methods may include demodulating a radio signal to obtain a demodulated audio signal; determining a window of audio samples based on the demodulated audio signal; applying an audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the audio enhancement network includes a machine learning network that has been trained using demodulated audio signals derived from radio signals; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.
-
公开(公告)号:US10943594B2
公开(公告)日:2021-03-09
申请号:US16546898
申请日:2019-08-21
IPC分类号: G10L21/00 , G10L19/00 , G10L21/04 , G10L19/087 , G10L25/72 , G10L19/24 , G10L21/038
摘要: A method and device are provided for determining an optimized scale factor to be applied to an excitation signal or a filter during a process for frequency band extension of an audio frequency signal. The band extension process includes decoding or extracting, in a first frequency band, an excitation signal and parameters of the first frequency band including coefficients of a linear prediction filter, generating an excitation signal extending over at least one second frequency band, filtering using a linear prediction filter for the second frequency band. The determination method includes determining an additional linear prediction filter, of a lower order than that of the linear prediction filter of the first frequency band, the coefficients of the additional filter being obtained from the parameters decoded or extracted from the first frequency and calculating the optimized scale factor as a function of at least the coefficients of the additional filter.
-
公开(公告)号:US10909996B2
公开(公告)日:2021-02-02
申请号:US14905158
申请日:2014-07-16
发明人: Yutaka Kamamoto , Takehiro Moriya , Noboru Harada
IPC分类号: G10L19/06 , G10L25/12 , G10L19/02 , G10L19/032 , G10L21/04 , G10L25/06 , G10L25/18 , G10L25/27
摘要: An autocorrelation calculation unit 21 calculates an autocorrelation RO(i) from an input signal. A prediction coefficient calculation unit 23 performs linear prediction analysis by using a modified autocorrelation R′O(i) obtained by multiplying a coefficient wO(i) by the autocorrelation RO(i). It is assumed here, for each order i of some orders i at least, that the coefficient wO(i) corresponding to the order i is in a monotonically increasing relationship with an increase in a value that is negatively correlated with a fundamental frequency of the input signal of the current frame or a past frame.
-
9.
公开(公告)号:US10891966B2
公开(公告)日:2021-01-12
申请号:US16135818
申请日:2018-09-19
申请人: Yamaha Corporation
发明人: Akira Maezawa
摘要: An audio processing device includes a feature extraction unit and signal generating unit. The feature extraction unit is configured to extract a feature quantity of a first audio signal for each of a plurality of periods. The signal generating unit is configured to for generate a second audio signal by time axis expanding/compressing either a section of the first audio signal in which the feature quantity is steadily maintained for a period time, or a section of the first audio signal in which a fluctuation of the feature quantity is repeated and excluding from the time axis expanding/compressing a section of the first audio signal in which a fluctuation of the feature quantity is not similar to that of other sections of the first audio signal.
-
公开(公告)号:US20200304929A1
公开(公告)日:2020-09-24
申请号:US16605009
申请日:2018-03-23
申请人: OMNIO SOUND LIMITED
发明人: Bernt BÖHMER
IPC分类号: H04S1/00 , H04S7/00 , G10L21/04 , G10L19/022
摘要: The Stereo Unfold Technology solves the inherent problems in the stereo reproduction by utilizing modern DSP technology to extract information from the Left (L) and Right (R) stereo channels to create a number of new channels that feeds into processing algorithms. The Stereo Unfold Technology operates by sending the ordinary stereo information in the customary way towards the listener to establish the perceived location of performers in the sound field with great accuracy and then projects delayed and frequency shaped extracted signals forward as well as in other directions to provide additional psychoacoustically based clues to the ear and brain. The additional clues generate the sensation of increased detail and transparency as well as establishing the three dimensional properties of the sound sources and the acoustic environment in which they are performing. The Stereo Unfold Technology manages to create a real believable three-dimensional soundstage populated with three-dimensional sound sources generating sound in a continuous real sounding acoustic environment.
-
-
-
-
-
-
-
-
-