Abstract:
A method for headphone reproduction of at least two input channel signals is proposed. Said method comprises, for each pair of input channel signals from said at least two input channel signals, the following steps. First, a common component, an estimated desired position corresponding to said common component, and two residual components corresponding to the two input channel signals in said pair of input channel signals are determined. Said determining is based on said pair of input channel signals. Each of said residual components is derived from its corresponding input channel signal by subtracting a contribution of the common component. Said contribution is related to the estimated desired position of the common component. Second, a main virtual source comprising said common component at the estimated desired position, and two further virtual sources each comprising a respective one of said residual components at respective predetermined positions, are synthesized.
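The first step above can be illustrated with a minimal sketch. The abstract does not say how the common component or its position is estimated, so everything specific here is an assumption: the principal-axis (PCA) angle of the channel pair stands in for the estimator, the function name `decompose_pair` is hypothetical, and the mapping from angle to an azimuth between -30 and +30 degrees is purely illustrative.

```python
import math

def decompose_pair(left, right):
    """Split a channel pair into a common component and two residuals.

    Assumed model (not taken from the abstract):
        left  = cos(angle) * common + res_l
        right = sin(angle) * common + res_r
    """
    # Channel energies and the cross term between the two channels.
    e_ll = sum(a * a for a in left)
    e_rr = sum(b * b for b in right)
    e_lr = sum(a * b for a, b in zip(left, right))

    # Assumed estimator: principal-axis angle of the 2x2 covariance matrix.
    angle = 0.5 * math.atan2(2.0 * e_lr, e_ll - e_rr)
    w_l, w_r = math.cos(angle), math.sin(angle)

    # Common component: projection of the pair onto the principal axis.
    common = [w_l * a + w_r * b for a, b in zip(left, right)]

    # Residuals: each channel minus the position-dependent contribution
    # of the common component.
    res_l = [a - w_l * s for a, s in zip(left, common)]
    res_r = [b - w_r * s for b, s in zip(right, common)]

    # Hypothetical position mapping: angle 0 -> -30 deg (fully left),
    # pi/4 -> 0 deg (centre), pi/2 -> +30 deg (fully right).
    azimuth_deg = (angle / (math.pi / 4.0) - 1.0) * 30.0
    return common, azimuth_deg, res_l, res_r
```

In this sketch, a source panned with amplitudes 0.8 (left) and 0.6 (right) is recovered entirely as the common component, the residuals vanish, and the estimated azimuth falls slightly left of centre. The second step, rendering the three components as virtual sources, is not covered here.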
Abstract:
An audio encoder comprises a multi-channel receiver (401) which receives an M-channel audio signal where M>2. A down-mix processor (403) down-mixes the M-channel audio signal to a first stereo signal and associated parametric data and a spatial processor (407) modifies the first stereo signal to generate a second stereo signal in response to the associated parametric data and spatial parameter data for a binaural perceptual transfer function, such as a Head Related Transfer Function (HRTF). The second stereo signal is a binaural signal and may specifically be a (3D) virtual spatial signal. An output data stream comprising the encoded data and the associated parametric data is generated by an encode processor (411) and an output processor (413). The HRTF processing may allow the generation of a (3D) virtual spatial signal by conventional stereo decoders. A multi-channel decoder may reverse the process of the spatial processor (407) to generate an improved quality multi-channel signal.
Abstract:
An audio decoder (100) comprising effect means, decoding means, and rendering means. The effect means (500) generate modified down-mix audio signals from received down-mix audio signals. Said received down-mix audio signals comprise a down-mix of a plurality of audio objects. Said modified down-mix audio signals are obtained by applying effects to estimated audio signals corresponding to audio objects comprised in said received down-mix audio signals. Said estimated audio signals are derived from the received down-mix audio signals based on received parametric data. Said received parametric data comprise a plurality of object parameters for each of the plurality of audio objects. Depending on the type of the applied effect, said modified down-mix audio signals are decoded by the decoding means, rendered by the rendering means, or combined with the output of the rendering means. The decoding means (300) are arranged for decoding the audio objects from the down-mix audio signals or the modified down-mix audio signals based on the parametric data. The rendering means (400) are arranged for generating at least one output audio signal from the decoded audio objects.
Abstract:
The invention relates to a sensor system comprising a sensor array, the sensor array comprising a substrate layer and a plurality of individual first sensor elements for measuring a desired parameter, which first sensor elements are arranged on said substrate layer and define a sensor plane, wherein the sensor array further comprises one or more second sensor elements for measuring a further desired parameter, and wherein the sensor system is configured to process sensor data from the first sensor elements in dependence on sensor data from the one or more second sensor elements.
Abstract:
A device (10) for enhancing a multi-channel (e.g. stereo) audio signal has a parameter adjustment unit (13) for adjusting an original parameter (α, ILD, ICC) which represents an original inter-channel property of the audio signal. The device further comprises a processing unit (11) for processing the audio signal so as to produce an enhanced audio signal having the adjusted parameter (α′, ILD′, ICC′). The device allows stereo widening or other multi-channel signal enhancements without introducing artifacts.
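As a rough illustration of adjusting an inter-channel parameter, the sketch below scales the inter-channel level difference (ILD) of a stereo pair while keeping the total power unchanged. It is a broadband, single-parameter simplification: the abstract does not specify how the adjusted parameter is computed or applied, the function name `scale_ild` and the scaling `factor` are hypothetical, and a practical system would more plausibly work per time/frequency band.

```python
import math

def scale_ild(left, right, factor):
    """Return a stereo pair whose ILD (in dB) is `factor` times the original.

    Hypothetical helper: the total power of the pair is preserved, and only
    per-channel gains are applied.
    """
    p_l = sum(a * a for a in left)
    p_r = sum(b * b for b in right)
    if p_l == 0.0 or p_r == 0.0:
        # ILD is undefined when a channel is silent; leave the pair as-is.
        return list(left), list(right)

    ild_db = 10.0 * math.log10(p_l / p_r)   # original ILD
    target_db = factor * ild_db             # adjusted ILD'
    ratio = 10.0 ** (target_db / 10.0)      # desired power ratio L/R

    # Split the unchanged total power according to the new ratio.
    total = p_l + p_r
    new_p_l = total * ratio / (1.0 + ratio)
    new_p_r = total / (1.0 + ratio)

    g_l = math.sqrt(new_p_l / p_l)
    g_r = math.sqrt(new_p_r / p_r)
    return [g_l * a for a in left], [g_r * b for b in right]
```

A `factor` above 1 exaggerates existing level differences, which is one simple way to obtain a widening effect; a `factor` of 1 returns the input unchanged.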
Abstract:
In a method of encoding input signals (CH1 to CH3; 400 to 450) in a multi-channel encoder (5; 15) to generate corresponding output data having down-mix output signals (610, 620) together with complementary parametric data (600), the method includes a first step of down-mixing input signals (CH1 to CH3; 400 to 450) to generate the corresponding down-mix output signals (610, 620), and a second step of processing the input signals (CH1 to CH3; 400 to 450) during down-mixing to generate the parametric data (600) complementary to the down-mix output signals (610, 620). Processing of the input signals (CH1 to CH3; 400 to 450) involves including information in the down-mix signals (610, 620) which is usable during subsequent decoding of the down-mix output signals (610, 620) and the parametric data (600) to determine at least some parameter data, thereby enabling representations of the input signals (CH1 to CH3; 400 to 450) to be subsequently regenerated.
Abstract:
A method of synthesizing a first (L) and a second (R) output signal from an input signal (x). The method comprises: filtering (201) the input signal to generate a filtered signal (Hx); obtaining a correlation parameter (ρ) indicative of a desired correlation between the first and second output signals; obtaining a level parameter (c) indicative of a desired level difference between the first and second output signals; and transforming the input signal and the filtered signal by a matrixing operation (203) into the first and second output signals, where the matrixing operation depends on the correlation parameter and the level parameter.
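One way such a matrixing operation could look is sketched below. The abstract does not give the matrix, so the rotation-based form used here is an assumption, as are the function name `synthesize_pair`, the interpretation of c as the amplitude ratio between the second and first output signals, and the requirement that the filtered signal Hx is uncorrelated with x and has the same power. The filter itself is not modelled; the filtered signal is passed in directly.

```python
import math

def synthesize_pair(x, hx, rho, c):
    """Mix x and its filtered version hx into two output signals.

    Assumptions (not stated in the abstract): hx is uncorrelated with x and
    has equal power; c is the amplitude ratio between the second and first
    outputs; the combined output power is twice the input power.
    """
    # A rotation by +/- alpha yields a normalized correlation of cos(2*alpha).
    alpha = 0.5 * math.acos(max(-1.0, min(1.0, rho)))

    # Gains that realise the level ratio c.
    norm = math.sqrt(2.0 / (1.0 + c * c))
    g_l, g_r = norm, norm * c

    # 2x2 mixing matrix applied to each sample pair [x, hx].
    m = [
        [g_l * math.cos(alpha), g_l * math.sin(alpha)],
        [g_r * math.cos(alpha), -g_r * math.sin(alpha)],
    ]
    out_l = [m[0][0] * a + m[0][1] * b for a, b in zip(x, hx)]
    out_r = [m[1][0] * a + m[1][1] * b for a, b in zip(x, hx)]
    return out_l, out_r
```

Under the stated assumptions, the two outputs have normalized correlation ρ and amplitude ratio c; with ρ = 1 the filtered signal drops out and both outputs are scaled copies of x.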
Abstract:
A method of synthesizing a first (L) and a second (R) output signal from an input signal (x). The method comprises: filtering (201) the input signal to generate a filtered signal; obtaining a correlation parameter indicative of a desired correlation between the first and second output signals; obtaining a level parameter (c) indicative of a desired level difference between the first and second output signals; and transforming the input signal and the filtered signal by a matrixing operation (203) into the first and second output signals, where the matrixing operation depends on the correlation parameter and the level parameter.
Abstract:
A binaural object-oriented audio decoder comprising decoding means for decoding and rendering at least one audio object based on head-related transfer function parameters is proposed. Said decoding means are arranged for positioning an audio object in a virtual three-dimensional space. Said head-related transfer function parameters are based on an elevation parameter, an azimuth parameter, and a distance parameter. Said parameters correspond to the position of the audio object in the virtual three-dimensional space. The binaural object-oriented audio decoder is configured for receiving the head-related transfer function parameters, whereby said received head-related transfer function parameters vary with the elevation parameter and the azimuth parameter only. Said binaural object-oriented audio decoder is characterized by distance processing means for modifying the received head-related transfer function parameters according to a received desired distance parameter. Said modified head-related transfer function parameters are used to position the audio object in the three-dimensional space at the desired distance. Said modification of the head-related transfer function parameters is based on a predetermined distance parameter for said received head-related transfer function parameters.
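The distance processing can be pictured with the following minimal sketch. The abstract does not state which head-related transfer function parameters are modified or how, so this example assumes the parameters include one linear level per ear and rescales each level by the inverse-distance law, using the predetermined (reference) distance at which the parameters were obtained. The head radius, the convention that positive azimuth points to the right, and the names `ear_distances` and `modify_for_distance` are all hypothetical.

```python
import math

HEAD_RADIUS_M = 0.09  # assumed ear offset from the head centre, in metres

def ear_distances(azimuth_deg, elevation_deg, distance_m):
    """Distances from a source to the left and right ear.

    Assumed geometry: ears at (-r, 0, 0) and (+r, 0, 0), azimuth 0 straight
    ahead, positive azimuth to the right.
    """
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    sx = distance_m * math.cos(el) * math.sin(az)
    sy = distance_m * math.cos(el) * math.cos(az)
    sz = distance_m * math.sin(el)
    left = math.sqrt((sx + HEAD_RADIUS_M) ** 2 + sy ** 2 + sz ** 2)
    right = math.sqrt((sx - HEAD_RADIUS_M) ** 2 + sy ** 2 + sz ** 2)
    return left, right

def modify_for_distance(level_left, level_right, azimuth_deg, elevation_deg,
                        reference_distance_m, desired_distance_m):
    """Rescale per-ear levels from the reference to the desired distance."""
    ref_l, ref_r = ear_distances(azimuth_deg, elevation_deg,
                                 reference_distance_m)
    new_l, new_r = ear_distances(azimuth_deg, elevation_deg,
                                 desired_distance_m)
    # Inverse-distance law applied per ear (assumed, linear amplitude).
    return level_left * ref_l / new_l, level_right * ref_r / new_r
```

Because the scaling is computed per ear, moving a lateral source closer raises the level at the nearer ear more than at the farther ear, so the level difference between the ears grows as the distance shrinks. A source placed exactly at an ear position is not handled.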
Abstract:
In summary, this application describes a psycho-acoustically motivated, parametric description of the spatial attributes of multichannel audio signals. This parametric description allows strong bitrate reductions in audio coders, since only one monaural signal has to be transmitted, combined with (quantized) parameters which describe the spatial properties of the signal. The decoder can reconstruct the original number of audio channels by applying the spatial parameters. For near-CD-quality stereo audio, a bitrate associated with these spatial parameters of 10 kbit/s or less seems sufficient to reproduce the correct spatial impression at the receiving end.