Abstract:
An embodiment of an apparatus (100) for generating audio subband values in audio subband channels comprises an analysis windower (110) for windowing a frame (120) of time-domain audio input samples being in a time sequence extending from an early sample to a later sample using an analysis window function (190) comprising a sequence of window coefficients to obtain windowed samples. The analysis window function (190) comprises a first group (200) of window coefficients and a second group (210) of window coefficients. The first group (200) of window coefficients is used for windowing later time-domain samples and the second group (210) of window coefficients is used for windowing earlier time-domain samples. The apparatus (100) further comprises a calculator (170) for calculating the audio subband values using the windowed samples.
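The windower and calculator described in this abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. The function names, the reversed coefficient indexing (so that the first coefficients weight the latest samples) and the complex exponential modulation used as the calculator are all assumptions made for illustration; the abstract specifies none of them.

```python
import cmath
import math


def apply_analysis_window(frame, window):
    """Window a frame whose samples run from earliest (index 0) to latest.

    Assumption: coefficient index 0 belongs to the first group, so the
    coefficient order is reversed relative to the sample order. The first
    coefficients then weight the latest samples.
    """
    if len(frame) != len(window):
        raise ValueError("frame and window must have the same length")
    last = len(frame) - 1
    return [frame[n] * window[last - n] for n in range(len(frame))]


def subband_values(windowed, num_bands):
    """Assumed calculator: one complex exponential modulation per subband."""
    values = []
    for k in range(num_bands):
        acc = 0j
        for n, z in enumerate(windowed):
            acc += z * cmath.exp(-1j * math.pi * (k + 0.5) * n / num_bands)
        values.append(acc)
    return values
```

With `frame = [1, 2, 3, 4]` and `window = [10, 20, 30, 40]`, the first group `[10, 20]` ends up weighting the later samples `3` and `4`.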
Abstract:
An apparatus for generating time-domain audio samples, or synthesis filterbank (300), comprises a calculator (310) for calculating a frame (330) comprising a sequence of intermediate time-domain samples from audio subband values of a block (320). The calculator (310) is coupled to a synthesis windower (360) to which the frame (330) of intermediate time-domain samples is provided. The synthesis windower (360) is adapted to window the sequence of intermediate time-domain samples using a synthesis window function (370) and provides a frame (380) of windowed intermediate time-domain samples. The synthesis windower (360) is coupled to an overlap-adder output stage (400) that obtains a block (410) of time-domain samples. The block (410) of time-domain (output) samples can then, for instance, be provided to further components for further processing, storing or transforming into audible audio signals.
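The windower and overlap-adder stages of this chain can be sketched as follows. This is a minimal illustration under stated assumptions: the names `synthesis_window` and `OverlapAdder` are hypothetical, the frame length is assumed to be an integer multiple of the block length, and the calculator that turns subband values into the intermediate frame is left out.

```python
def synthesis_window(frame, window):
    """Multiply each intermediate time-domain sample by its window coefficient."""
    if len(frame) != len(window):
        raise ValueError("frame and window must have the same length")
    return [x * w for x, w in zip(frame, window)]


class OverlapAdder:
    """Accumulate windowed frames and emit one output block per frame."""

    def __init__(self, frame_len, block_len):
        if frame_len % block_len != 0:
            raise ValueError("frame_len is assumed to be a multiple of block_len")
        self.block_len = block_len
        self.buffer = [0.0] * frame_len

    def push(self, windowed_frame):
        # Add the new frame onto the contributions left by earlier frames.
        self.buffer = [a + b for a, b in zip(self.buffer, windowed_frame)]
        # The oldest block_len samples are complete and can be output.
        block = self.buffer[:self.block_len]
        # Shift the buffer and make room for the next frame.
        self.buffer = self.buffer[self.block_len:] + [0.0] * self.block_len
        return block
```

Each call to `push` corresponds to one frame (380) going in and one block (410) coming out; the remaining buffer content carries over to the next call.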
Abstract:
An audio decoder device for decoding a bitstream, the audio decoder device comprising: a predictive decoder for producing a decoded audio frame from the bitstream, wherein the predictive decoder comprises a parameter decoder for producing one or more audio parameters for the decoded audio frame from the bitstream and wherein the predictive decoder comprises a synthesis filter device for producing the decoded audio frame by synthesizing the one or more audio parameters for the decoded audio frame; a memory device comprising one or more memories, wherein each of the memories is configured to store a memory state for the decoded audio frame, wherein the memory state for the decoded audio frame of the one or more memories is used by the synthesis filter device for synthesizing the one or more audio parameters for the decoded audio frame; and a memory state resampling device configured to determine the memory state for synthesizing the one or more audio parameters for the decoded audio frame, which has a sampling rate, for one or more of said memories by resampling a preceding memory state for synthesizing one or more audio parameters for a preceding decoded audio frame, which has a preceding sampling rate being different from the sampling rate of the decoded audio frame, for one or more of said memories and to store the memory state for synthesizing the one or more audio parameters for the decoded audio frame for one or more of said memories into the respective memory.
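The idea of resampling a stored memory state when the sampling rate changes between frames can be sketched as below. The abstract does not say which resampling method is used; linear interpolation is an assumption chosen for brevity, and the function name and the list-of-floats representation of a memory state are hypothetical.

```python
def resample_memory_state(prev_state, prev_rate, new_rate):
    """Map a memory state stored at prev_rate onto new_rate.

    Assumption: linear interpolation between neighbouring stored samples.
    A production decoder would more likely use a proper resampling filter.
    """
    if not prev_state:
        return []
    old_len = len(prev_state)
    new_len = max(1, round(old_len * new_rate / prev_rate))
    if old_len == 1 or new_len == 1:
        # Nothing to interpolate between: repeat the most recent sample.
        return [prev_state[-1]] * new_len
    out = []
    for i in range(new_len):
        # Position of output sample i on the old sample grid.
        pos = i * (old_len - 1) / (new_len - 1)
        lo = int(pos)
        hi = min(lo + 1, old_len - 1)
        frac = pos - lo
        out.append(prev_state[lo] * (1.0 - frac) + prev_state[hi] * frac)
    return out
```

The returned list would then be stored in the respective memory, so the synthesis filter can continue from it instead of starting from an empty state.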
Abstract:
Embodiments of the present invention provide an encoder comprising a quantization stage, an entropy encoder, a residual quantization stage and a coded signal former. The quantization stage is configured to quantize an input signal using a dead zone in order to obtain a plurality of quantized values. The entropy encoder is configured to encode the plurality of quantized values using an entropy encoding scheme in order to obtain a plurality of entropy encoded values. The residual quantization stage is configured to quantize a residual signal caused by the quantization stage, wherein the residual quantization stage is configured to determine at least one quantized residual value in dependence on the dead zone of the quantization stage. The coded signal former is configured to form a coded signal from the plurality of entropy encoded values and the at least one quantized residual value.
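A dead-zone quantizer followed by a residual stage that takes the dead zone into account might look like the sketch below. The rounding offset of 0.375, the bit layout of the residual and both function names are assumptions made for illustration; the abstract states only that the quantized residual value is determined in dependence on the dead zone.

```python
def deadzone_quantize(x, step, rounding=0.375):
    """Uniform quantizer whose zero bin is widened by a rounding offset below 0.5."""
    magnitude = int(abs(x) / step + rounding)
    return magnitude if x >= 0 else -magnitude


def quantize_with_residual(x, step, rounding=0.375):
    """Return the quantized value and a short list of residual bits.

    Assumed residual scheme:
    - nonzero value: one bit, set when x lies at or above the reconstruction point
    - zero value: the threshold is derived from the dead-zone half-width; a second
      bit carries the sign only when the threshold is reached
    """
    q = deadzone_quantize(x, step, rounding)
    if q != 0:
        return q, [1 if x >= q * step else 0]
    half_zone = (1.0 - rounding) * step  # the zero bin covers |x| < half_zone
    if abs(x) < 0.5 * half_zone:
        return q, [0]
    return q, [1, 1 if x >= 0 else 0]
```

A coded signal former would then pack the entropy-coded `q` values together with these residual bits; that packing step is not shown here.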
Abstract:
A watermark generator (2400) for providing a watermark signal (2420) in dependence on binary message data (2410) comprises an information processor (2430) configured to provide, in dependence on a single message bit of the binary message data, a 2-dimensional spread information (2432) representing the message bit in the form of a set of time-frequency-domain values. The watermark generator also comprises a watermark signal provider (2440) configured to provide the watermark signal on the basis of the 2-dimensional spread information. A watermark detector, methods and computer programs are also described.
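One way to picture the 2-dimensional spread information is as a grid in which a single bit is multiplied by a frequency spreading sequence along one axis and a time spreading sequence along the other. The sketch below assumes exactly that construction with bipolar (+1/-1) sequences; the sequences, the function names and the correlation-based despreading are illustrative assumptions, not details taken from the abstract.

```python
def spread_bit_2d(bit, freq_seq, time_seq):
    """Spread one message bit over a grid with one row per frequency band."""
    symbol = 1 if bit else -1
    return [[symbol * f * t for t in time_seq] for f in freq_seq]


def despread_bit_2d(grid, freq_seq, time_seq):
    """Correlate a grid with the same sequences and decide on the bit."""
    correlation = 0
    for row, f in zip(grid, freq_seq):
        for value, t in zip(row, time_seq):
            correlation += value * f * t
    return 1 if correlation > 0 else 0


# Usage: spread a bit over 2 bands and 3 time slots, then recover it.
grid = spread_bit_2d(1, [1, -1], [1, 1, -1])
recovered = despread_bit_2d(grid, [1, -1], [1, 1, -1])
```

A watermark signal provider would subsequently turn such a grid into a time-domain signal; that step is outside this sketch.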
Abstract:
A watermark signal provider (2400) for providing a watermark signal (2440) suitable for being hidden in an audio signal (2430) when the watermark signal is added to the audio signal, such that the watermark signal represents watermark data (2450), is described. The watermark signal provider comprises a psychoacoustical processor (2410) for determining a masking threshold of the audio signal; and a modulator (2420) for generating the watermark signal from a superposition of sample-shaping functions spaced apart from each other at a sample time interval (T_b) of a time-discrete representation of the watermark data, each sample-shaping function being amplitude-weighted with a respective sample of the time-discrete representation, multiplied by a respective amplitude weight depending on the masking threshold, the modulator being configured such that the sample time interval is shorter than a time extension of the sample-shaping functions; and the respective amplitude weight also depends on samples of the time-discrete representation neighboring the respective sample in time.
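The superposition performed by the modulator can be sketched in a few lines. The amplitude weights are taken here as given inputs; how they follow from the masking threshold and from neighboring samples is not modeled. The function name, the integer `spacing` standing in for the sample time interval and the list representation of the shaping function are assumptions.

```python
def superpose_shaped_samples(samples, weights, shape, spacing):
    """Sum shifted copies of a shaping function, one copy per data sample.

    samples: time-discrete representation of the watermark data
    weights: one amplitude weight per sample (assumed precomputed)
    shape:   sampled sample-shaping function
    spacing: sample time interval, counted in output samples
    """
    if len(samples) != len(weights):
        raise ValueError("one weight per sample is required")
    if spacing >= len(shape):
        # The source requires the interval to be shorter than the shape's extension.
        raise ValueError("spacing must be shorter than the shaping function")
    if not samples:
        return []
    out = [0.0] * (spacing * (len(samples) - 1) + len(shape))
    for i, (s, w) in enumerate(zip(samples, weights)):
        for j, g in enumerate(shape):
            out[i * spacing + j] += w * s * g
    return out
```

Because the spacing is shorter than the shaping function, neighboring copies overlap in the output, which is one reason a weight might need to take neighboring samples into account.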
Abstract:
A watermark decoder comprises a time-frequency-domain representation provider, a memory unit, a synchronization determiner and a watermark extractor. The time-frequency-domain representation provider provides a frequency-domain representation of the watermarked signal for a plurality of time blocks. The memory unit stores the frequency-domain representation of the watermarked signal for a plurality of time blocks. Further, the synchronization determiner identifies an alignment time block based on the frequency-domain representation of the watermarked signal of a plurality of time blocks. The watermark extractor provides binary message data based on stored frequency-domain representations of the watermarked signal of time blocks temporally preceding the identified alignment time block considering a distance to the identified alignment time block.
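The interplay of memory unit, synchronization determiner and watermark extractor can be illustrated with a toy sketch. Everything specific in it is assumed: the per-block synchronization scores, the rule that the highest score marks the alignment block, one bit per stored block, and a sign decision on the block sum. The abstract itself states only that bits are extracted from stored blocks preceding the alignment block, taking their distance to it into account.

```python
def find_alignment_block(sync_scores):
    """Assumed synchronization rule: the block with the highest score wins."""
    return max(range(len(sync_scores)), key=lambda i: sync_scores[i])


def extract_message(stored_blocks, alignment_index, num_bits):
    """Read num_bits bits from the stored blocks just before the alignment block.

    The block at distance d before the alignment block is assumed to carry
    bit number num_bits - d, so the oldest block holds the first bit.
    """
    start = alignment_index - num_bits
    if start < 0:
        raise ValueError("not enough stored blocks before the alignment block")
    bits = []
    for block in stored_blocks[start:alignment_index]:
        # Toy decision: the sign of the summed frequency-domain values.
        bits.append(1 if sum(block) > 0 else 0)
    return bits


# Usage with made-up data: five stored blocks, alignment found at the last one.
stored = [[-1.0, -1.0], [1.0, 2.0], [-3.0, 1.0], [0.5, 0.5], [9.0, 9.0]]
alignment = find_alignment_block([0.1, 0.2, 0.1, 0.3, 0.9])
message = extract_message(stored, alignment, 3)
```

Storing the blocks first is what allows the extractor to look backwards in time once the alignment block has been identified.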