Abstract:
In summary, this application describes a psycho-acoustically motivated, parametric description of the spatial attributes of multichannel audio signals. This parametric description allows strong bitrate reductions in audio coders, since only one monaural signal has to be transmitted, together with (quantized) parameters that describe the spatial properties of the signal. The decoder can reconstruct the original number of audio channels by applying the spatial parameters. For near-CD-quality stereo audio, a bitrate of 10 kbit/s or less for these spatial parameters seems sufficient to reproduce the correct spatial impression at the receiving end.
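The mono-plus-parameters idea can be sketched in a few lines. This is only an illustration under stated assumptions: the abstract does not name the spatial parameters, so the level difference (`ild_db`) and normalized correlation (`corr`) used here, and the function names `encode_frame` and `decode_frame`, are hypothetical choices. The toy decoder applies the level difference only and ignores the correlation.

```python
import math

EPS = 1e-12  # guards against division by zero on silent frames


def encode_frame(left, right):
    """Downmix one stereo frame to mono and estimate two illustrative
    spatial parameters (assumed, not taken from the abstract)."""
    mono = [(l + r) / 2.0 for l, r in zip(left, right)]
    e_left = sum(l * l for l in left)
    e_right = sum(r * r for r in right)
    # Inter-channel level difference in dB.
    ild_db = 10.0 * math.log10((e_left + EPS) / (e_right + EPS))
    # Normalized cross-correlation at lag zero.
    cross = sum(l * r for l, r in zip(left, right))
    corr = cross / math.sqrt((e_left + EPS) * (e_right + EPS))
    return mono, {"ild_db": ild_db, "corr": corr}


def decode_frame(mono, params):
    """Rebuild two channels from mono using the level difference only."""
    ratio = 10.0 ** (params["ild_db"] / 20.0)  # amplitude ratio left/right
    # Gains satisfy g_left / g_right = ratio and g_left + g_right = 2,
    # so the average of the two outputs equals the mono signal.
    g_right = 2.0 / (1.0 + ratio)
    g_left = ratio * g_right
    return [g_left * m for m in mono], [g_right * m for m in mono]
```

For a frame whose two channels differ only in level, this toy decoder recovers both channels from the mono signal and the single level parameter. Real signals would also need the correlation (and typically per-band processing) to restore the spatial impression.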
Abstract:
Multi-channel audio signals are coded into a monaural audio signal and information that allows the multi-channel audio signal to be recovered from the monaural audio signal. The information is generated by determining a first portion of the information for a first frequency region of the multi-channel audio signal, and by determining a second portion of the information for a second frequency region of the multi-channel audio signal. The second frequency region is a sub-range of the first frequency region. The information is multi-layered, enabling decoding quality to be scaled against bit rate.
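One possible reading of the layered scheme is sketched below. It is an assumption-laden toy: the abstract does not say how the two portions are computed, so the base layer here is a single average parameter for the whole first frequency region, and the second layer holds refinements for a sub-range of bands. The names `build_layers` and `decode_layers` are hypothetical.

```python
def build_layers(band_values, sub_start, sub_end):
    """Split per-band parameters into a coarse base layer and a
    refinement layer covering bands [sub_start, sub_end)."""
    # First portion: one value describing the whole first frequency region.
    base = sum(band_values) / len(band_values)
    # Second portion: corrections for a sub-range of that region.
    deltas = [v - base for v in band_values[sub_start:sub_end]]
    return base, (sub_start, deltas)


def decode_layers(num_bands, base, enhancement=None):
    """Decode with the base layer alone, or refine it when the
    enhancement layer was also received."""
    params = [base] * num_bands
    if enhancement is not None:
        start, deltas = enhancement
        for offset, delta in enumerate(deltas):
            params[start + offset] = base + delta
    return params
```

A decoder that receives only the base layer still produces a parameter for every band; receiving the second layer improves accuracy in the sub-range at the cost of extra bits.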
Abstract:
The audio reproduction apparatus (100) for sports training purposes comprises a tempo derivation unit (103) for deriving a selected tempo (T) on the basis of a data signal (d1, d2, d3), e.g. from a sports measurement device such as a heart rate meter; and an audio conditioning unit (104) arranged to deliver, based on the input audio signal, the output audio signal with a tempo within a predefined accepted deviation from the selected tempo (T). The audio conditioning unit (104) comprises a tempo calculation unit (106) arranged to calculate an input tempo (TI) of the input audio signal, and is arranged to deliver the output signal in dependence on the input tempo (TI).
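The relationship between the input tempo (TI), the selected tempo (T), and the accepted deviation can be illustrated as follows. This is a minimal sketch, not the apparatus itself: estimating tempo from known beat times, the 2 BPM default deviation, and the function names are all assumptions made for illustration.

```python
def input_tempo_bpm(beat_times_s):
    """Estimate the input tempo TI (beats per minute) from beat
    timestamps in seconds, using the mean inter-beat interval."""
    intervals = [b - a for a, b in zip(beat_times_s, beat_times_s[1:])]
    return 60.0 / (sum(intervals) / len(intervals))


def playback_rate(input_bpm, selected_bpm, accepted_deviation_bpm=2.0):
    """Return the time-scaling factor that brings the input tempo to
    the selected tempo T, or 1.0 if it is already within the
    accepted deviation (hypothetical default of 2 BPM)."""
    if abs(input_bpm - selected_bpm) <= accepted_deviation_bpm:
        return 1.0
    return selected_bpm / input_bpm
```

With beats every 0.5 s the input tempo is 120 BPM; a selected tempo of 150 BPM then calls for playing the track 1.25 times faster, while a selected tempo of 121 BPM leaves it unchanged.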
Abstract:
A basic idea of the present invention is to selectively employ one of at least two different feature extraction processes when generating a biometric template of an individual. An individual offers a physiological property, such as a fingerprint, an iris, an ear, a face, etc., from which biometric data can be derived, to a sensor of an enrollment authority. In the following, the property to be discussed will be fingerprints, even though any suitable biometric property may be used. From the fingerprint, a positional reference point of the biometric data is derived. The derivation of the positional reference point may be accomplished using any appropriate method out of a number of known methods. Such a reference point could be the location of a core, a delta, a weighted average of minutiae coordinates, or the like. Typically, the reference point includes a core of a fingerprint expressed as a three-dimensional coordinate (xr, yr, αr), where αr is an angle. A contribution indicator is calculated for the derived positional reference point, and it is determined whether the derived positional reference point can be considered reliable. Depending on the reliability of the derived reference point, one of the two different feature extraction processes is selected: the first feature set is extracted either using a method that is invariant to the derived reference point, or using a method that takes the derived reference point into account. The better the estimation of the reference point is, the more reliable the reference point-dependent extraction method is. Finally, the biometric template is generated using the extracted first feature set.
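The selection step described above can be sketched as a simple branch on reliability. Everything concrete here is an assumption for illustration: the `reliability` score stands in for the contribution indicator, the 0.5 threshold is arbitrary, and the two extractors (minutiae coordinates relative to the reference point, and sorted pairwise distances) are hypothetical examples of a reference-dependent and a reference-invariant method.

```python
import math


def extract_relative(minutiae, ref):
    """Reference-dependent features: minutiae coordinates expressed in
    the frame of the reference point (xr, yr, angle ar in radians)."""
    xr, yr, ar = ref
    c, s = math.cos(-ar), math.sin(-ar)
    out = []
    for x, y in minutiae:
        dx, dy = x - xr, y - yr
        out.append((round(c * dx - s * dy, 6), round(s * dx + c * dy, 6)))
    return sorted(out)


def extract_invariant(minutiae):
    """Reference-invariant features: sorted pairwise distances, which
    do not change under translation or rotation of the print."""
    dists = []
    for i in range(len(minutiae)):
        for j in range(i + 1, len(minutiae)):
            dists.append(round(math.dist(minutiae[i], minutiae[j]), 6))
    return sorted(dists)


def extract_features(minutiae, ref, reliability, threshold=0.5):
    """Pick the extraction method from the reference point's reliability."""
    if reliability >= threshold:
        return "reference-dependent", extract_relative(minutiae, ref)
    return "reference-invariant", extract_invariant(minutiae)
```

When the reference point is judged unreliable, the fallback features stay the same even if the whole print is shifted, which is the property that makes them usable without a trustworthy reference.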
Abstract:
The invention describes a method of deriving a set of features (S) of an audio input signal (M), which method comprises identifying a number of first-order features (f1, f2, ..., ff) of the audio input signal (M), generating a number of correlation values (ρ1, ρ2, ..., ρI) from at least part of the first-order features (f1, f2, ..., ff), and compiling the set of features (S) for the audio input signal (M) using the correlation values (ρ1, ρ2, ..., ρI). The invention further describes a method of classifying an audio input signal (M) into a group, and a method of comparing audio input signals (M, M′) to determine a degree of similarity between the audio input signals (M, M′). The invention also describes a system (1) for deriving a set of features (S) of an audio input signal (M), a classifying system (4) for classifying an audio input signal (M) into a group, and a comparison system (5) for comparing audio input signals (M, M′) to determine a degree of similarity between the audio input signals (M, M′).
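As a rough illustration of compiling a feature set from correlation values, the sketch below correlates every pair of first-order feature trajectories. The abstract does not specify the correlation measure or how the first-order features are sampled, so the Pearson coefficient, the per-frame value lists, and the names `pearson` and `feature_set` are assumptions.

```python
import math
from itertools import combinations


def pearson(a, b):
    """Pearson correlation of two equal-length sequences
    (returns 0.0 if either sequence is constant)."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def feature_set(first_order):
    """first_order maps a feature name to its per-frame values.
    Returns one correlation value per pair of first-order features."""
    names = sorted(first_order)
    return {
        (p, q): pearson(first_order[p], first_order[q])
        for p, q in combinations(names, 2)
    }
```

The resulting dictionary could then serve as the input to a classifier or to a similarity comparison between two signals, for example by comparing the correlation values pair by pair.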
Abstract:
The invention relates to a method for generating a representation of fingerprint minutiae information. The invention also relates to a method for generating a representation of a fingerprint for biometric template protection purposes. Biometric template protection techniques provide technological means to protect the privacy of biometric reference information stored in biometric systems. These methods stand in sharp contrast to approaches where biometric information is protected only by legislation and procedures around storage facilities. Such approaches are not reliable, as they are susceptible to human and procedural errors. Template protection guarantees the protection of biometric information without the assumption that individuals are trusted or procedures are properly implemented.
Abstract:
An apparatus for generating a binaural audio signal includes a de-multiplexer and decoder which receives audio data comprising an M-channel audio signal, which is a downmix of an N-channel audio signal, and spatial parameter data for upmixing the M-channel audio signal to the N-channel audio signal. A conversion processor converts spatial parameters of the spatial parameter data into first binaural parameters in response to at least one binaural perceptual transfer function. A matrix processor converts the M-channel audio signal into a first stereo signal in response to the first binaural parameters. A stereo filter generates the binaural audio signal by filtering the first stereo signal. The filter coefficients for the stereo filter are determined in response to the at least one binaural perceptual transfer function by a coefficient processor. The combination of parameter conversion/processing and filtering allows a high quality binaural signal to be generated with low complexity.