Abstract:
A speech wakeup method in which a bone conduction microphone collects a bone conduction signal for speech detection, the bone conduction signal including information about a command word input by a sound source, and a wakeup word is detected based on the bone conduction signal.
Abstract:
The invention relates to an audio signal processing apparatus for processing an input earpiece audio signal upon the basis of a microphone audio signal, the audio signal processing apparatus comprising a voice activity detector being configured to determine a voice activity indicator signal upon the basis of the input earpiece audio signal, a noise magnitude determiner being configured to determine a microphone noise magnitude indicator signal upon the basis of the microphone audio signal, a gain factor determiner being configured to determine a gain factor signal upon the basis of the voice activity indicator signal and the microphone noise magnitude indicator signal, and a weighter being configured to weight the input earpiece audio signal by the gain factor signal to obtain an output earpiece audio signal.
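The signal flow in this abstract (voice activity indicator, microphone noise magnitude, gain factor, weighting) can be sketched per frame as below. This is only an illustration under stated assumptions, not the claimed implementation: the energy-threshold detector, the RMS noise estimate, the linear gain mapping, and every name and constant (`frame_rms`, `voice_activity`, `gain_factor`, `noise_ref`, `max_gain`) are hypothetical choices made here.

```python
import math

def frame_rms(frame):
    """Root-mean-square magnitude of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))

def voice_activity(earpiece_frame, threshold=0.01):
    """Hypothetical energy-based detector: 1.0 if the earpiece frame
    exceeds the threshold, else 0.0."""
    return 1.0 if frame_rms(earpiece_frame) > threshold else 0.0

def gain_factor(vad, noise_rms, noise_ref=0.05, max_gain=4.0):
    """Hypothetical mapping: gain grows with microphone noise, but only
    while voice is active; otherwise the signal passes unchanged."""
    if vad == 0.0:
        return 1.0
    return min(max_gain, 1.0 + noise_rms / noise_ref)

def process_frame(earpiece_frame, mic_frame):
    """Weight the input earpiece frame by the gain factor."""
    vad = voice_activity(earpiece_frame)   # voice activity indicator
    noise = frame_rms(mic_frame)           # microphone noise magnitude
    g = gain_factor(vad, noise)            # gain factor
    return [g * x for x in earpiece_frame] # output earpiece frame
```

With these assumed constants, a frame of far-end speech heard in a noisy environment is amplified, while a silent earpiece frame is left at unit gain so that noise in the earpiece path is not boosted.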
Abstract:
An apparatus and a method for enhancing the spatial perception of an audio signal by creating increased interaural level differences are provided. To obtain this effect, two dipoles are used: one for producing a left audio signal and one for producing a right audio signal.
Abstract:
A virtual stereo synthesis method includes acquiring at least one sound input signal on a first side and at least one sound input signal on a second side; separately performing ratio processing on a preset head related transfer function (HRTF) left-ear component and a preset HRTF right-ear component of each sound input signal on the second side, to obtain a filtering function of each sound input signal on the second side; separately performing convolution filtering on each sound input signal on the second side and the filtering function of that sound input signal, to obtain a filtered signal on the second side; and synthesizing all of the sound input signals on the first side and all of the filtered signals on the second side into a virtual stereo signal. The method may alleviate a coloration effect and reduce calculation complexity.
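The steps in this abstract (ratio processing, convolution filtering, synthesis) can be sketched for one output ear as below. This is a rough illustration, not the patented method. It assumes the "ratio processing" is a bin-wise division of the left-ear HRTF spectrum by the right-ear HRTF spectrum, computed with a naive DFT over short impulse responses of equal length. The direction of the ratio and all names (`ratio_filter`, `synthesize_one_ear`) are hypothetical.

```python
import cmath

def dft(x):
    """Naive discrete Fourier transform (adequate for short responses)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(X):
    """Naive inverse DFT, returning the real part."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]

def ratio_filter(hrtf_left, hrtf_right, eps=1e-9):
    """Assumed ratio processing: impulse response whose spectrum is
    H_left / H_right, guarding against near-zero denominators."""
    HL, HR = dft(hrtf_left), dft(hrtf_right)
    return idft([l / (r if abs(r) > eps else eps) for l, r in zip(HL, HR)])

def convolve(x, h):
    """Direct-form linear convolution."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

def synthesize_one_ear(first_side, second_side, hrtfs):
    """Sum unfiltered first-side signals with filtered second-side signals.
    hrtfs[i] is a (left_ir, right_ir) pair for second_side[i].
    The output is truncated to the longest input length (a simplification)."""
    length = max(len(s) for s in first_side + second_side)
    out = [0.0] * length
    for s in first_side:
        for i, v in enumerate(s):
            out[i] += v
    for s, (hl, hr) in zip(second_side, hrtfs):
        filtered = convolve(s, ratio_filter(hl, hr))[:length]
        for i, v in enumerate(filtered):
            out[i] += v
    return out
```

In this sketch only the second-side signals are filtered, so each output ear needs one convolution per second-side input. That is consistent with the abstract's claim of reduced calculation complexity.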
Abstract:
A system and a method are provided for evaluating an acoustic transfer function, wherein the acoustic transfer function is a transfer function from one acoustic source to a reproduction area sampled by a limited number of microphone modules.
Abstract:
A method for parametric spatial audio coding of a multi-channel audio signal comprising a plurality of audio channel signals is provided, the method comprising: calculating at least two different spatial coding parameters for an audio channel signal of the plurality of audio channel signals; selecting at least one spatial coding parameter of the at least two different spatial coding parameters associated with the audio channel signal on the basis of the values of the calculated spatial coding parameters; including a quantized representation of the selected spatial coding parameter in a parameter section of an audio bitstream; and setting a parameter type flag in the parameter section of the audio bitstream indicating the type of the selected spatial coding parameter included in the audio bitstream.
Abstract:
An embodiment of the present invention provides a method for generating a downmixed signal, including: performing a time-frequency transform on a received left sound channel signal and a received right sound channel signal to obtain a frequency domain signal, and dividing the frequency domain signal into several frequency bands; calculating a sound channel energy ratio and a sound channel phase difference of each frequency band; calculating a phase difference between the downmixed signal and a first sound channel signal in each frequency band according to the sound channel energy ratio and the sound channel phase difference; and calculating a frequency domain downmixed signal according to the left sound channel signal, the right sound channel signal, and the phase difference between the downmixed signal and the first sound channel signal in each frequency band. This method effectively improves quality of stereo encoding and decoding.
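The per-band computation in this abstract can be sketched as below, starting from frequency-domain bins that are already grouped into one band (the time-frequency transform is taken as given). The formulas are assumptions for illustration, not the formulas of the embodiment. The phase offset is taken as the phase of L + R relative to L for a band that behaves like a single sinusoid, and the downmix magnitude as the mean of the channel magnitudes. All function names are hypothetical.

```python
import cmath
import math

def band_parameters(left_bins, right_bins):
    """Sound channel energy ratio E_L / E_R and phase difference
    arg(L) - arg(R) for one frequency band."""
    e_l = sum(abs(x) ** 2 for x in left_bins)
    e_r = sum(abs(x) ** 2 for x in right_bins)
    energy_ratio = e_l / e_r if e_r > 0 else float("inf")
    cross = sum(l * r.conjugate() for l, r in zip(left_bins, right_bins))
    return energy_ratio, cmath.phase(cross)

def downmix_phase_offset(energy_ratio, phase_diff):
    """Assumed phase of the downmix relative to the left (first) channel,
    derived only from the energy ratio and the phase difference."""
    if math.isinf(energy_ratio):
        return 0.0  # right channel is silent: follow the left phase
    return cmath.phase(math.sqrt(energy_ratio) + cmath.exp(-1j * phase_diff))

def downmix_band(left_bins, right_bins):
    """Frequency-domain downmix for one band: mean magnitude of the two
    channels, left-channel phase rotated by the computed offset."""
    ratio, ipd = band_parameters(left_bins, right_bins)
    theta = downmix_phase_offset(ratio, ipd)
    return [0.5 * (abs(l) + abs(r)) * cmath.exp(1j * (cmath.phase(l) + theta))
            for l, r in zip(left_bins, right_bins)]
```

Under these assumptions, two equal-energy channels that are 90 degrees apart yield a downmix bin whose phase lies halfway between them. The magnitude is preserved rather than partially cancelled as it would be in a plain (L + R) / 2 sum.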
Abstract:
An audio signal processing apparatus for processing an input audio signal is provided, the apparatus comprising a plurality of filters, each filter configured to filter the input audio signal to obtain a plurality of filtered audio signals, each filter designed according to an extended mode matching beamforming applied to a surface of a half revolution, the surface partially characterizing a loudspeaker enclosure shape, a plurality of scaling units, each scaling unit configured to scale the plurality of filtered audio signals using a plurality of gain coefficients to obtain a plurality of scaled filtered audio signals, and a plurality of adders, each adder configured to combine the plurality of scaled filtered audio signals, thereby providing an output audio signal for producing a sound field having a beam directivity pattern defined by the plurality of gain coefficients.
Abstract:
A wave field synthesis apparatus for driving an array of loudspeakers with drive signals includes a sound field synthesizer for generating sound field drive signals for causing the array of loudspeakers to generate one or more sound fields at one or more audio zones, a binaural renderer for generating binaural drive signals for causing the array of loudspeakers to generate specified sound pressures at at least two positions, wherein the at least two positions are determined based on a detected position and/or orientation of a listener, and a decision unit for deciding whether to generate the drive signals using the sound field synthesizer or using the binaural renderer.