Abstract:
In a video conferencing system, camera position means (10) are used to point the camera at the person who is speaking. To find the correct direction for the camera, the system must determine the position from which the sound originates. This can be done by using at least two microphones that receive the speech signal. By measuring the delay between the signals received by the microphones, the position of the speaker can be determined. According to the present invention, the delay is determined by first determining the impulse responses (h1) and (h2) and then calculating a cross-correlation function between the impulse responses (h1) and (h2). The delay value is determined from the main peak in the cross-correlation function.
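The delay step described in this abstract can be illustrated with a minimal sketch. It assumes the two impulse responses are already available as NumPy arrays (the abstract does not say how they are estimated), takes the largest absolute value as the "main peak", and converts the delay to a bearing using a far-field, two-microphone model. The function names, the sampling rate, the microphone spacing, and the speed of sound are illustrative assumptions, not details from the source.

```python
import numpy as np


def delay_from_impulse_responses(h1, h2):
    """Return the lag in samples by which h2 trails h1 (negative if h2 leads).

    The lag is read off the main peak of the cross-correlation between the
    two impulse responses, as outlined in the abstract.
    """
    h1 = np.asarray(h1, dtype=float)
    h2 = np.asarray(h2, dtype=float)
    # In "full" mode, output index 0 corresponds to lag -(len(h1) - 1).
    xcorr = np.correlate(h2, h1, mode="full")
    lags = np.arange(-(len(h1) - 1), len(h2))
    # Assumption: the main peak is the sample with the largest magnitude.
    return int(lags[np.argmax(np.abs(xcorr))])


def bearing_from_delay(delay_samples, fs, mic_spacing_m, c=343.0):
    """Hypothetical far-field conversion of a delay to an angle in degrees.

    fs is the sampling rate in Hz, mic_spacing_m the microphone distance in
    metres, and c an assumed speed of sound in m/s.
    """
    s = np.clip(c * delay_samples / fs / mic_spacing_m, -1.0, 1.0)
    return float(np.degrees(np.arcsin(s)))


# Synthetic example: direct path plus one reflection per microphone.
h1 = np.zeros(32)
h1[5], h1[12] = 1.0, 0.3
h2 = np.zeros(32)
h2[9], h2[17] = 0.8, 0.2

delay = delay_from_impulse_responses(h1, h2)
angle = bearing_from_delay(delay, fs=16000, mic_spacing_m=0.2)
```

Correlating the impulse responses rather than the raw microphone signals means the peak reflects the difference between the two direct paths; the reflections in the synthetic data only produce smaller side peaks.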
Abstract:
Audio reproduction systems are used to reproduce audio signals. A disadvantage of known audio reproduction systems is that, in most cases with stereo or multi-channel signals, the quality of reproduction depends heavily on the listening position. The invention addresses this by enabling the audio reproduction system to locate the listener and adapt the reproduced audio signal to that location.
Abstract:
A device for and method of calibrating a microphone, comprising a loudspeaker (3) for converting a loudspeaker input signal (5) into sound; a microphone (4) for converting received sound into a microphone output signal (6); and calibration means for calibrating an output power of the microphone relative to a desired power level. The calibration means comprise impulse response estimating means (7) for estimating an acoustic impulse response of the microphone by correlating the microphone output signal (6) and the loudspeaker input signal (5) when the microphone (4) receives the sound from the loudspeaker (3), whereby the output power of the microphone (4) is estimated.
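As a rough illustration of the correlation-based estimate described in this abstract, the sketch below assumes the loudspeaker input is white noise, so that the cross-correlation between input and microphone output is proportional to the impulse response. The output power is then estimated from the impulse response energy and compared with a desired level to obtain a gain. The function names, the white-noise assumption, and the way the gain is derived are illustrative choices, not details taken from the source.

```python
import numpy as np


def estimate_impulse_response(x, y, n_taps):
    """Estimate the first n_taps of the acoustic impulse response.

    x: loudspeaker input signal, y: microphone output signal (same length).
    Assumes x is approximately white, so r_xy[k] is proportional to h[k].
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    energy = np.dot(x, x)
    h = np.empty(n_taps)
    for k in range(n_taps):
        # Cross-correlation at lag k, normalised by the input energy.
        h[k] = np.dot(x[: n - k], y[k:n]) / energy
    return h


def calibration_gain(x, h, desired_power):
    """Hypothetical gain that brings the estimated output power to desired_power."""
    # For white input, output power is input power times impulse response energy.
    estimated_power = np.mean(np.square(x)) * np.sum(np.square(h))
    return float(np.sqrt(desired_power / estimated_power))


# Synthetic example with a made-up three-tap acoustic path.
rng = np.random.default_rng(0)
x = rng.standard_normal(20000)
h_true = np.array([0.5, 0.25, -0.1])
y = np.convolve(x, h_true)[: len(x)]

h_est = estimate_impulse_response(x, y, n_taps=3)
gain = calibration_gain(x, h_est, desired_power=1.0)
```

Because the power is derived from the correlation with the loudspeaker signal, sound that is uncorrelated with that signal (for example, room noise) contributes little to the estimate under these assumptions.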
Abstract:
The invention proposes extracting one or more speech signals (151-154) as well as one or more ambient signals (131) from sound signals captured by microphones, wherein each of the speech signals corresponds to a different speaker. The invention proposes to transmit both the one or more speech signals (151-154) and the one or more ambient signals (131) to a rendering side, as opposed to sending only speech signals. This makes it possible to reproduce the speech and ambient signals in spatially different ways at the rendering side. Reproducing the ambient signals creates a feeling of "being together". In an embodiment, the invention enables two or more speech signals to be reproduced at positions spatially different from each other and from the ambient signals, so that speech intelligibility is increased despite the presence of the ambient signals.
Abstract:
The present invention relates to a transmitting communication device comprising: a camera for capturing an image sequence with a display range; a sensor for capturing environmental events that are not perceptible within the display range; a processor for computing a control signal from the environmental events; and a transmitting unit for transmitting the captured image sequence and the control signal to a receiving communication device.
Abstract:
A method is described in which multiple input signals are subjected to a combined process of adaptive beamforming and adaptive echo cancelling, and in which, for each of the input signals, an individual processing history of adaptive echo cancelling data is kept and combined with current adaptive beamforming data. Accordingly, an audio processing device is described which comprises one or more parallel acoustic paths for providing respective input signals, the acoustic paths being connected in series to beamformer paths. The device comprises an adaptive beamformer and an adaptive echo canceller for performing adaptive beamforming and adaptive echo cancelling respectively, the adaptive echo canceller being provided with storage means for storing, for every input signal, an individual processing history of adaptive echo cancelling data for combination with current adaptive beamforming data. The beamforming and echo cancelling techniques can thus be combined so that fewer calculations are needed.
Abstract:
A digital image (110) which is displayed to a user (118) is modified to include an aspect (120) of at least one detected characteristic of the user (118), to give the user (118) the impression of being present within the scene displayed by the digital image (110).