摘要:
A audio conference device capable of detecting a talker's direction exactly and collecting a sound emitted from this direction at a high signal S/N ratio is provided. A detecting beam generating portion 811 applies a delay-sum process to sound collecting signal SS104 to SS113 of microphones MIC104 to MIC113 that are aligned densely in a center portion in the alignment direction, and generates detecting sound collecting beam signals MB101 to MB114. A outputting beam generating portion 812 applies a delay-sum process to sound collecting signals SS101 to SS116 of all microphones MIC101 to MIC116 that are aligned in the alignment direction to generate outputting collecting beam signals MB101′ to MB114′. A sound collecting beam selecting portion 19 detects direction data MS corresponding to a sound collecting beam signal, which has the highest signal density, out of the detecting sound collecting beam signals MB101 to MB114, and feeds the sound collecting beam signal to an outputting beam selecting portion 813. The outputting beam selecting portion 813 selects a sound collecting beam signal corresponding to the direction data MS.
摘要:
A speaker array and microphone arrays positioned on both sides of the speaker array are provided. A plurality of focal points each serving as a position of a talker are set in front of the microphone arrays respectively symmetrically with respect to a centerline of the speaker array, and a bundle of sound collecting beams is output toward the focal points. Difference values between sound collecting beams directed toward the focal points that are symmetrical with respect to the centerline are calculated to cancel sound components that detour from the speaker array to microphones. Then, it is estimated based on totals of squares of peak values of the difference values for a particular time period that the position of the talker is close to which one of the focal points, and the position of the talker is decided by comparing the totals of the squares of the peak values of the sound collecting beams directed to the focal points that are symmetrical mutually.
摘要:
A speaker array and microphone arrays positioned on both sides of the speaker array are provided. A plurality of focal points each serving as a position of a talker are set in front of the microphone arrays respectively symmetrically with respect to a centerline of the speaker array, and a bundle of sound collecting beams is output toward the focal points. Difference values between sound collecting beams directed toward the focal points that are symmetrical with respect to the centerline are calculated to cancel sound components that detour from the speaker array to microphones. Then, it is estimated based on totals of squares of peak values of the difference values for a particular time period that the position of the talker is close to which one of the focal points, and the position of the talker is decided by comparing the totals of the squares of the peak values of the sound collecting beams directed to the focal points that are symmetrical mutually.
摘要:
A position detecting system is provided, which is capable of effectively preventing erroneous detection of audio to be measured. The position detecting system includes a terminal device that inputs an audio signal from an audio device and a microphone. The audio device sequentially inputs measurement audio signals that have been formed by two or more audio signals of different frequencies to a speaker and receives a notification signal, wherein the report signal indicates that the audio of the measurement audio signal has been collected from the terminal device. The audio device clocks a time t1 and a time t2, namely clocks after the audio of the measurement audio signal is output from the speakers SP1-SP2 until the notification signals of the measurement audio signals are received by the signal receiving unit. The audio device calculates the position of the microphone by using the times t1 and t2. For each frequency component of the measurement audio signal, when an audio signal exceeding a predetermined level is inputted from the microphone, the terminal device detects it as a component of the measurement audio signal and transmits a notification signal upon detection of the measurement audio signal.
摘要:
A voice emitting and collecting device that is capable of picking up/outputting a voice emitted from a talker at a high S/N ratio by eliminating the influence of a diffracting voice despite a simple configuration is provided. A signal differencing circuit 191 outputs difference signals MS1 to MS4 between voice collecting beam signals MB11 to MB14 and voice collecting beam signals MB21 to MB24. A level comparator 195 selects the difference signal having a maximum level. A signal selecting circuit 196 selects voice collecting beam signals MB1x, MB2x of the difference signal MS that is selected/pointed by the level comparator 195. A subtracter 199 subtracts the voice collecting beam signal MB2x from the voice collecting beam signal MB1x, and output a resultant signal. Accordingly, main components of the diffracting voice can be removed from the voice collecting beam signal.
摘要:
An adaptive filter generates a pseudo echo sound signal based on a sound emission sound signal. An adder subtracts the pseudo echo sound signal from a low band component of a collected sound signal, thereby generating a sound signal with a first-adjusted low band component. An echo spectrum estimation section estimates and calculates a frequency spectrum of a reverberation echo this time from a spectrum of the pseudo echo sound signal this time, a frequency spectrum of the preceding reverberation echo, and an update coefficient based on an audio environment. An adder subtracts the frequency spectrum of the reverberation echo and the frequency spectrum of stationary noise from a spectrum of the sound signal with the first-adjusted low band component.
摘要:
A video conference device capable of suppressing a processing burden of an echo canceller in such a situation that speakers, microphones, and a camera are arranged in close vicinity of a monitor is provided. A preliminary filter portion 18 is provided in a preceding stage of an echo canceller 19. The preliminary filter portion 18 has an LPF 181, a fixed filter 182, and a post processor 183. A controlling portion 14 sets a filter coefficient corresponding to a sound collecting beam signal that a signal selecting portion 17 selected, in the fixed filter 182. This filter coefficient is set to simulate a transfer function of an acoustic transfer system that feedbacks from the speakers to the microphones. A component of a low frequency band (e.g., 1 kHz or less) out of sound signals (input sound signals) being input into the speakers is input into the fixed filter 182, and a pseudo signal is produced. The pseudo signal (feedback component) is removed by the post processor 183, and a corrected sound collecting beam signal MSs is produced.
摘要:
A level ratio calculation circuit calculates average signal level data of signal level data corresponding to each sound collection beam signal, and calculates a level ratio between the average signal level data and each of the signal level data. Since a diffraction sound is substantially equal to all the signal level data, a diffraction sound component of the average signal level data also becomes substantially equal. On the other hand, a collection sound from a speaker is specific to the signal level data of the corresponding sound collection beam signal. Therefore, at the level ratio, the portion corresponding to the diffraction sound is flat and a data level becomes high locally in only the portion corresponding to the collection sound. By using this, the sound collection beam signal including the collection sound is detected.
摘要:
A voice emitting and collecting device that is capable of picking up/outputting a voice emitted from a talker at a high S/N ratio by eliminating the influence of a diffracting voice despite a simple configuration is provided. A signal differencing circuit 191 outputs difference signals MS1 to MS4 between voice collecting beam signals MB11 to MB14 and voice collecting beam signals MB21 to MB24. A level comparator 195 selects the difference signal having a maximum level. A signal selecting circuit 196 selects voice collecting beam signals MB1x, MB2x of the difference signal MS that is selected/pointed by the level comparator 195. A subtracter 199 subtracts the voice collecting beam signal MB2x from the voice collecting beam signal MB1x, and output a resultant signal. Accordingly, main components of the diffracting voice can be removed from the voice collecting beam signal.
摘要:
Microphone arrays, which are formed by arranging a plurality of microphones, are provided on a front side and a rear side of a housing, respectively. A virtual focus is set for each of the microphone arrays in a direction opposite to a direction in which sound is picked-up sound signals picked up by the plurality of microphones are delayed such that distances to the virtual focus are the same, and the delayed sound signals are synthesized. Therefore, sound in a sound-pickup area of a predetermined angle on each of the front side and the rear side can be picked up at a high level, and even though there is a noise source in areas other than the sound-pickup area, noise from the noise source is not picked up.