Abstract:
An audio signal processing system including a time-frequency conversion unit which converts an audio signal in time domain into frequency domain in frame units so as to calculate a frequency spectrum of the audio signal, a spectral change calculation unit which calculates an amount of change between a frequency spectrum of a first frame and a frequency spectrum of a second frame before the first frame based on the frequency spectrum of the first frame and the frequency spectrum of the second frame, and a judgment unit which judges the type of the noise which is included in the audio signal of the first frame in accordance with the amount of spectral change.
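The frame-to-frame comparison described above can be sketched as follows. This is a minimal illustration, not the patented method: the change measure (normalized sum of absolute per-bin power differences), the two noise labels, the threshold of 0.5, and all function names are assumptions introduced here.

```python
import cmath
import math


def power_spectrum(frame):
    """Power spectrum of one frame via a direct DFT (bins 0..N/2)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_change(current, previous):
    """Assumed change measure: summed absolute per-bin power difference,
    normalized by the total power of both frames (range 0..1)."""
    diff = sum(abs(c - p) for c, p in zip(current, previous))
    total = sum(current) + sum(previous)
    return diff / total if total > 0 else 0.0


def judge_noise_type(change, threshold=0.5):
    """Hypothetical two-way judgment based on the amount of change."""
    return "non-stationary" if change > threshold else "stationary"


def make_tone(freq_bin, n=64):
    """Test signal: a sine wave centered on one DFT bin."""
    return [math.sin(2 * math.pi * freq_bin * t / n) for t in range(n)]


second_frame = power_spectrum(make_tone(4))       # earlier frame
first_frame_same = power_spectrum(make_tone(4))   # spectrum unchanged
first_frame_new = power_spectrum(make_tone(12))   # energy moved to another bin

print(judge_noise_type(spectral_change(first_frame_same, second_frame)))
print(judge_noise_type(spectral_change(first_frame_new, second_frame)))
```

A real system would likely use an FFT with windowing and overlapping frames; the direct DFT here only keeps the sketch free of dependencies.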
Abstract:
A voice activity detector detects talkspurts in a given signal with high accuracy, so as to improve the quality of voice communication. A frequency spectrum calculator calculates the frequency spectrum of a given input signal. A flatness evaluator evaluates the flatness of this spectrum by, for example, calculating the average of the power spectral components and then adding up the differences between those components and the average. The resultant sum of differences, in this case, is used as a flatness factor of the spectrum. A voice/noise discriminator determines whether or not the input signal contains a talkspurt by comparing the flatness factor of the frequency spectrum with a predetermined threshold.
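The flatness evaluation above can be sketched as follows, under two assumptions not stated in the abstract: the differences are taken as absolute values (signed differences from the average would sum to zero), and a factor above the threshold is read as a talkspurt, since a peaky spectrum deviates more from its average than flat noise does. The function names, sample spectra, and threshold are hypothetical.

```python
def flatness_factor(power_spectrum):
    """Sum of absolute differences between each component and the average.

    Assumption: absolute values are used. A small result means a flat
    spectrum; a large result means a peaky one.
    """
    mean = sum(power_spectrum) / len(power_spectrum)
    return sum(abs(p - mean) for p in power_spectrum)


def contains_talkspurt(power_spectrum, threshold):
    """Assumed decision rule: a peaky (non-flat) spectrum indicates voice."""
    return flatness_factor(power_spectrum) > threshold


# Illustrative spectra (made-up values, arbitrary power units).
flat_noise = [1.0, 1.1, 0.9, 1.0, 1.0, 0.9, 1.1, 1.0]
voiced = [0.1, 6.0, 0.2, 0.1, 3.0, 0.1, 0.2, 0.1]

print(contains_talkspurt(flat_noise, threshold=2.0))
print(contains_talkspurt(voiced, threshold=2.0))
```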
Abstract:
A communication apparatus includes an image capturing unit configured to capture a face image of a user; a contour extraction unit configured to extract a face contour from the face image captured by the image capturing unit; an ear position estimation unit configured to estimate positions of ears of the user on the basis of the extracted face contour; a distance estimation unit configured to estimate a distance between the communication apparatus and the user on the basis of the extracted face contour; a sound output unit configured to output sound having a directivity; and a control unit configured to control an output range of sound output from the sound output unit on the basis of the positions of ears of the user estimated by the ear position estimation unit and the distance between the communication apparatus and the user estimated by the distance estimation unit.
Abstract:
In an echo processing method and device which can detect an accurate echo section without effects of a far end signal, an echo delay, and a reduction of an echo cancellation amount, a signal of a specified frequency band is generated in conformity with a near end signal, and the signal of the specified frequency band is added to the near end signal to form a transmitting signal. Receiving signals are separated into the signal of the specified frequency band and a signal of a band other than the specified frequency band. An echo section is detected based on the separated signal of the specified frequency band. An echo component in the signal of the band other than the specified frequency band is removed, and a level of the echo component is detected based on the near end signal in the echo section. Each step may be performed with a digital signal; the transmitting signal may be converted into an analog signal and input to a 2-wire/4-wire converter, and the receiving signal output from the 2-wire/4-wire converter may be converted into a digital signal.
Abstract:
In a wireless communication apparatus, an ambient apparatus number measurement unit measures the number of wireless apparatuses in its vicinity. A movement amount measurement unit detects acceleration when its own apparatus moves and measures the movement amount of its own apparatus. An ambient sound analysis unit records ambient sounds and analyzes the recorded ambient sounds to generate ambient sound analysis information. A communication controller estimates the surroundings of its own apparatus based on at least one of the measured number of wireless apparatuses, the measured movement amount, and the ambient sound analysis information, and autonomously performs communication control of its own apparatus according to the estimated surroundings.
Abstract:
An optical device includes a fast Fourier transform (FFT) unit, a signal-to-noise ratio (SNR) calculation processing unit, a band selecting unit, an extension-signal creating unit, an addition unit, and an inverse fast Fourier transform (IFFT) unit. The FFT unit performs the fast Fourier transform on an externally supplied input signal. The SNR calculation processing unit calculates an SNR for each band of the input signal. The band selecting unit selects, based on the respective SNRs of the bands, the band whose SNR exceeds a threshold and is the maximum. The extension-signal creating unit creates an extension signal based on a signal acquired by the band selecting unit. The addition unit adds the extension signal to the input signal to create a band-extended signal. The IFFT unit performs the inverse fast Fourier transform on the band-extended signal to create an output signal.
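The band selection rule above (the band with the maximum SNR, provided that SNR exceeds a threshold) can be sketched as follows. The function names, the dB formulation of the SNR, and the example values are assumptions for illustration; the abstract does not say how the SNR is computed or what happens when no band qualifies, so this sketch returns `None` in that case.

```python
import math


def band_snr_db(signal_power, noise_power):
    """Assumed SNR definition: power ratio expressed in decibels."""
    return 10.0 * math.log10(signal_power / noise_power)


def select_band(band_snrs, threshold):
    """Return the index of the band with the maximum SNR if that SNR
    exceeds the threshold; otherwise return None (assumed fallback)."""
    if not band_snrs:
        return None
    best = max(range(len(band_snrs)), key=lambda i: band_snrs[i])
    return best if band_snrs[best] > threshold else None


# Hypothetical per-band (signal power, noise power) pairs.
bands = [(2.0, 1.0), (100.0, 1.0), (10.0, 1.0)]
snrs = [band_snr_db(s, n) for s, n in bands]

print(select_band(snrs, threshold=6.0))
print(select_band(snrs, threshold=30.0))
```

Only the maximum needs to be compared against the threshold: if the largest SNR does not exceed it, no band does.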
Abstract:
A data embedding device for embedding data in a speech code, obtained by encoding speech in accordance with a speech encoding method based on the human voice generation process, includes an embedding judgment unit that judges, for every speech code, whether or not data should be embedded in the speech code, and an embedding unit that embeds data in two or more of the plurality of parameter codes constituting each speech code for which the embedding judgment unit judges that data should be embedded.
Abstract:
Communications from a transmission side to a reception side are controlled using information obtained on the reception side, without changing the format of voice code data, requiring another transmission path, or increasing the transmission quantity of control information. A system includes a first communication equipment provided with a control information embedding unit that embeds, in the communication data to be transmitted to a communication partner, control information that is obtained on the first communication equipment's own side and is used to control communications from the communication partner to the first communication equipment, and a second communication equipment provided with a communication control unit that controls communications to the first communication equipment using the control information transmitted from the first communication equipment.