摘要:
An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
摘要:
An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
摘要:
A method for detecting barge-in in a speech dialog system comprising determining whether a speech prompt is output by the speech dialog system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialog system configured to detect barge-in is also disclosed.
摘要:
Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.
摘要:
A method for detecting barge-in in a speech dialogue system comprising determining whether a speech prompt is output by the speech dialogue system, and detecting whether speech activity is present in an input signal based on a time-varying sensitivity threshold of a speech activity detector and/or based on speaker information, where the sensitivity threshold is increased if output of a speech prompt is determined and decreased if no output of a speech prompt is determined. If speech activity is detected in the input signal, the speech prompt may be interrupted or faded out. A speech dialogue system configured to detect barge-in is also disclosed.
摘要:
An arrangement is described for measuring performance characteristics of a hands free telephone system. There is a measurement system which is coupleable over a telephone audio interface directly to the hands free telephone system for measuring the performance characteristics.
摘要:
A speech processing device includes an automotive device that filters data that is sent and received across an in-vehicle bus. The device selectively acquires vehicle data related to a user settings or adjustments. An interface acquires the selected vehicle data from in-vehicle sensors in response to a user's articulation of a first code phrase. A memory stores the selected vehicle data with unique identifying data associated with the user and establishes a connection between the selected vehicle data and the user when a second code phrase is articulated. A data interface provides access to the selected vehicle data and stored relationship data and enables the processing of the data to customize the in-vehicle system. The data interface is responsive to the user's articulation of a third code phrase to process the selected vehicle data that enables the setting or adjustment of the in-vehicle system.
摘要:
An approach for adjusting an adjustable element, such as a mirror, head rest, steering wheel, heating/air condition blower, associated with a vehicle by determining the position of a speaker in the vehicle.
摘要:
A system and method of signal combining that supports different speakers in a noisy environment is provided. Particularly for deviations in the noise characteristics among the channels, various embodiments ensure a smooth transition of the background noise at speaker changes. A modified noise reduction (NR) may achieve equivalent background noise characteristics for all channels by applying a dynamic, channel specific, and frequency dependent maximum attenuation. The reference characteristics for adjusting the background noise may be specified by the dominant speaker channel. In various embodiments, an automatic gain control (AGC) with a dynamic target level may ensure similar speech signal levels in all channels.
摘要:
The invention provides a computer-implemented method for determining a time delay for time delay compensation of a microphone signal from a microphone array in a beamformer arrangement. For a given time, an instantaneous estimate of a position of a wanted sound source and/or of a direction of arrival of a signal originating from the wanted sound source is determined. The computer system then determines whether the instantaneous estimate deviates from a preset estimate of a position of the wanted sound source and/or of a direction of arrival of a signal originating from the wanted sound source according to a predetermined criterion. The predetermined criterion comprises a check whether the instantaneous estimate deviates from the preset estimate by at least a predetermined deviation threshold. If the predetermined criterion is fulfilled, the instantaneous estimate for the given time is set by the computer system as the preset estimate, and the computer system determines the time delay for time delay compensation of the microphone signal based on the instantaneous estimate.