摘要:
A device implementing a system for processing speech in an audio signal includes at least one processor configured to receive an audio signal corresponding to at least one microphone of a device, and to determine, using a first model, a first probability that a speech source is present in the audio signal. The at least one processor is further configured to determine, using a second model, a second probability that an estimated location of a source of the audio signal corresponds to an expected position of a user of the device, and to determine a likelihood that the audio signal corresponds to the user of the device based on the first and second probabilities.
摘要:
A device may include microphones worn on a head of a user. The device may include a processor, configured to obtain microphone signals from the plurality of microphones. The processor may attenuate breathing sound from the user by processing the microphone signals, resulting in attenuated microphone signals. The processor may render one or more output audio channels based on the plurality of attenuated microphone signals.
摘要:
An appliance can include a microphone transducer, a processor, and a memory storing instructions. The appliance is configured to receive an audio signal at the microphone transducer and to detect an utterance in the audio signal. The appliance is further configured to classify a speech mode based on the utterance. The appliance is further configured to determine conditions of an environment of the appliance. The appliance is further configured to select at least one of a playback volume or a speech output mode from a plurality of speech output modes based on the classification, and the conditions of the environment of the appliance. The appliance is further configured to adapt the playback volume and/or mode of played-back speech according to the speech output mode. The appliance may be configured to synthesize speech according to the speech output mode, or to modify synthesized speech according to the speech output mode.
摘要:
An audio system has a housing in which are integrated a number of microphones. A programmed processor accesses the microphone signals and produces a number of acoustic pick up beams. A number of separation values are computed, each being a measure of the difference between strength of a respective beam and strength of a noise reference input signal. One of the beams is selected whose separation value is the largest, and the selected beam is applied to a first input of a two-channel noise suppression process, while the noise reference input signal is applied to the second input of the noise suppression process. Other embodiments are also described and claimed.
摘要:
A device that includes a microphone may be worn in or on an ear of a user. A microphone signal generated by the microphone may be processed to determine a heart activity of a user. An indication of a heart pathology may be detected by applying a predictive algorithm to at least the heart activity. Other aspects are described.
摘要:
A plurality of microphone signals can be obtained. In the plurality of microphone signals, speech of a user can be detected. A gaze of a user can be determined based on the plurality of microphone signals. A voice activated response of the computing device can be performed in response to the gaze of the user being directed at the computing device. Other aspects are described and claimed.
摘要:
An orientation detector can have a first microphone, a second microphone, and a reference microphone spaced from the first microphone and the second microphone. An orientation processor can be configured to determine an orientation of the first microphone, the second microphone, or both, relative to a user's mouth based on a comparison of a relative strength of a first signal associated with the first microphone to a relative strength of a second signal associated with the second microphone. A channel selector in a speech enhancer can select one signal from among several signals based at least in part on the orientation determined by the orientation processor. A mobile communication handset can include a microphone-based orientation detector of the type disclosed herein.
摘要:
An audio device may use the audio detected at two opposite facing, front and rear omnidirectional microphones to determine the angular directional location of a user's voice while the device in speaker mode or audio command input mode. The angular directional location may be determined to be at front, side and rear locations of the device during the period of time by calculating an energy ratio of audio signals output by the front and rear microphones during the period. Comparing the ratio to experimental data for sound received from different directions around the device may provide the location of the user's voice. Based on the determination, audio beamforming input settings may be adjusted for user voice beamforming. As a result, the device can perform better beamforming to combine the signals captured by the microphones and generate a single output that isolates the user's voice from background noise.
摘要:
An audio device may use the audio detected at two opposite facing, front and rear omnidirectional microphones to determine the angular directional location of a user's voice while the device in speaker mode or audio command input mode. The angular directional location may be determined to be at front, side and rear locations of the device during the period of time by calculating an energy ratio of audio signals output by the front and rear microphones during the period. Comparing the ratio to experimental data for sound received from different directions around the device may provide the location of the user's voice. Based on the determination, audio beamforming input settings may be adjusted for user voice beamforming. As a result, the device can perform better beamforming to combine the signals captured by the microphones and generate a single output that isolates the user's voice from background noise.
摘要:
An appliance can include a microphone transducer, a processor, and a memory storing instructions. The appliance is configured to receive an audio signal at the microphone transducer and to detect an utterance in the audio signal. The appliance is further configured to classify a speech mode based on the utterance. The appliance is further configured to determine conditions of an environment of the appliance. The appliance is further configured to select at least one of a playback volume or a speech output mode from a plurality of speech output modes based on the classification, and the conditions of the environment of the appliance. The appliance is further configured to adapt the playback volume and/or mode of played-back speech according to the speech output mode. The appliance may be configured to synthesize speech according to the speech output mode, or to modify synthesized speech according to the speech output mode.