Abstract:
The disclosure includes a voice isolation system comprising an acoustic echo-cancelation subsystem configured to receive a plurality of input signals, subtract an interference component from the input signals, and provide a plurality of output signals. The system also includes an adaptive beamformer subsystem configured to receive the plurality of output signals from the acoustic echo-cancelation subsystem and compute a signal-to-noise ratio (SNR) enhanced signal based on the received output signals. The system also includes a residual noise suppressor subsystem configured to attenuate at least one portion of the SNR enhanced signal received from the adaptive beamformer subsystem based on the at least one portion having an SNR below a predetermined SNR threshold. The system also includes an automatic gain control subsystem configured to process a signal output from the residual noise suppressor subsystem and transmit a resulting signal as an output signal.
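The four-stage chain described in the abstract above (echo cancelation, beamforming, residual noise suppression, automatic gain control) can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the disclosed implementation: the function names, the fixed `echo_gain`, the frame length, the attenuation factor, and the use of a plain channel average in place of an adaptive beamformer are all hypothetical choices made to keep the example short.

```python
import math


def echo_cancel(channels, reference, echo_gain=0.5):
    # Assumption: the interference is a scaled copy of a known reference signal.
    return [[x - echo_gain * r for x, r in zip(ch, reference)] for ch in channels]


def beamform(channels):
    # Stand-in for the adaptive beamformer: a plain average across channels.
    # A real adaptive beamformer would update per-channel weights over time.
    n = len(channels)
    return [sum(samples) / n for samples in zip(*channels)]


def suppress_residual(signal, noise_power, snr_threshold_db=6.0,
                      attenuation=0.1, frame=4):
    # Attenuate each frame whose estimated SNR falls below the threshold.
    out = []
    for start in range(0, len(signal), frame):
        block = signal[start:start + frame]
        power = sum(s * s for s in block) / len(block)
        snr_db = 10 * math.log10(max(power, 1e-12) / noise_power)
        scale = attenuation if snr_db < snr_threshold_db else 1.0
        out.extend(s * scale for s in block)
    return out


def auto_gain(signal, target_peak=1.0):
    # Simple peak normalisation as a stand-in for automatic gain control.
    peak = max(abs(s) for s in signal)
    return list(signal) if peak == 0 else [s * target_peak / peak for s in signal]


def isolate_voice(channels, reference, noise_power):
    cleaned = echo_cancel(channels, reference)
    enhanced = beamform(cleaned)
    gated = suppress_residual(enhanced, noise_power)
    return auto_gain(gated)


# Two identical channels: a loud frame followed by a quiet frame.
mic = [1.0, 1.0, 1.0, 1.0, 0.01, 0.01, 0.01, 0.01]
result = isolate_voice([mic, list(mic)], reference=[0.0] * 8, noise_power=0.01)
```

In this toy input the first frame sits well above the assumed noise power and passes unchanged, while the second frame falls below the threshold and is attenuated.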
Abstract:
A system and method for syntactic re-ranking of possible transcriptions generated by automatic speech recognition are disclosed. A computer system accesses acoustic data for recorded spoken language and generates a plurality of potential transcriptions for the acoustic data. The computer system scores the plurality of potential transcriptions to create an initial likelihood score for each of the plurality of potential transcriptions. For a particular potential transcription in the plurality of potential transcriptions, the computer system generates a syntactic likelihood score. The computer system creates an adjusted score for the particular potential transcription by combining the initial likelihood score and the syntactic likelihood score for the particular potential transcription.
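The score combination described in the abstract above might look like the sketch below. The abstract does not say how the two scores are combined, so the linear interpolation, the `weight` parameter, the `Candidate` type, and the toy syntactic scorer are all assumptions introduced purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    text: str
    initial_score: float  # assumed to be a log-likelihood from the recognizer


def rerank(candidates, syntactic_score, weight=0.5):
    # Assumed combination rule: linear interpolation of the two scores.
    adjusted = [
        (c.text, (1 - weight) * c.initial_score + weight * syntactic_score(c.text))
        for c in candidates
    ]
    # Highest adjusted score first.
    return sorted(adjusted, key=lambda pair: pair[1], reverse=True)


def toy_syntactic_score(text):
    # Hypothetical stand-in for a parser-based score:
    # penalise each immediately repeated word.
    words = text.split()
    repeats = sum(1 for a, b in zip(words, words[1:]) if a == b)
    return -float(repeats)


candidates = [
    Candidate("recognize speech speech", -1.0),
    Candidate("recognize speech", -1.2),
]
ranked = rerank(candidates, toy_syntactic_score)
```

Here the candidate with the better acoustic score drops to second place once the syntactic penalty for the repeated word is mixed in.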
Abstract:
Techniques related to speaker recognition are discussed. Such techniques may include determining an adaptive speaker recognition threshold based on a speech-to-noise ratio and a noise type label corresponding to received audio, and performing speaker recognition based on the adaptive speaker recognition threshold and a speaker recognition score corresponding to the received audio.
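A minimal sketch of an adaptive threshold of the kind described in the abstract above follows. The abstract gives no formula, so everything concrete here is an assumption: the per-noise-type base values, the linear dependence on the speech-to-noise ratio, the direction of the adjustment, and the names `BASE_THRESHOLDS`, `adaptive_threshold`, and `accept_speaker`.

```python
# Hypothetical base thresholds per noise type label.
BASE_THRESHOLDS = {"quiet": 0.4, "car": 0.5, "babble": 0.6}
DEFAULT_THRESHOLD = 0.5


def adaptive_threshold(snr_db, noise_type, slope=0.01, ref_snr_db=20.0):
    # Assumed rule: start from a per-noise-type base value and shift it
    # linearly with the distance of the measured ratio from a reference.
    base = BASE_THRESHOLDS.get(noise_type, DEFAULT_THRESHOLD)
    return base + slope * (snr_db - ref_snr_db)


def accept_speaker(score, snr_db, noise_type):
    # Accept when the recognition score meets the adaptive threshold.
    return score >= adaptive_threshold(snr_db, noise_type)


same_score_noisy = accept_speaker(0.35, snr_db=10.0, noise_type="quiet")
same_score_clean = accept_speaker(0.35, snr_db=20.0, noise_type="quiet")
```

With these assumed values, the same score is accepted at the lower ratio and rejected at the reference ratio, which shows how the decision depends on the acoustic conditions rather than on the score alone.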
Abstract:
A telecommunication device comprises an audio signal transmission unit configured to receive an audio signal and transmit it to a further telecommunication device. The telecommunication device further comprises a signaling unit configured to output a signal when there is reason to believe that the audio signal is acoustically intelligible to third parties or constitutes a disturbance to third parties. A further telecommunication device comprises an audio signal receiving unit configured to receive an audio signal from a further telecommunication device and output it acoustically, as well as a signaling unit configured to output a signal when there is reason to believe that the output audio signal is acoustically intelligible to third parties or constitutes a disturbance. The telecommunication devices can be interconnected in a system. Corresponding operating methods and a computer program are also described.
Abstract:
Systems, apparatus and methods may provide for audio processing of received user audio input from a microphone that may optionally be a tissue conducting microphone. Audio processing may be further conducted on received ambient audio from one or more additional microphones. A translator may translate the ambient audio into content to be output to a user. In an embodiment, ambient audio is translated into visual content to be displayed on a virtual reality device.
Abstract:
Systems and techniques for automatic tuning of speech recognition parameters are described herein. A clean audio segment and a dirty audio segment may be obtained. In an iterative fashion, optimized preprocessing parameters may be obtained by, at each iteration, selecting a set of parameters, preprocessing the clean audio segment with the set of parameters to produce a first result, preprocessing the dirty audio segment with the set of parameters to produce a second result, and scoring a portion of the first result against a corresponding portion of the second result using clean-diff. When an optimization threshold is reached, the iterative process exits and the set of parameters from the last iteration is provided.
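The iterative loop in the abstract above can be sketched roughly as below. The abstract does not define the preprocessing step, the parameter selection strategy, or the clean-diff score, so this sketch assumes a toy gain-plus-noise-gate preprocessor, random parameter selection, and a mean absolute difference as the score. All names (`preprocess`, `diff_score`, `tune`) are hypothetical.

```python
import random


def preprocess(samples, gain, floor):
    # Hypothetical preprocessing: apply a gain, then gate out small values.
    scaled = [s * gain for s in samples]
    return [0.0 if abs(s) < floor else s for s in scaled]


def diff_score(first, second):
    # Assumed stand-in for the clean-diff score: mean absolute difference.
    return sum(abs(a - b) for a, b in zip(first, second)) / len(first)


def tune(clean, dirty, threshold, max_iters=200, seed=0):
    rng = random.Random(seed)
    params, score = None, float("inf")
    for _ in range(max_iters):
        # Select a candidate parameter set (random search is an assumption).
        params = {"gain": rng.uniform(0.5, 2.0), "floor": rng.uniform(0.0, 0.3)}
        first = preprocess(clean, **params)
        second = preprocess(dirty, **params)
        score = diff_score(first, second)
        if score <= threshold:
            break  # optimization threshold reached
    # Parameters from the last iteration, as in the abstract.
    return params, score


clean = [0.0, 0.5, -0.5, 0.0]
dirty = [0.05, 0.55, -0.45, -0.05]
best_params, best_score = tune(clean, dirty, threshold=0.05)
```

A lower score means the preprocessed dirty segment is closer to the preprocessed clean one, so the loop stops as soon as a parameter set brings the two within the chosen threshold.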
Abstract:
Examples described herein include systems, methods, and devices for transmitting a media signal to a remote sensor, receiving a sound signal from the remote sensor, and monitoring the sound signal and the media signal to recognize voice commands.