Abstract:
A system that includes an automatic volume leveler (AVL) that processes audio data based on audio category and desired volume level. The system may select different settings for audio data associated with different audio sources (e.g., content providers), audio categories (e.g., types of audio data, such as music, voice, etc.), genres, and/or the like. For example, the system may distinguish between music signals and voice signals (e.g., speech) and may apply a first gain curve for the music and a second gain curve for the speech. Additionally or alternatively, the system may distinguish between different genres of music and may apply different gain curves based on the genre. Further, the system may select a gain curve based on a desired volume level. Therefore, output audio data generated by performing AVL may be optimized based on the audio category, audio source, genre, desired volume level, and/or the like.
Abstract:
A system that performs automatic volume control and leveling (AVCL) based on audio category and desired volume level. The system may select different settings for audio data associated with different audio sources (e.g., content providers), audio categories (e.g., types of audio data, such as music, voice, etc.), genres, and/or the like. For example, the system may distinguish between music signals and voice signals (e.g., speech) and may apply a first gain curve for the music and a second gain curve for the speech. The gain curve may include an adaptive noise gate to reduce in-channel noise, such as by attenuating quiet sounds. The system may select the gain curve based on a desired volume level, and in some examples may modify the gain curve based on a signal-to-noise ratio (SNR) of the input audio and/or based on environmental noise.
Abstract:
Techniques are described for detecting and/or validating audible taps, double-taps, or other tapped audio sequences generated by a user. A predicted voice signal is subtracted from an audio signal to produce a prediction error signal. The prediction error signal is analyzed in terms of dynamics and periodicity to determine whether it contains an audio pulse. In certain cases, prediction error signals corresponding to multiple microphones are analyzed to determine position information regarding the individual taps of a double-tap. A double-tap is validated and acted upon when the individual taps originate from the same location and the taps occur within a given time period.
Abstract:
The systems, devices, and processes described herein may include a first microphone that detects a target voice of a user within an environment and a second microphone that detects other noise within the environment. A target voice estimate and/or a noise estimate may be generated based at least in part on one or more adaptive filters. Based at least in part on the voice estimate and/or the noise estimate, an enhanced target voice and an enhanced interference, respectively, may be determined. One or more words that correspond to the target voice may be determined based at least in part on the enhanced target voice and/or the enhanced interference. In some instances, the one or more words may be determined by suppressing or canceling the detected noise.
Abstract:
A limiter for an audio system prevents loud audio signals that exceed a threshold from being output. Output of the audio signals are delayed. When a loud signal exceeds the threshold, the gain applied to the delayed signals is gradually reduced so that by the time the loud signal reaches the output, the gain is at a level that reduces the loud audio signal to be within the threshold. Thereafter the gain is gradually restored to normal over a longer period of time than the audio signals are delayed.
Abstract:
An audio-based system may perform echo cancellation using first and second adaptive filters. An adaptation controller may be configured to control whether and/or how the filter coefficients of the first adaptive filter are updated, based on detected filter divergence, echo path changes, and/or presence of near-end user voice. The parameters of the second adaptive filter may be copied from the first adaptive filter under certain conditions that indicate whether copying the parameters would be likely to improve echo suppression.
Abstract:
The systems, devices, and processes described herein may generate a signal, such as an audio signal or an electrical signal, that may include echo and/or noise. The echo may be at least partially canceled based at least in part on acoustic echo cancellation or local echo cancellation, while the noise may be at least partially reduced based at least in part on noise reduction. A residual amount of echo and/or noise may then be determined. The residual echo and/or noise may be suppressed based at least in part on adaptive filtering of the signal, which may include modifying weights of an algorithm associated with an adaptive filter.
Abstract:
Features are disclosed for performing noise compensation so a level of noise may be audible in an output signal. For example, a first noise level of a first signal can be estimated. The first signal can be processed (e.g., by residual echo suppression) to determine a second signal, and a second noise level of the second signal can be estimated. Residual echo suppression can sometimes cause background noise to be eliminated, causing silence. If the second noise level is less than a product of the first noise level and a noise threshold, then a noise signal based on the first noise level can be determined, wherein the noise level of the noise signal is configured to be above a hearing threshold. The noise signal can be combined with the second signal to generate an output signal.
Abstract:
A system capable of directing audio output to a portion of a shared acoustic environment. For example, the system may divide the environment into two or more sound zones and may generate audio output directed to one or more sound zones. The system may distinguish between target sound zones and quiet sound zones and may determine a set of global filter coefficients with which to direct the audio output. The system may generate a first set of filter coefficients that increase audio volume in the target sound zones and a second set of filter coefficients that increase a ratio of audio volume between the target sound zones and the quiet sound zones. The system may generate the set of global filter coefficients using a combination of the first set and the second set. The system may also direct audio from multiple audio sources in different directions.