COMBINED DYNAMIC PROCESSING AND SPEAKER PROTECTION FOR MINIMUM DISTORTION AUDIO PLAYBACK LOUDNESS ENHANCEMENT
    11.
    发明申请
    COMBINED DYNAMIC PROCESSING AND SPEAKER PROTECTION FOR MINIMUM DISTORTION AUDIO PLAYBACK LOUDNESS ENHANCEMENT 有权
    组合动态处理和扬声器保护最小失真音频播放LOUDNESS增强

    公开(公告)号:US20130329894A1

    公开(公告)日:2013-12-12

    申请号:US13802131

    申请日:2013-03-13

    Applicant: APPLE INC.

    CPC classification number: H03G11/00 H03G7/002 H03G7/007

    Abstract: Apparatuses, methods, computer readable mediums, and systems are described for combined dynamic processing and speaker protection for minimizing distortion in audio playback. In some embodiments, at least one compressed audio signal is received, at least one threshold for a speaker is retrieved, modifications to audio signal compression are determined based on the at least one compressed audio signal and the at least one threshold, information embodying the modifications is transmitted to a dynamic processor, and using the dynamic processor, at least one modified compressed audio signal is produced for the speaker based on the information.

    Abstract translation: 描述了组合动态处理和扬声器保护的装置,方法,计算机可读介质和系统,以最小化音频回放中的失真。 在一些实施例中,接收至少一个压缩音频信号,检索到扬声器的至少一个阈值,基于至少一个压缩音频信号和至少一个阈值来确定对音频信号压缩的修改,体现该修改的信息 被发送到动态处理器,并且使用动态处理器,基于该信息为扬声器产生至少一个修改的压缩音频信号。

    MULTI-MICROPHONE SPEECH RECOGNITION SYSTEMS AND RELATED TECHNIQUES

    公开(公告)号:US20180137864A1

    公开(公告)日:2018-05-17

    申请号:US15871836

    申请日:2018-01-15

    Applicant: Apple Inc.

    CPC classification number: G10L15/32 G10L15/20

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

    Apparatus and method for linear and nonlinear acoustic echo control using additional microphones collocated with a loudspeaker

    公开(公告)号:US09858944B1

    公开(公告)日:2018-01-02

    申请号:US15206110

    申请日:2016-07-08

    Applicant: Apple Inc.

    Abstract: Apparatus for linear and nonlinear acoustic echo control includes loudspeaker, first, second, and third microphone, beamformer, and first echo canceller. The loudspeaker outputs a loudspeaker signal that includes reference signal. The first microphone and the second microphone are collocated with the loudspeaker, receive at least one of: a near-end speaker signal from a near-end speaker and the loudspeaker signal, and generate first and second microphone uplink signals, respectively. The third microphone receives the near-end speaker signal and generates a third microphone uplink signal. The beamformer receives the first and second microphone uplink signals, directs a beam towards the loudspeaker and drives a null towards the near-end speaker, and generates a beamformer output. The first echo canceler receives the third microphone uplink signal and the beamformer output, and cancels echoes in the third microphone uplink signal based on the beamformer output to generate an echo cancelled signal. Other embodiments are described.

    Robust speech recognition in the presence of echo and noise using multiple signals for discrimination

    公开(公告)号:US09672821B2

    公开(公告)日:2017-06-06

    申请号:US14835588

    申请日:2015-08-25

    Applicant: APPLE INC.

    CPC classification number: G10L15/20 G10L15/16 G10L2021/02082

    Abstract: Systems and methods for speech recognition system having a speech processor that is trained to recognize speech by considering (1) a raw microphone signal that includes an echo signal and (2) different types of echo information signals from an echo cancellation system (and optionally different types of ambient noise suppression signals from a noise suppressor). The different types of echo information signals may include those used for echo cancelation and those having echo information. The speech recognition system may convert the raw microphone signal and different types of echo information signals (and optional noise suppression signals) into spectral features in the form of a vector, and a concatenator to combine the feature vectors into a total vector (for a period of time) that is used to train the speech processor, and during use of the speech processor to recognize speech.

    ROBUST SPEECH RECOGNITION IN THE PRESENCE OF ECHO AND NOISE USING MULTIPLE SIGNALS FOR DISCRIMINATION
    15.
    发明申请
    ROBUST SPEECH RECOGNITION IN THE PRESENCE OF ECHO AND NOISE USING MULTIPLE SIGNALS FOR DISCRIMINATION 有权
    使用多个信号进行歧视的ECHO和NOISE存在下的鲁棒语音识别

    公开(公告)号:US20160358602A1

    公开(公告)日:2016-12-08

    申请号:US14835588

    申请日:2015-08-25

    Applicant: APPLE INC.

    CPC classification number: G10L15/20 G10L15/16 G10L2021/02082

    Abstract: Systems and methods for speech recognition system having a speech processor that is trained to recognize speech by considering (1) a raw microphone signal that includes an echo signal and (2) different types of echo information signals from an echo cancellation system (and optionally different types of ambient noise suppression signals from a noise suppressor). The different types of echo information signals may include those used for echo cancelation and those having echo information. The speech recognition system may convert the raw microphone signal and different types of echo information signals (and optional noise suppression signals) into spectral features in the form of a vector, and a concatenator to combine the feature vectors into a total vector (for a period of time) that is used to train the speech processor, and during use of the speech processor to recognize speech.

    Abstract translation: 通过考虑(1)包含回波信号的原始麦克风信号和(2)来自回波消除系统的不同类型的回波信息信号(以及可选地不同的),语音识别系统的系统和方法具有经过训练以识别语音的语音处理器 来自噪声抑制器的环境噪声抑制信号的类型)。 不同类型的回波信息信号可以包括用于回波消除的信号和具有回波信息的信号。 语音识别系统可以将原始麦克风信号和不同类型的回波信息信号(和可选的噪声抑制信号)以矢量的形式转换为频谱特征,以及将特征向量组合成总矢量(一段时间)的级联器 的时间),用于训练语音处理器,并且在语音处理器的使用期间识别语音。

    SYSTEMS AND METHODS FOR ADJUSTING AUTOMATIC GAIN CONTROL

    公开(公告)号:US20160294343A1

    公开(公告)日:2016-10-06

    申请号:US15175970

    申请日:2016-06-07

    Applicant: Apple Inc.

    CPC classification number: H03G3/3005 H03G3/20 H03G3/3089

    Abstract: Automatic gain control systems disclosed herein can incorporate a confidence metric that can estimate the accuracy of gain adjustments calculated by an automatic gain control module. The confidence metric may be based on a percentage of valid audio samples in a given period of time. Based on the confidence metric, the AGC response may be reduced, delayed, frozen, or otherwise altered from the baseline gain adjustment. Time-averaging process may be used to estimate the input signal power level and determine an appropriate baseline gain adjustment. Additionally, weighting functions can be adjusted to prevent overestimation of the signal power.

    APPARATUS AND METHOD FOR IMPROVING AN AUDIO SIGNAL IN THE SPECTRAL DOMAIN
    18.
    发明申请
    APPARATUS AND METHOD FOR IMPROVING AN AUDIO SIGNAL IN THE SPECTRAL DOMAIN 有权
    改进频域中的音频信号的装置和方法

    公开(公告)号:US20150348562A1

    公开(公告)日:2015-12-03

    申请号:US14502863

    申请日:2014-09-30

    Applicant: Apple Inc.

    CPC classification number: G10L21/0264 G10L25/18

    Abstract: Method of improving audio signal in the spectral domain starts by receiving audio signal that includes signals from sources including speech source and music source. Audio signal is tuned for output by sound output device. Portions of audio signal are analyzed in a spectral domain to determine whether adjustments are required. Analyzing portions of audio signal includes determining whether anomaly is present in frequency band of audio signal in spectral domain by using at least one metric. Metrics include band energy ratios, spectral centroid, spectral tilt, spectral flux, spectral variance, absolute thresholds, and relative thresholds. Audio signal is adjusted to improve audio signal in spectral domain when audio signal is determined to require adjustments. Adjusting audio signal includes adjusting values of the metric in frequency band that is determined to include anomaly to correspond to clustering of metric values for audio signal in spectral domain. Other embodiments are also described.

    Abstract translation: 通过接收包括来自包括语音源和音乐源的信号的音频信号来开始改进频域中的音频信号的方法。 音频信号通过声音输出设备进行调谐。 在频谱域中分析音频信号的一部分以确定是否需要调整。 分析音频信号的部分包括通过使用至少一个度量来确定频谱域中的音频信号的频带中是否存在异常。 度量包括带能量比,光谱中心,光谱倾斜,光谱通量,光谱方差,绝对阈值和相对阈值。 当音频信号被确定为需要调整时,调节音频信号以改进频域中的音频信号。 调整音频信号包括调整频带中的度量的值,其被确定为包括异常以对应于频谱域中的音频信号的度量值的聚类。 还描述了其它实施例。

    Multi-microphone speech recognition systems and related techniques

    公开(公告)号:US10304462B2

    公开(公告)日:2019-05-28

    申请号:US15871836

    申请日:2018-01-15

    Applicant: Apple Inc.

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

Patent Agency Ranking