Dialog enhancement using adaptive smoothing which depends exponentially on a smoothing factor

    公开(公告)号:US12272376B2

    公开(公告)日:2025-04-08

    申请号:US17638839

    申请日:2020-08-26

    Inventor: Xuemei Yu

    Abstract: A method of enhancing dialog intelligibility in an audio signal, comprising determining a speech confidence score that the audio content includes speech content, determining a music confidence score that the audio content includes music correlated content, in response to the speech confidence score, and applying a user selected gain of selected frequency bands of the audio signal to obtain a dialogue enhanced audio signal. The user selected gain is smoothed by an adaptive smoothing algorithm, an impact of past frames in said smoothing algorithm being determined by a smoothing factor, the smoothing factor being calculated in response to the music confidence score, and having a relatively higher value for content having a relatively higher music confidence score and a relatively lower value for speech content having a relatively lower music confidence score, so as to increase the impact of past frames on the dialogue enhancement of music correlated content.

    Steering of binauralization of audio

    公开(公告)号:US11895479B2

    公开(公告)日:2024-02-06

    申请号:US17637446

    申请日:2020-08-19

    CPC classification number: H04S7/30 H04S2420/01

    Abstract: A method for steering binauralization of audio is provided. The method comprises steps of: receiving (410) an audio input signal, calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal, based on the first confidence value, the state signal and an energy value of the audio frame; and generating (470) an audio output signal with steered binauralization by processing the audio input signal according to the steering signal.

Patent Agency Ranking