METHOD AND DEVICE FOR PROCESSING A BINAURAL RECORDING

    Publication number: US20230360662A1

    Publication date: 2023-11-09

    Application number: US18026281

    Filing date: 2021-09-15

    CPC classification number: G10L21/0208 H04S1/007 G10L2021/02166

    Abstract: The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprises extracting audio information from the first audio signal, computing band gains for reducing noise in the first audio signal, and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor to provide a first output audio signal. The dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation of the first audio signal.
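
    Illustrative sketch: the abstract does not say how the dynamic scaling factor modifies the band gains. The Python below assumes one plausible rule, a linear blend between unity gain (factor 0, no noise reduction) and the full band gain (factor 1). The function names and the blend rule are hypothetical and are not taken from the patent.

```python
def scaled_band_gains(band_gains, scale):
    """Blend each noise-reduction gain toward unity (no processing).

    scale = 0 leaves every band untouched; scale = 1 applies the full gain.
    ASSUMPTION: linear interpolation; the abstract does not specify the rule.
    """
    if not 0.0 <= scale <= 1.0:
        raise ValueError("scale must lie in [0, 1]")
    return [1.0 - scale * (1.0 - g) for g in band_gains]


def apply_band_gains(band_magnitudes, band_gains, scale):
    """Multiply each frequency-band magnitude by its scaled gain."""
    if len(band_magnitudes) != len(band_gains):
        raise ValueError("one gain is required per frequency band")
    gains = scaled_band_gains(band_gains, scale)
    return [m * g for m, g in zip(band_magnitudes, gains)]
```

    With this rule, a smaller factor trades noise suppression for fewer processing artifacts, which is one way to read "reduce quality degradation".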

    Headphone rendering metadata-preserving spatial coding

    Publication number: US12177647B2

    Publication date: 2024-12-24

    Application number: US18690133

    Filing date: 2022-09-08

    Abstract: Systems and methods for preserving headphone rendering mode (HRM) in object clustering are described. In an embodiment, an object-based audio data processing system includes a processor configured to receive a plurality of audio objects, wherein an audio object of the plurality of audio objects is associated with respective object metadata that indicates respective spatial position information and an HRM; determine a plurality of cluster positions by applying an extended hybrid distance metric to a spatial coding algorithm to calculate a partial loudness for each of the audio objects; render the audio objects to the cluster positions to form a plurality of clusters by applying the extended hybrid distance metric to the spatial coding algorithm to calculate object-to-cluster gains; and transmit the clusters to a spatial reproduction system.
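
    Illustrative sketch: the abstract names an "extended hybrid distance metric" without defining it. The Python below assumes a simple stand-in, Euclidean spatial distance plus a fixed penalty when the object's HRM differs from the cluster's, and derives power-normalized object-to-cluster gains from it. The penalty value, the dictionary layout, and the inverse-square weighting are all hypothetical.

```python
import math

HRM_PENALTY = 10.0  # ASSUMPTION: hypothetical cost for mixing different HRMs


def extended_distance(obj, cluster, hrm_penalty=HRM_PENALTY):
    """Spatial distance plus a penalty when the HRM labels differ."""
    spatial = math.dist(obj["pos"], cluster["pos"])
    mismatch = 0.0 if obj["hrm"] == cluster["hrm"] else hrm_penalty
    return spatial + mismatch


def object_to_cluster_gains(obj, clusters, eps=1e-9):
    """Inverse-square-distance weights, normalized so the squared gains sum to 1."""
    weights = [1.0 / (extended_distance(obj, c) + eps) ** 2 for c in clusters]
    norm = math.sqrt(sum(w * w for w in weights))
    return [w / norm for w in weights]
```

    Under these assumptions, an object is rendered mostly into clusters that share its HRM, even when a cluster with a different HRM is equally close in space.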

    HEADPHONE RENDERING METADATA-PRESERVING SPATIAL CODING

    Publication number: US20240334146A1

    Publication date: 2024-10-03

    Application number: US18690133

    Filing date: 2022-09-08

    CPC classification number: H04S7/302 H04S2400/11 H04S2420/01

    Abstract: Systems and methods for preserving headphone rendering mode (HRM) in object clustering are described. In an embodiment, an object-based audio data processing system includes a processor configured to receive a plurality of audio objects, wherein an audio object of the plurality of audio objects is associated with respective object metadata that indicates respective spatial position information and an HRM; determine a plurality of cluster positions by applying an extended hybrid distance metric to a spatial coding algorithm to calculate a partial loudness for each of the audio objects; render the audio objects to the cluster positions to form a plurality of clusters by applying the extended hybrid distance metric to the spatial coding algorithm to calculate object-to-cluster gains; and transmit the clusters to a spatial reproduction system.

    CLUSTERING AUDIO OBJECTS
    Published patent application

    Publication number: US20240187807A1

    Publication date: 2024-06-06

    Application number: US18547006

    Filing date: 2022-02-15

    Inventors: Ziyu Yang, Lie Lu

    CPC classification number: H04S7/30 H04S2400/11

    Abstract: A method for clustering audio objects may involve identifying a plurality of audio objects, wherein each audio object of the plurality of audio objects is associated with respective metadata that indicates respective spatial position information and respective rendering metadata. The method may involve assigning audio objects of the plurality of audio objects to categories of rendering metadata of a plurality of categories of rendering metadata, wherein at least one category of rendering metadata comprises a plurality of types of rendering metadata to be preserved. The method may involve determining an allocation of a plurality of audio object clusters to each category of rendering metadata. The method may involve rendering audio objects of the plurality of audio objects to an allocated plurality of audio object clusters based on the metadata that indicates spatial position information and based on the assignments of the audio objects to the categories of rendering metadata.
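
    Illustrative sketch: the abstract describes allocating audio object clusters to categories of rendering metadata but gives no allocation rule. The Python below assumes a simple one: every non-empty category gets one cluster, and the remaining clusters go, one at a time, to the category with the most objects per allocated cluster. The function name and the rule are hypothetical.

```python
from collections import Counter


def allocate_clusters(object_categories, total_clusters):
    """Split a cluster budget across rendering-metadata categories.

    object_categories: one category label per audio object.
    ASSUMPTION: count-proportional greedy allocation, not the patented rule.
    """
    counts = Counter(object_categories)
    if total_clusters < len(counts):
        raise ValueError("need at least one cluster per non-empty category")
    # One cluster per category first, so every category is preserved.
    allocation = {category: 1 for category in counts}
    # Give each remaining cluster to the most crowded category.
    for _ in range(total_clusters - len(counts)):
        neediest = max(counts, key=lambda c: counts[c] / allocation[c])
        allocation[neediest] += 1
    return allocation
```

    Objects would then be rendered only into the clusters allocated to their own category, using their spatial position metadata.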

    Steering of binauralization of audio

    Publication number: US11895479B2

    Publication date: 2024-02-06

    Application number: US17637446

    Filing date: 2020-08-19

    CPC classification number: H04S7/30 H04S2420/01

    Abstract: A method for steering binauralization of audio is provided. The method comprises the steps of: receiving (410) an audio input signal; calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal based on the confidence value, the state signal and an energy value of the audio frame; and generating (470) an audio output signal with steered binauralization by processing the audio input signal according to the steering signal.
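
    Illustrative sketch: the abstract lists the signals involved (confidence value, state signal, steering signal, frame energy) but not how they are computed. The Python below assumes a hysteresis rule for the state, a steering value in [0, 1] that ramps toward 0 when the input already appears binauralized, a ramp speed tied to how decisive the confidence is, and a hold on near-silent frames. All thresholds and function names are hypothetical.

```python
def update_state(prev_state, confidence, on_thresh=0.7, off_thresh=0.3):
    """Hysteresis: True (already binauralized) above on_thresh, False below off_thresh."""
    if confidence >= on_thresh:
        return True
    if confidence <= off_thresh:
        return False
    return prev_state  # ambiguous confidence: keep the previous state


def update_steering(prev_steer, state, confidence, energy, energy_floor=1e-4):
    """Ramp steering toward 0 (skip binauralization) or 1 (apply it)."""
    if energy < energy_floor:
        return prev_steer  # near-silent frame: hold the current value
    target = 0.0 if state else 1.0
    step = abs(confidence - 0.5)  # more decisive confidence, faster ramp
    if prev_steer < target:
        return min(target, prev_steer + step)
    return max(target, prev_steer - step)


def steer(confidences, energies):
    """Return one steering value per frame, starting from full binauralization."""
    state, steering, out = False, 1.0, []
    for confidence, energy in zip(confidences, energies):
        state = update_state(state, confidence)
        steering = update_steering(steering, state, confidence, energy)
        out.append(steering)
    return out
```

    An output stage could then crossfade between the unprocessed and binauralized signals using the per-frame steering value.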
