HEADPHONE RENDERING METADATA-PRESERVING SPATIAL CODING

    Publication No.: US20240334146A1

    Publication date: 2024-10-03

    Application No.: US18690133

    Filing date: 2022-09-08

    IPC classification: H04S7/00

    Abstract: Systems and methods for preserving headphone rendering mode (HRM) in object clustering are described. In an embodiment, an object-based audio data processing system includes a processor configured to: receive a plurality of audio objects, wherein an audio object of the plurality of audio objects is associated with respective object metadata that indicates respective spatial position information and an HRM; determine a plurality of cluster positions by applying an extended hybrid distance metric to a spatial coding algorithm to calculate a partial loudness for each of the audio objects; render the audio objects to the cluster positions to form a plurality of clusters by applying the extended hybrid distance metric to the spatial coding algorithm to calculate object-to-cluster gains; and transmit the clusters to a spatial reproduction system.

    Processing object-based audio signals

    Publication No.: US11470437B2

    Publication date: 2022-10-11

    Application No.: US16825776

    Filing date: 2020-03-20

    IPC classification: H04S7/00; H04S3/00; G10L19/008

    Abstract: An audio processing system and method that calculates, based on spatial metadata of each audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each submix indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
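
The signal flow in this abstract can be illustrated with a minimal sketch. It is not the patented method: the two-zone layout, the limiter-style submix processing, the panning-weighted average used for the object gain, and all function names are assumptions made here for illustration only.

```python
from typing import List


def submix_levels(object_levels: List[float], pan: List[List[float]]) -> List[float]:
    """Sum each object's panned contribution into the level of each zone's submix.

    pan[o][z] is the (assumed) panning coefficient of object o for zone z.
    """
    n_zones = len(pan[0])
    return [
        sum(object_levels[o] * pan[o][z] for o in range(len(object_levels)))
        for z in range(n_zones)
    ]


def submix_gains(levels: List[float], ceiling: float = 1.0) -> List[float]:
    """Hypothetical per-submix processing: attenuate only zones above the ceiling."""
    return [min(1.0, ceiling / lvl) if lvl > 0 else 1.0 for lvl in levels]


def object_gains(pan: List[List[float]], gains: List[float]) -> List[float]:
    """Object gain as a panning-weighted average of the submix gains (an assumption)."""
    result = []
    for row in pan:
        total = sum(row)
        if total > 0:
            result.append(sum(p * g for p, g in zip(row, gains)) / total)
        else:
            result.append(1.0)  # object is silent in every zone: leave it untouched
    return result
```

With two objects of level 1.0 panned fully to zone 0 and one object of level 0.5 panned fully to zone 1, zone 0 is attenuated by 0.5 and zone 1 is left alone, so only the first two objects receive a gain of 0.5.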

    ADAPTIVE LOUDNESS NORMALIZATION FOR AUDIO OBJECT CLUSTERING

    Publication No.: US20220159395A1

    Publication date: 2022-05-19

    Application No.: US17427665

    Filing date: 2020-02-12

    Inventors: Lianwu Chen, Lie Lu

    IPC classification: H04S7/00

    Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
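
One way to picture the compensation step is the sketch below. It assumes, purely for illustration, that the goal is to make the energy of the summed cluster signal match the sum of the individual element energies, and that a single gain is shared by all elements; the abstract does not state this rule, and the function names are hypothetical.

```python
import math
from typing import List, Sequence


def energy(samples: Sequence[float]) -> float:
    """Energy of a signal block: the sum of squared samples."""
    return sum(s * s for s in samples)


def compensation_gain(elements: List[Sequence[float]]) -> float:
    """Gain that restores the summed element energy after mixing (assumed target).

    Correlated elements add coherently, so the mixed cluster can be louder
    (or quieter) than the sum of its parts; the gain corrects for that.
    """
    contributions = [energy(x) for x in elements]      # per-element energy measure
    mixed = [sum(frame) for frame in zip(*elements)]   # cluster signal
    mixed_energy = energy(mixed)
    if mixed_energy == 0.0:
        return 1.0  # nothing to compensate
    return math.sqrt(sum(contributions) / mixed_energy)


def apply_gain(samples: Sequence[float], gain: float) -> List[float]:
    """Scale one element by the compensation gain."""
    return [gain * s for s in samples]
```

For two identical elements, the mixed signal carries twice the summed energy, so this sketch yields a gain of about 0.707 (the square root of 0.5).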

    Audio source separation with source direction determination based on iterative weighting

    Publication No.: US10930299B2

    Publication date: 2021-02-23

    Application No.: US15572067

    Filing date: 2016-05-12

    Inventors: Lie Lu, Mingqing Hu

    Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. A corresponding system and computer program product for separating audio sources in audio content are also disclosed.

    Audio object clustering with single channel quality preservation

    Publication No.: US10278000B2

    Publication date: 2019-04-30

    Application No.: US15375488

    Filing date: 2016-12-12

    IPC classification: H04S7/00

    Abstract: Example embodiments disclosed herein relate to audio object clustering with single channel quality preservation. A method of clustering audio objects is disclosed. The method includes determining cluster positions based on object positions of the audio objects and a reference speaker layout, the reference speaker layout indicating speakers located at different speaker positions. The method also includes determining object-to-cluster gains based on the determined cluster positions, the object positions and the reference speaker layout, an object-to-cluster gain defining a proportion of the respective audio object that is assigned to a cluster associated with one of the determined cluster positions. The method further includes clustering the audio objects based on the object-to-cluster gains and the cluster positions for generating cluster signals. A corresponding system, computer program product and device for clustering audio objects are also disclosed.
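
To make the notion of an object-to-cluster gain concrete, here is a minimal sketch. It deliberately ignores the reference speaker layout that the abstract relies on and instead uses inverse-distance weights normalized to unit power; that rule and the function name are assumptions for illustration, not the method claimed in the patent.

```python
import math
from typing import List, Sequence


def object_to_cluster_gains(
    obj_pos: Sequence[float],
    cluster_positions: List[Sequence[float]],
    eps: float = 1e-9,
) -> List[float]:
    """Distribute one object over the clusters with power-preserving gains.

    Closer clusters receive a larger share; the squared gains sum to 1 so
    the object's energy is preserved across the clusters.
    """
    dists = [math.dist(obj_pos, c) for c in cluster_positions]
    # An object sitting exactly on a cluster position goes entirely to it.
    for i, d in enumerate(dists):
        if d < eps:
            return [1.0 if j == i else 0.0 for j in range(len(dists))]
    weights = [1.0 / d for d in dists]
    norm = math.sqrt(sum(w * w for w in weights))
    return [w / norm for w in weights]
```

An object midway between two clusters is split equally between them, while an object that coincides with a cluster position is assigned to that cluster alone.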

    Audio object extraction
    Type: Granted invention patent

    Publication No.: US09786288B2

    Publication date: 2017-10-10

    Application No.: US15031887

    Filing date: 2014-11-25

    Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. A corresponding system and computer program product are also disclosed.

    Audio Processing Method and Audio Processing Apparatus, and Training Method
    Type: Invention application
    Legal status: In force

    Publication No.: US20140358265A1

    Publication date: 2014-12-04

    Application No.: US14282654

    Filing date: 2014-05-20

    Inventors: Jun Wang, Lie Lu

    IPC classification: G06F3/16

    Abstract: An audio processing method, an audio processing apparatus, and a training method are described. According to embodiments of the application, an accent identifier is used to identify accent frames from a plurality of audio frames, resulting in an accent sequence comprising probability scores of accent and/or non-accent decisions with respect to the plurality of audio frames. A tempo estimator is then used to estimate a tempo sequence of the plurality of audio frames based on the accent sequence. The embodiments adapt well to changes of tempo and can further be used to track beats properly.
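
The second stage described above, going from an accent sequence to a tempo, can be sketched as follows. This is not the estimator from the application: it assumes a plain autocorrelation peak search over one analysis window and returns a single tempo rather than a time-varying tempo sequence. The function name, the BPM search range, and the frame rate parameter are illustrative assumptions.

```python
from typing import Sequence


def estimate_tempo_bpm(
    accent: Sequence[float],
    frame_rate_hz: float,
    min_bpm: float = 60.0,
    max_bpm: float = 200.0,
) -> float:
    """Estimate one tempo from per-frame accent probabilities via autocorrelation.

    The lag (in frames) at which the accent sequence best matches a shifted
    copy of itself is taken as the beat period.
    """
    min_lag = max(1, int(round(frame_rate_hz * 60.0 / max_bpm)))
    max_lag = min(len(accent) - 1, int(round(frame_rate_hz * 60.0 / min_bpm)))
    best_lag = None
    best_score = float("-inf")
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation; this slightly favors shorter lags.
        score = sum(accent[i] * accent[i + lag] for i in range(len(accent) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    if best_lag is None:
        raise ValueError("accent sequence too short for the requested BPM range")
    return 60.0 * frame_rate_hz / best_lag
```

Running the sketch over successive windows of the accent sequence would give a crude tempo sequence; how the application actually tracks tempo changes is not specified in the abstract.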
