PROCESSING OBJECT-BASED AUDIO SIGNALS
    31.
    Invention application

    Publication (announcement) number: US20180152803A1

    Publication (announcement) date: 2018-05-31

    Application number: US15577510

    Application date: 2016-05-26

    CPC classification number: H04S7/302 G10L19/008 H04S3/008 H04S7/30 H04S2400/11

    Abstract: An audio processing system and method that calculates, based on the spatial metadata of each audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each submix indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for that audio object and the submix gains in relation to each of the predefined channel coverage zones.
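The flow in this abstract (panning coefficients, then submixes, then submix gains, then object gains) can be sketched as follows. This is a minimal illustration, not the patented method: the peak limiter used as the "audio processing" step and the panning-weighted average used to derive object gains are assumptions chosen for simplicity, and all function names are hypothetical.

```python
def make_submixes(objects, panning):
    """Sum each object's samples into every zone, scaled by its panning coefficient.

    objects: list of per-object sample lists (equal length).
    panning: panning[i][z] is the coefficient of object i for zone z.
    """
    n_zones = len(panning[0])
    n_samples = len(objects[0])
    mixes = [[0.0] * n_samples for _ in range(n_zones)]
    for samples, coeffs in zip(objects, panning):
        for zone, coeff in enumerate(coeffs):
            for t, sample in enumerate(samples):
                mixes[zone][t] += coeff * sample
    return mixes


def limiter_gains(mixes, threshold=1.0):
    """Assumed processing step: one static peak-limiter gain per submix."""
    gains = []
    for mix in mixes:
        peak = max(abs(s) for s in mix)
        gains.append(1.0 if peak <= threshold else threshold / peak)
    return gains


def object_gains(panning, submix_gains):
    """Assumed mapping: panning-weighted average of the submix gains."""
    gains = []
    for coeffs in panning:
        total = sum(coeffs)
        if total == 0:
            gains.append(1.0)  # object feeds no zone, leave it unchanged
        else:
            weighted = sum(c * g for c, g in zip(coeffs, submix_gains))
            gains.append(weighted / total)
    return gains


objects = [[1.0, 0.5], [0.2, 0.4]]
panning = [[1.0, 0.0], [0.5, 0.5]]
mixes = make_submixes(objects, panning)
per_object = object_gains(panning, limiter_gains(mixes))
```

Here the first object feeds only the first zone, so it inherits that zone's limiter gain, while the second object, split evenly across both zones, receives the average of the two submix gains.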

    AUDIO SOURCE SEPARATION WITH SOURCE DIRECTION DETERMINATION BASED ON ITERATIVE WEIGHTING

    Publication (announcement) number: US20180144759A1

    Publication (announcement) date: 2018-05-24

    Application number: US15572067

    Application date: 2016-05-12

    Inventor: Lie LU Mingqing HU

    Abstract: Example embodiments disclosed herein relate to audio source separation with source direction determined based on iterative weighted component analysis. A method of separating audio sources in audio content is disclosed. The audio content includes a plurality of channels. The method includes obtaining multiple data samples from multiple time-frequency tiles of the audio content. The method also includes analyzing the data samples to generate multiple components in a plurality of iterations, wherein each of the components indicates a direction with a variance of the data samples, and wherein in each of the plurality of iterations, each of the data samples is weighted with a weight that is determined based on a selected component from the multiple components. The method further includes determining a source direction of the audio content based on the selected component for separating an audio source from the audio content. Corresponding system and computer program product of separating audio sources in audio content are also disclosed.
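The iterative weighting idea can be illustrated for two-channel content. The sketch below assumes real-valued stereo samples, an uncentered second-moment matrix whose principal axis is found in closed form, and a weight equal to the fourth power of the cosine between each sample and the currently selected component. Those choices and the function names are illustrative assumptions, not details taken from the patent.

```python
import math


def principal_direction(samples, weights):
    """Principal axis of the weighted 2x2 second-moment matrix, as a unit vector."""
    total = sum(weights)
    if total == 0:
        raise ValueError("all sample weights are zero")
    cxx = sum(w * x * x for (x, _), w in zip(samples, weights)) / total
    cyy = sum(w * y * y for (_, y), w in zip(samples, weights)) / total
    cxy = sum(w * x * y for (x, y), w in zip(samples, weights)) / total
    theta = 0.5 * math.atan2(2.0 * cxy, cxx - cyy)
    return (math.cos(theta), math.sin(theta))


def estimate_source_direction(samples, iterations=10):
    """Re-estimate the dominant direction, re-weighting the samples each pass."""
    weights = [1.0] * len(samples)
    direction = principal_direction(samples, weights)
    for _ in range(iterations):
        weights = []
        for x, y in samples:
            norm = math.hypot(x, y)
            if norm == 0:
                weights.append(0.0)
                continue
            cosine = (x * direction[0] + y * direction[1]) / norm
            weights.append(cosine ** 4)  # assumed weighting rule
        direction = principal_direction(samples, weights)
    return direction


# Four samples on the 45-degree line plus one off-axis sample.
samples = [(1.0, 1.0), (2.0, 2.0), (-1.5, -1.5), (0.5, 0.5), (1.0, -0.2)]
dx, dy = estimate_source_direction(samples)
angle_deg = math.degrees(math.atan2(dy, dx))
```

Samples that align poorly with the selected component receive small weights, so the off-axis sample pulls the estimate less with each pass than it does in an unweighted analysis.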

    PROCESSING OBJECT-BASED AUDIO SIGNALS
    35.
    Invention publication

    Publication (announcement) number: US20240205629A1

    Publication (announcement) date: 2024-06-20

    Application number: US18391426

    Application date: 2023-12-20

    CPC classification number: H04S7/302 H04S3/008 H04S7/30 G10L19/008 H04S2400/11

    Abstract: An audio processing system and method that calculates, based on the spatial metadata of each audio object, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each submix indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for that audio object and the submix gains in relation to each of the predefined channel coverage zones.

    METADATA-PRESERVED AUDIO OBJECT CLUSTERING

    Publication (announcement) number: US20220272474A1

    Publication (announcement) date: 2022-08-25

    Application number: US17737184

    Application date: 2022-05-05

    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least one category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. Corresponding system and computer program product are also disclosed.

    A DIALOG DETECTOR
    37.
    Invention application

    Publication (announcement) number: US20220199074A1

    Publication (announcement) date: 2022-06-23

    Application number: US17604379

    Application date: 2020-04-13

    Inventor: Lie LU Xin LIU

    Abstract: The present application relates to a method of extracting audio features in a dialog detector in response to an input audio signal, the method comprising dividing the input audio signal into a plurality of frames, extracting frame audio features from each frame, determining a set of context windows, each context window including a number of frames surrounding a current frame, deriving, for each context window, a relevant context audio feature for the current frame based on the frame audio features of the frames in each respective context window, and concatenating each context audio feature to form a combined feature vector to represent the current frame. Using context windows of different lengths can improve both response speed and robustness.
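The multi-window feature extraction described above can be sketched in a few lines. This is an illustration under stated assumptions: the per-frame feature is mean energy, each context feature is the mean and population standard deviation over a window centered on the current frame, and the window lengths are arbitrary. None of these specifics come from the patent, and the function names are hypothetical.

```python
import statistics


def frame_energies(signal, frame_size):
    """Split the signal into non-overlapping frames and return mean energy per frame."""
    return [
        sum(s * s for s in signal[i:i + frame_size]) / frame_size
        for i in range(0, len(signal) - frame_size + 1, frame_size)
    ]


def context_features(frame_features, index, window_lengths=(3, 7, 15)):
    """Concatenate (mean, std) of the frame features over several context windows."""
    combined = []
    for length in window_lengths:
        half = length // 2
        start = max(0, index - half)
        stop = min(len(frame_features), index + half + 1)
        window = frame_features[start:stop]
        combined.append(statistics.fmean(window))
        combined.append(statistics.pstdev(window))
    return combined


signal = [1.0] * 4 + [0.0] * 4 + [2.0] * 4
energies = frame_energies(signal, frame_size=4)
vector = context_features(energies, index=1, window_lengths=(1, 3))
```

Short windows react quickly to a change at the current frame, while long windows smooth over brief fluctuations. Concatenating both gives a downstream classifier access to each view.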

    AUDIO SOURCE SEPARATION
    38.
    Invention application

    Publication (announcement) number: US20190392848A1

    Publication (announcement) date: 2019-09-05

    Application number: US16561836

    Application date: 2019-09-05

    Abstract: The present document describes a method for extracting J audio sources from I audio channels. The method includes updating a Wiener filter matrix based on a mixing matrix from a source matrix and based on a power matrix of the J audio sources. Furthermore, the method includes updating a cross-covariance matrix of the I audio channels and of the J audio sources and an auto-covariance matrix of the J audio sources, based on the updated Wiener filter matrix and based on an auto-covariance matrix of the I audio channels. In addition, the method includes updating the mixing matrix and the power matrix based on the updated cross-covariance matrix of the I audio channels and of the J audio sources, and/or based on the updated auto-covariance matrix of the J audio sources.
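One common way to write an update cycle of this kind is shown below as a worked set of equations. The notation is assumed for illustration and is not taken from the patent: $A$ is the $I \times J$ mixing matrix, $\Sigma_S$ the diagonal power matrix of the sources, $\Sigma_B$ an optional noise covariance (which the abstract does not mention), $R_{XX}$ the auto-covariance of the channels, and $\Omega$ the Wiener filter matrix.

```latex
\begin{aligned}
\Omega &= \Sigma_S A^{H}\left(A\,\Sigma_S A^{H} + \Sigma_B\right)^{-1}
  && \text{(Wiener filter update)} \\
R_{XS} &= R_{XX}\,\Omega^{H}
  && \text{(cross-covariance of channels and sources)} \\
R_{SS} &= \Omega\,R_{XX}\,\Omega^{H}
  && \text{(auto-covariance of sources)} \\
A &\leftarrow R_{XS}\,R_{SS}^{-1}
  && \text{(mixing matrix update)} \\
\Sigma_S &\leftarrow \operatorname{diag}\!\left(R_{SS}\right)
  && \text{(power matrix update)}
\end{aligned}
```

Each pass uses the current $A$ and $\Sigma_S$ to form $\Omega$, then uses $\Omega$ and $R_{XX}$ to refresh the covariances from which $A$ and $\Sigma_S$ are re-estimated.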
