ADAPTIVE PANNER OF AUDIO OBJECTS
    35.
    Patent application

    Publication No.: US20210219083A1

    Publication date: 2021-07-15

    Application No.: US17149683

    Filing date: 2021-01-14

    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
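The three stages named in the abstract (initial gains, speaker selection, non-negative optimized gains) can be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the patent: the inverse-distance initial gains, the top-k selection rule, the centroid-matching cost, and the function names `initial_gains`, `select_speakers`, `optimize_gains`, and `pan` are all hypothetical choices made here.

```python
import math

def initial_gains(obj, speakers):
    """Assumed heuristic: inverse-distance gain for every speaker."""
    return [1.0 / (math.dist(obj, s) + 1e-9) for s in speakers]

def select_speakers(gains, k):
    """Indices of the k speakers with the largest initial gains."""
    ranked = sorted(range(len(gains)), key=lambda i: gains[i], reverse=True)
    return sorted(ranked[:k])

def optimize_gains(obj, positions, steps=500):
    """Projected gradient descent with g_i >= 0 on the assumed cost
    ||sum_i g_i * p_i - obj||^2 + (sum_i g_i - 1)^2."""
    k, dims = len(positions), len(obj)
    # Step size 1/L, where L bounds the largest eigenvalue of the Hessian.
    lr = 1.0 / (2.0 * (sum(c * c for p in positions for c in p) + k))
    g = [0.0] * k
    for _ in range(steps):
        centroid = [sum(g[i] * positions[i][d] for i in range(k))
                    for d in range(dims)]
        err = [centroid[d] - obj[d] for d in range(dims)]
        excess = sum(g) - 1.0
        for i in range(k):
            grad = 2.0 * sum(positions[i][d] * err[d] for d in range(dims))
            grad += 2.0 * excess
            g[i] = max(0.0, g[i] - lr * grad)  # projection keeps gains non-negative
    return g

def pan(obj, speakers, k=2):
    """Return one gain per speaker; unselected speakers get 0."""
    selected = select_speakers(initial_gains(obj, speakers), k)
    g = optimize_gains(obj, [speakers[i] for i in selected])
    norm = math.sqrt(sum(x * x for x in g)) or 1.0  # power normalization
    gains = [0.0] * len(speakers)
    for i, x in zip(selected, g):
        gains[i] = x / norm
    return gains

# Four speakers at the corners of a square room (hypothetical layout).
speakers = [(-1.0, 1.0), (1.0, 1.0), (-1.0, -1.0), (1.0, -1.0)]
gains = pan((0.0, 1.0), speakers)  # object midway between the two front speakers
```

With the object midway between the two front speakers, the sketch assigns them equal gains and leaves the rear speakers silent.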

    Separating audio sources
    38.
    Granted patent

    Publication No.: US10176826B2

    Publication date: 2019-01-08

    Application No.: US15549651

    Filing date: 2016-02-11

    Inventor: Jun Wang

    Abstract: Example embodiments disclosed herein relate to source separation in audio content. A method for separating sources from audio content is disclosed, the audio content being of a multi-channel format based on a plurality of channels. The method comprises performing a component analysis on the audio content for each of the plurality of channels to generate a plurality of components, each of the plurality of components comprising a plurality of time-frequency tiles in full frequency band; generating at least one dominant source with at least one of the time-frequency tiles from the plurality of the components and separating the sources from the audio content by estimating spatial parameters and spectral parameters based on the dominant source. Corresponding system and computer program product are also disclosed.

    Audio object extraction
    39.
    Granted patent

    Publication No.: US09786288B2

    Publication date: 2017-10-10

    Application No.: US15031887

    Filing date: 2014-11-25

    Abstract: Embodiments of the present invention relate to audio object extraction. A method for audio object extraction from audio content of a format based on a plurality of channels is disclosed. The method comprises applying audio object extraction on individual frames of the audio content at least partially based on frequency spectral similarities among the plurality of channels. The method further comprises performing audio object composition across the frames of the audio content, based on the audio object extraction on the individual frames, to generate a track of at least one audio object. Corresponding system and computer program product are also disclosed.
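As a rough illustration of the per-frame step, channels whose magnitude spectra look alike in a frame can be grouped as candidates for carrying the same audio object. The cosine-similarity measure, the 0.9 threshold, the greedy grouping, and the names `cosine_similarity` and `group_channels` are assumptions made for this sketch; the abstract does not specify them, and the cross-frame composition step is not shown.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two magnitude spectra (0.0 if either is silent)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0

def group_channels(spectra, threshold=0.9):
    """Greedily group the channels of one frame by spectral similarity.

    spectra: one magnitude spectrum (list of floats) per channel.
    Each channel joins the first group whose first member it resembles.
    """
    groups = []
    for ch, spec in enumerate(spectra):
        for group in groups:
            if cosine_similarity(spec, spectra[group[0]]) >= threshold:
                group.append(ch)
                break
        else:
            groups.append([ch])
    return groups

# One frame, three channels: channel 1 is a louder copy of channel 0.
frame = [
    [1.0, 0.0, 2.0, 0.0],
    [2.0, 0.0, 4.0, 0.0],
    [0.0, 3.0, 0.0, 1.0],
]
groups = group_channels(frame)
```

Here channels 0 and 1 fall into one group and channel 2 forms its own.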

    Audio Processing Method and Audio Processing Apparatus, and Training Method
    40.
    Patent application
    Legal status: In force

    Publication No.: US20140358265A1

    Publication date: 2014-12-04

    Application No.: US14282654

    Filing date: 2014-05-20

    Inventors: Jun Wang, Lie Lu

    Abstract: An audio processing method, an audio processing apparatus, and a training method are described. According to embodiments of the application, an accent identifier is used to identify accent frames from a plurality of audio frames, resulting in an accent sequence comprising probability scores of accent and/or non-accent decisions with respect to the plurality of audio frames. A tempo estimator is then used to estimate a tempo sequence of the plurality of audio frames based on the accent sequence. The embodiments adapt well to changes of tempo and can further be used to track beats properly.
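The second stage, estimating a tempo sequence from an accent sequence, can be illustrated with a windowed autocorrelation. This is a minimal sketch under assumptions and not the estimator described in the patent: the autocorrelation criterion, the 60 to 180 BPM search range, the non-overlapping windows, and the names `estimate_tempo` and `tempo_sequence` are choices made here for illustration.

```python
def estimate_tempo(accent, frame_rate, bpm_min=60.0, bpm_max=180.0):
    """Tempo in BPM from the lag that maximizes the accent autocorrelation."""
    lag_min = max(1, int(round(frame_rate * 60.0 / bpm_max)))
    lag_max = min(len(accent) - 1, int(round(frame_rate * 60.0 / bpm_min)))
    if lag_max < lag_min:
        raise ValueError("accent sequence too short for the BPM range")
    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation; this slightly favors shorter lags.
        score = sum(accent[i] * accent[i + lag]
                    for i in range(len(accent) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return 60.0 * frame_rate / best_lag

def tempo_sequence(accent, frame_rate, window, hop):
    """One tempo estimate per analysis window, so tempo changes are followed."""
    return [estimate_tempo(accent[start:start + window], frame_rate)
            for start in range(0, len(accent) - window + 1, hop)]

# Synthetic accent probabilities at 100 frames per second:
# an accent every 50 frames for 10 s, then every 40 frames for 10 s.
accent = [0.9 if i % 50 == 0 else 0.1 for i in range(1000)]
accent += [0.9 if i % 40 == 0 else 0.1 for i in range(1000)]
tempos = tempo_sequence(accent, frame_rate=100, window=500, hop=500)
```

On this synthetic input the sequence follows the tempo change, giving 120 BPM for the first two windows and 150 BPM for the last two.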

