Processing Spatially Diffuse or Large Audio Objects
    31.
    Invention application (granted)

    Publication number: US20160192105A1

    Publication date: 2016-06-30

    Application number: US14909058

    Application date: 2014-07-24

    Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

    VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD
    32.
    Invention application (granted)

    Publication number: US20160049915A1

    Publication date: 2016-02-18

    Application number: US14777271

    Application date: 2014-03-17

    Abstract: A volume leveler controller and controlling method are disclosed. In one embodiment, a volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time, and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may be configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

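    Illustrative sketch (not taken from the patent): one simple way to make a gain scale rise with the confidence of informative content and fall with the confidence of interfering content, with smoothing so the adjustment stays continuous. The content-type names, the product formula, and the smoothing constant `alpha` are all assumptions made for illustration.

```python
def leveler_gain_scale(confidences, informative, interfering):
    """Return a scale in [0, 1] for the leveler's dynamic gain.

    confidences maps a content type to a classifier confidence in [0, 1].
    The product form below is an assumed formula, not the patented one.
    """
    inf = max((confidences.get(t, 0.0) for t in informative), default=0.0)
    intf = max((confidences.get(t, 0.0) for t in interfering), default=0.0)
    # Grows with informative confidence, shrinks with interfering confidence.
    return inf * (1.0 - intf)


def smooth(previous, target, alpha=0.1):
    """One-pole smoothing so the scale changes gradually between frames."""
    return previous + alpha * (target - previous)


# Hypothetical content-type labels, chosen only for this example.
INFORMATIVE = ("speech", "music")
INTERFERING = ("noise",)

scale = 0.0
for frame_conf in ({"speech": 0.9, "noise": 0.1}, {"speech": 0.2, "noise": 0.8}):
    target = leveler_gain_scale(frame_conf, INFORMATIVE, INTERFERING)
    scale = smooth(scale, target)
```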

    Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria
    33.
    Invention application (granted)

    Publication number: US20150332680A1

    Publication date: 2015-11-19

    Application number: US14654460

    Application date: 2013-11-25

    Abstract: Embodiments are directed to a method of rendering object-based audio comprising determining an initial spatial position of objects having object audio data and associated metadata, determining a perceptual importance of the objects, and grouping the audio objects into a number of clusters based on the determined perceptual importance of the objects, such that a spatial error caused by moving an object from an initial spatial position to a second spatial position in a cluster is minimized for objects with a relatively high perceptual importance. The perceptual importance is based at least in part on a partial loudness of an object and content semantics of the object.

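    Illustrative sketch (not the patented method): a deliberately simplified clustering in which the most important objects become the cluster positions, so they incur zero spatial error, and every other object is moved to the nearest of those positions. The importance scores are assumed to be given as input; the abstract derives them from partial loudness and content semantics, which this sketch does not model. The function name and tuple layout are hypothetical.

```python
import math


def cluster_by_importance(objects, num_clusters):
    """Cluster audio objects, favouring perceptually important ones.

    objects: list of (name, (x, y, z), importance) tuples.
    Returns (centroids, assignment), where assignment maps each object
    name to the index of the centroid it is rendered at.
    """
    # The most important objects keep their own positions as centroids.
    ranked = sorted(objects, key=lambda o: o[2], reverse=True)
    centroids = [pos for _, pos, _ in ranked[:num_clusters]]

    assignment = {}
    for name, pos, _ in objects:
        # Each object goes to the spatially nearest centroid.
        assignment[name] = min(
            range(len(centroids)),
            key=lambda k: math.dist(pos, centroids[k]),
        )
    return centroids, assignment


objects = [
    ("dialog", (0.0, 0.0, 0.0), 0.9),
    ("effect", (1.0, 0.0, 0.0), 0.8),
    ("ambience", (0.9, 0.1, 0.0), 0.1),
]
centroids, assignment = cluster_by_importance(objects, num_clusters=2)
```

    Here the two high-importance objects stay in place and only the low-importance object is relocated.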

    PROCESSING OBJECT-BASED AUDIO SIGNALS
    34.
    Invention publication

    Publication number: US20240205629A1

    Publication date: 2024-06-20

    Application number: US18391426

    Application date: 2023-12-20

    CPC classification number: H04S7/302 H04S3/008 H04S7/30 G10L19/008 H04S2400/11

    Abstract: An audio processing system and method calculates, based on spatial metadata of the audio objects, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each submix indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each submix, and an object gain applied to each of the audio objects is controlled. The object gain is a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
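    Illustrative sketch (not from the patent): submixes formed as panning-weighted sums of the objects, and an object gain computed from the panning coefficients and the per-zone submix gains. The abstract only says the object gain is "a function of" those quantities; the panning-weighted average used here is an assumption, as are the function names.

```python
def submixes(objects, panning):
    """Sum each object's contribution into every channel coverage zone.

    objects: list of per-object sample lists of equal length.
    panning[i][z]: panning coefficient of object i for zone z.
    Returns one sample list per zone.
    """
    num_zones = len(panning[0])
    num_samples = len(objects[0])
    return [
        [
            sum(panning[i][z] * objects[i][n] for i in range(len(objects)))
            for n in range(num_samples)
        ]
        for z in range(num_zones)
    ]


def object_gains(panning, submix_gains):
    """Assumed combination: panning-weighted average of the submix gains."""
    gains = []
    for coeffs in panning:
        total = sum(coeffs)
        if total == 0:
            gains.append(1.0)  # Object absent from every zone: leave unchanged.
        else:
            weighted = sum(c * g for c, g in zip(coeffs, submix_gains))
            gains.append(weighted / total)
    return gains


panning = [[1.0, 0.0], [0.5, 0.5]]   # two objects, two zones
objects = [[1.0, 2.0], [2.0, 4.0]]   # two samples per object
zone_signals = submixes(objects, panning)
gains = object_gains(panning, submix_gains=[0.5, 1.0])
```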

    METADATA-PRESERVED AUDIO OBJECT CLUSTERING

    Publication number: US20220272474A1

    Publication date: 2022-08-25

    Application number: US17737184

    Application date: 2022-05-05

    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying an audio object into at least a category based on rendering mode information metadata. The method further comprises assigning a predetermined number of clusters to the categories and rendering the audio object based on the rendering mode. A corresponding system and computer program product are also disclosed.

    A DIALOG DETECTOR
    36.
    Invention application

    Publication number: US20220199074A1

    Publication date: 2022-06-23

    Application number: US17604379

    Application date: 2020-04-13

    Inventor: Lie LU Xin LIU

    Abstract: The present application relates to a method of extracting audio features in a dialog detector in response to an input audio signal, the method comprising dividing the input audio signal into a plurality of frames, extracting frame audio features from each frame, determining a set of context windows, each context window including a number of frames surrounding a current frame, deriving, for each context window, a relevant context audio feature for the current frame based on the frame audio features of the frames in the respective context window, and concatenating each context audio feature to form a combined feature vector to represent the current frame. Context windows of different lengths can improve response speed and robustness.
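    Illustrative sketch (not from the patent): per-frame feature vectors are summarized over several context windows of different lengths around the current frame, and the summaries are concatenated into one combined vector. Using the mean as the context feature, and describing each window as a (frames before, frames after) pair, are assumptions; the abstract does not fix either choice.

```python
from statistics import mean


def context_features(frame_features, current, windows):
    """Build the combined feature vector for one frame.

    frame_features: list of per-frame feature vectors (lists of floats).
    current: index of the current frame.
    windows: list of (before, after) frame counts defining each context window.
    """
    combined = []
    for before, after in windows:
        # Clamp the window to the available frames at the signal edges.
        start = max(0, current - before)
        stop = min(len(frame_features), current + after + 1)
        frames = frame_features[start:stop]
        # One mean per feature dimension, appended to the combined vector.
        combined.extend(mean(column) for column in zip(*frames))
    return combined


features = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0], [6.0, 7.0]]
vector = context_features(features, current=2, windows=[(1, 0), (2, 0)])
```

    With two feature dimensions and two windows, the combined vector has four entries: the short window reacts quickly, the long one is steadier.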

    AUDIO SOURCE SEPARATION
    37.
    Invention application

    Publication number: US20190392848A1

    Publication date: 2019-12-26

    Application number: US16561836

    Application date: 2019-09-05

    Abstract: The present document describes a method for extracting J audio sources from I audio channels. The method includes updating a Wiener filter matrix based on a mixing matrix from a source matrix and based on a power matrix of the J audio sources. Furthermore, the method includes updating a cross-covariance matrix of the I audio channels and of the J audio sources and an auto-covariance matrix of the J audio sources, based on the updated Wiener filter matrix and based on an auto-covariance matrix of the I audio channels. In addition, the method includes updating the mixing matrix and the power matrix based on the updated cross-covariance matrix of the I audio channels and of the J audio sources, and/or based on the updated auto-covariance matrix of the J audio sources.
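    Illustrative sketch (not quoted from the patent): one common form of such an iterative update, written with assumed symbols. Here A is the I x J mixing matrix, Sigma_S the diagonal power matrix of the J sources, Omega the J x I Wiener filter matrix, R_XX the auto-covariance of the I channels, R_XS the cross-covariance of channels and sources, and R_SS the auto-covariance of the sources. The noise covariance Sigma_B is an assumption added so that the inverse is well defined; the abstract does not mention it.

```latex
\begin{aligned}
\Omega &= \Sigma_S A^{H}\left(A\,\Sigma_S A^{H} + \Sigma_B\right)^{-1} \\
R_{XS} &= R_{XX}\,\Omega^{H}, \qquad R_{SS} = \Omega\,R_{XX}\,\Omega^{H} \\
A &= R_{XS}\,R_{SS}^{-1}, \qquad \Sigma_S = \operatorname{diag}\!\left(R_{SS}\right)
\end{aligned}
```

    Each pass reuses the mixing and power matrices from the previous pass, matching the order of the three updates in the abstract.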
