Processing Spatially Diffuse or Large Audio Objects
    11.
    发明申请
    Processing Spatially Diffuse or Large Audio Objects 有权
    处理空间漫射或大型音频对象

    公开(公告)号:US20160192105A1

    公开(公告)日:2016-06-30

    申请号:US14909058

    申请日:2014-07-24

    Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

    Abstract translation: 漫射或空间较大的音频对象可能被识别用于特殊处理。 可以对与大音频对象相对应的音频信号执行去相关处理,以产生解相关的大音频对象音频信号。 这些去相关的大音频对象音频信号可以与对象位置相关联,其可以是静止的或时变的位置。 例如,解相关的大音频对象音频信号可以呈现为虚拟或实际的扬声器位置。 这样的渲染处理的输出可以被输入到场景简化处理。 可以在对音频数据进行编码的处理之前执行去相关,关联和/或场景简化处理。

    Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria
    12.
    发明申请
    Object Clustering for Rendering Object-Based Audio Content Based on Perceptual Criteria 有权
    基于感知标准渲染基于对象的音频内容的对象聚类

    公开(公告)号:US20150332680A1

    公开(公告)日:2015-11-19

    申请号:US14654460

    申请日:2013-11-25

    Abstract: Embodiments are directed a method of rendering object-based audio comprising determining an initial spatial position of objects having object audio data and associated metadata, determining a perceptual importance of the objects, and grouping the audio objects into a number of clusters based on the determined perceptual importance of the objects, such that a spatial error caused by moving an object from an initial spatial position to a second spatial position in a cluster is minimized for objects with a relatively high perceptual importance. The perceptual importance is based at least in part by a partial loudness of an object and content semantics of the object.

    Abstract translation: 实施例涉及一种渲染基于对象的音频的方法,包括:确定具有对象音频数据和相关元数据的对象的初始空间位置,确定对象的感知重要性,以及基于所确定的知觉,将音频对象分组成多个聚类 使得通过将对象从群集中的初始空间位置移动到第二空间位置而引起的空间误差最小化为具有相对高感知重要性的对象的对象的重要性。 感知重要性至少部分地基于对象的部分响度和对象的内容语义。

    SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

    公开(公告)号:US20230045090A1

    公开(公告)日:2023-02-09

    申请号:US17883440

    申请日:2022-08-08

    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.

    RENDERING OF AUDIO OBJECTS WITH APPARENT SIZE TO ARBITRARY LOUDSPEAKER LAYOUTS

    公开(公告)号:US20180167756A1

    公开(公告)日:2018-06-14

    申请号:US15894626

    申请日:2018-02-12

    Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

Patent Agency Ranking