Methods and Apparatus for Rendering Audio Objects

    Publication No.: US20210352426A1

    Publication date: 2021-11-11

    Application No.: US17329094

    Filing date: 2021-05-24

    Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.
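The two phases described in this abstract (set-up and run time) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the patent: the 2-D unit-square volume, the inverse-distance gain law, the circular object extent, the power-sum combination, and the function names `make_virtual_sources`, `precompute_gains`, and `render_gains`.

```python
import math

def make_virtual_sources(n):
    """Return an n x n grid of virtual source positions in the unit square."""
    step = 1.0 / (n - 1)
    return [(i * step, j * step) for i in range(n) for j in range(n)]

def precompute_gains(virtual_sources, speakers, eps=1e-3):
    """Set-up: store one gain per (virtual source, speaker) pair.

    Assumed gain law: inverse distance, normalized to unit power per source.
    """
    table = []
    for vx, vy in virtual_sources:
        raw = [1.0 / (math.hypot(vx - sx, vy - sy) + eps) for sx, sy in speakers]
        norm = math.sqrt(sum(g * g for g in raw))
        table.append([g / norm for g in raw])
    return table

def render_gains(position, size, virtual_sources, table):
    """Run time: combine stored gains of the virtual sources the object covers.

    `size` is treated as a radius around `position` (an assumption).
    """
    px, py = position
    dists = [math.hypot(vx - px, vy - py) for vx, vy in virtual_sources]
    covered = [k for k, d in enumerate(dists) if d <= size]
    if not covered:
        # Object smaller than the grid spacing: use the nearest virtual source.
        covered = [dists.index(min(dists))]
    n_out = len(table[0])
    # Sum contributions in the power domain, then normalize to unit total power.
    power = [sum(table[k][c] ** 2 for k in covered) for c in range(n_out)]
    total = math.sqrt(sum(power))
    return [math.sqrt(p) / total for p in power]

# Usage: four speakers at the corners, one object at the center of the room.
speakers = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
sources = make_virtual_sources(5)
gain_table = precompute_gains(sources, speakers)        # done once at set-up
center_gains = render_gains((0.5, 0.5), 0.3, sources, gain_table)
corner_gains = render_gains((0.0, 0.0), 0.0, sources, gain_table)
```

Because the gain table depends only on the speaker layout, it is computed once; the per-object work at run time reduces to a lookup and a sum over the covered virtual sources.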

    RENDERING OF AUDIO OBJECTS WITH APPARENT SIZE TO ARBITRARY LOUDSPEAKER LAYOUTS

    Publication No.: US20200336855A1

    Publication date: 2020-10-22

    Application No.: US16868861

    Filing date: 2020-05-07

    Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

    DETERMINING AZIMUTH AND ELEVATION ANGLES FROM STEREO RECORDINGS

    Publication No.: US20190335272A1

    Publication date: 2019-10-31

    Application No.: US16509973

    Filing date: 2019-07-12

    Abstract: Input audio data, including first microphone audio signals and second microphone audio signals output by a pair of coincident, vertically-stacked directional microphones, may be received. An azimuthal angle corresponding to a sound source location may be determined, based at least in part on an intensity difference between the first microphone audio signals and the second microphone audio signals. An elevation angle corresponding to a sound source location may be determined, based at least in part on a temporal difference between the first microphone audio signals and the second microphone audio signals. Output audio data, including at least one audio object corresponding to a sound source, may be generated. The audio object may include audio object signals and associated audio object metadata. The audio object metadata may include at least audio object location data corresponding to the sound source location.
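As a rough illustration of the two cues named in this abstract, the sketch below measures an intensity (level) difference and a temporal difference between two microphone signals and maps them to angles. The mappings are assumptions, not the patent's method: azimuth is taken as a linear function of the level difference up to a hypothetical `max_ild_db`, and elevation uses a far-field model with a hypothetical vertical capsule spacing `spacing_m`. All function names are illustrative.

```python
import math

def level_difference_db(first, second):
    """Intensity cue: RMS level of `first` relative to `second`, in dB."""
    rms1 = math.sqrt(sum(x * x for x in first) / len(first))
    rms2 = math.sqrt(sum(x * x for x in second) / len(second))
    return 20.0 * math.log10(rms1 / rms2)

def lag_samples(first, second, max_lag):
    """Temporal cue: lag maximizing the cross-correlation.

    A positive result means `second` is delayed relative to `first`.
    """
    n = len(first)
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += first[i] * second[j]
        if score > best_score:
            best_score, best_lag = score, lag
    return best_lag

def azimuth_deg(ild_db, max_ild_db=12.0):
    """Assumed linear mapping from level difference to azimuth in [-90, 90]."""
    ratio = max(-1.0, min(1.0, ild_db / max_ild_db))
    return 90.0 * ratio

def elevation_deg(lag, sample_rate, spacing_m, speed_of_sound=343.0):
    """Assumed far-field model: delay = spacing * sin(elevation) / c."""
    delay = lag / sample_rate
    ratio = max(-1.0, min(1.0, speed_of_sound * delay / spacing_m))
    return math.degrees(math.asin(ratio))

# Usage: the second signal is a half-amplitude copy delayed by 3 samples.
first = [0.0] * 10 + [1.0, 2.0, 3.0, 2.0, 1.0] + [0.0] * 10
second = [0.0] * 13 + [0.5, 1.0, 1.5, 1.0, 0.5] + [0.0] * 7
ild = level_difference_db(first, second)
lag = lag_samples(first, second, max_lag=5)
azimuth = azimuth_deg(ild)
elevation = elevation_deg(lag, sample_rate=48000, spacing_m=0.05)
```

The resulting angles could then be written into the location metadata of an audio object; that packaging step is not shown here.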

    ADAPTIVE QUANTIZATION
    Patent application

    Publication No.: US20190027157A1

    Publication date: 2019-01-24

    Application No.: US16072168

    Filing date: 2017-01-26

    CPC classification number: G10L19/032 G10L19/00 G10L19/002 G10L19/20 H03M1/00

    Abstract: An importance metric, based at least in part on an energy metric, may be determined for each of a plurality of received audio objects. Some methods may involve: determining a global importance metric for all of the audio objects, based, at least in part, on a total energy value calculated by summing the energy metric of each of the audio objects; determining an estimated quantization bit depth and a quantization error for each of the audio objects; calculating a total noise metric for all of the audio objects, the total noise metric being based, at least in part, on a total quantization error corresponding with the estimated quantization bit depth; calculating a total signal-to-noise ratio corresponding with the total noise metric and the total energy value; and determining a final quantization bit depth for each of the audio objects by applying a signal-to-noise ratio threshold to the total signal-to-noise ratio.
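The sequence of steps in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed method: the energy metric is mean-square amplitude, the importance metric is each object's share of the total energy, the quantization error is the uniform-quantizer noise power for a full-scale range of [-1, 1), and the final bit depths are found by a greedy search that lowers the bit depth of the least important objects first while the total signal-to-noise ratio stays at or above the threshold. The function names and the `max_bits`/`min_bits` defaults are hypothetical.

```python
import math

def quant_noise_power(bits):
    """Noise power of a uniform quantizer over [-1, 1): step^2 / 12."""
    step = 2.0 / (2 ** bits)
    return step * step / 12.0

def total_snr_db(energies, bit_depths):
    """Total SNR from the summed energy and the summed quantization noise."""
    total_energy = sum(energies)
    total_noise = sum(quant_noise_power(b) for b in bit_depths)
    return 10.0 * math.log10(total_energy / total_noise)

def assign_bit_depths(objects, snr_threshold_db, max_bits=24, min_bits=8):
    """Greedy bit-depth assignment (an assumed strategy, see lead-in)."""
    # Energy metric per object and total energy over all objects.
    energies = [sum(x * x for x in obj) / len(obj) for obj in objects]
    total_energy = sum(energies)
    # Importance metric: share of the total energy.
    importance = [e / total_energy for e in energies]
    bits = [max_bits] * len(objects)
    # Visit objects from least to most important.
    for k in sorted(range(len(objects)), key=lambda i: importance[i]):
        while bits[k] > min_bits:
            bits[k] -= 1
            if total_snr_db(energies, bits) < snr_threshold_db:
                bits[k] += 1  # undo the step that broke the threshold
                break
    return bits, energies

# Usage: one loud object and one quiet object, 60 dB total-SNR threshold.
objects = [[0.5, -0.5, 0.5, -0.5], [0.01, -0.01, 0.01, -0.01]]
bits, energies = assign_bit_depths(objects, snr_threshold_db=60.0)
snr = total_snr_db(energies, bits)
```

Since the threshold is applied to the total ratio rather than per object, bits removed from one object reduce the noise budget left for the others, which is why the visiting order matters.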

    System and Method for Adaptive Audio Signal Generation, Coding and Rendering
    Patent application
    Status: in force

    Publication No.: US20160381483A1

    Publication date: 2016-12-29

    Application No.: US15263279

    Filing date: 2016-09-12

    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent. The object position metadata contains the appropriate allocentric frame of reference information required to play the sound correctly using the available speaker positions in a room that is set up to play the adaptive audio content.
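To make the stream and metadata structure described in this abstract concrete, here is a small sketch. The byte layout is hypothetical (a length-prefixed JSON header followed by raw audio bytes) and is not the codec's actual bitstream format. The `Stream` type, the `pack`, `unpack`, and `to_room` helpers, and the convention that object positions are room-relative fractions in [0, 1] are all assumptions made for illustration.

```python
import json
import struct
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass
class Stream:
    samples: bytes                     # one monophonic audio stream
    kind: str                          # "channel" or "object"
    channel_name: Optional[str] = None # rendering info for channel-based streams
    position: Optional[Tuple[float, float, float]] = None  # object-based streams

def pack(streams: List[Stream]) -> bytes:
    """Serialize all streams into a single serial byte string."""
    out = bytearray()
    for s in streams:
        meta = json.dumps({
            "kind": s.kind,
            "channel_name": s.channel_name,
            "position": s.position,
        }).encode("utf-8")
        # Header: metadata length and audio length as big-endian uint32.
        out += struct.pack(">II", len(meta), len(s.samples))
        out += meta + s.samples
    return bytes(out)

def unpack(data: bytes) -> List[Stream]:
    """Recover the independent streams and their metadata."""
    streams, offset = [], 0
    while offset < len(data):
        meta_len, audio_len = struct.unpack_from(">II", data, offset)
        offset += 8
        meta = json.loads(data[offset:offset + meta_len].decode("utf-8"))
        offset += meta_len
        samples = data[offset:offset + audio_len]
        offset += audio_len
        pos = tuple(meta["position"]) if meta["position"] is not None else None
        streams.append(Stream(samples, meta["kind"], meta["channel_name"], pos))
    return streams

def to_room(position, room_size):
    """Map a room-relative (allocentric) position to metres in a given room."""
    return tuple(p * d for p, d in zip(position, room_size))

# Usage: one channel-based stream and one object-based stream.
original = [
    Stream(b"\x00\x01", "channel", channel_name="L"),
    Stream(b"\x02\x03\x04", "object", position=(0.5, 1.0, 0.0)),
]
restored = unpack(pack(original))
in_room = to_room(restored[1].position, (8.0, 10.0, 3.0))
```

Storing the object position relative to the room rather than to a listener is what lets the same metadata be rendered in rooms of different sizes and speaker layouts.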

