Systems, methods and apparatus for conversion from channel-based audio to object-based audio

    公开(公告)号:US12094476B2

    公开(公告)日:2024-09-17

    申请号:US17781978

    申请日:2020-12-02

    CPC classification number: G10L19/167

    Abstract: Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an embodiment, the CBA metadata includes signaling that indicates a specific OAMD representation to be used in the conversion of the metadata. In an embodiment, pre-computed OAMD is transmitted in a native audio bitstream (e.g., AAC) for transmission (e.g., over HDMI) or for rendering in a source device. In an embodiment, pre-computed OAMD is transmitted in a transport layer bitstream (e.g., ISO BMFF, MPEG4 audio bitstream) to a playback device or source device.

    Audio Bitstreams with Supplementary Data and Encoding and Decoding of Such Bitstreams
    20.
    发明申请
    Audio Bitstreams with Supplementary Data and Encoding and Decoding of Such Bitstreams 审中-公开
    具有补充数据和这样的比特流的编码和解码的音频比特流

    公开(公告)号:US20150348558A1

    公开(公告)日:2015-12-03

    申请号:US14822168

    申请日:2015-08-10

    CPC classification number: G10L19/018 G10L19/167

    Abstract: Methods for generating or decoding an encoded audio bitstream including audio data and supplementary data (e.g., metadata and/or unrelated audio data), where at least some of the supplementary data is included as LSBs of audio segments, and/or at least some of the supplementary data is included in guard bands. Typical embodiments provide a scalable and video synchronous format compatible with real-time and file-based infrastructure components that support the SMPTE 337 format for carrying data in AES3 serial bitstreams, and/or provide a framework for extending distribution codecs to scale beyond an 8-channel limit to support multiples of 8 channels synchronously across multiple AES3 interfaces. Another aspect is an audio processing unit configured to perform any embodiment of the method or including a buffer memory storing at least one segment of an audio bitstream generated in accordance with any embodiment of the method.

    Abstract translation: 用于生成或解码包括音频数据和补充数据(例如,元数据和/或不相关的音频数据)的编码音频比特流的方法,其中至少一些补充数据被包括为音频段的LSB,和/或至少一些 辅助数据包含在保护频带中。 典型的实施例提供了与支持用于在AES3串行比特流中传送数据的SMPTE 337格式的实时和基于文件的基础设施组件兼容的可扩展和视频同步格式,和/或提供用于扩展分布式编解码器以扩展到8- 通道限制,以跨多个AES3接口同步支持8个通道的倍数。 另一方面是被配置为执行该方法的任何实施例的音频处理单元,或者包括存储根据该方法的任何实施例生成的音频比特流的至少一个段的缓冲存储器。

Patent Agency Ranking