METHODS AND DEVICES FOR PERSONALIZING AUDIO CONTENT

    Publication No.: US20220417585A1

    Publication date: 2022-12-29

    Application No.: US17778295

    Filing date: 2020-11-18

    Abstract: The present document describes a method (400) for personalizing audio content. The method (400) comprises receiving (401) a manifest file (140) for the audio content. The manifest file (140) comprises at least one adaptation set (281, 282) referencing an audio bitstream (121), where the audio bitstream (121) comprises a plurality of audio objects (181), and a plurality of different preselection elements (291, 292, 293) for the adaptation set (281, 282), wherein the different preselection elements (291, 292, 293) specify different combinations of the plurality of audio objects (181). The method (400) further comprises selecting (402) a preselection element (291) from the plurality of different preselection elements (291, 292, 293), and causing (403) rendering of an audio signal which depends on the selected preselection element (291).
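
    The receive/select/render flow of this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the class names, the field names, and the language-based selection rule are all assumptions made for the example, since the abstract does not say how a preselection element is chosen.

```python
from dataclasses import dataclass, field


@dataclass
class Preselection:
    """Hypothetical preselection element: one combination of audio objects."""
    ident: str
    language: str        # assumed selection criterion, not from the abstract
    object_ids: tuple    # audio objects this combination renders


@dataclass
class AdaptationSet:
    """Hypothetical adaptation set referencing one audio bitstream."""
    bitstream_url: str
    object_ids: tuple    # all audio objects carried in the bitstream
    preselections: list = field(default_factory=list)


def select_preselection(adaptation_set, preferred_language):
    """Pick the first preselection matching the language, else the first listed."""
    for preselection in adaptation_set.preselections:
        if preselection.language == preferred_language:
            return preselection
    return adaptation_set.preselections[0]


def objects_to_render(adaptation_set, preselection):
    """Return the audio objects of the bitstream that the preselection combines."""
    available = set(adaptation_set.object_ids)
    return [obj for obj in preselection.object_ids if obj in available]


manifest_adaptation_set = AdaptationSet(
    bitstream_url="audio.mp4",
    object_ids=("music", "dialog_en", "dialog_de", "commentary"),
    preselections=[
        Preselection("p1", "en", ("music", "dialog_en")),
        Preselection("p2", "de", ("music", "dialog_de")),
        Preselection("p3", "en", ("music", "dialog_en", "commentary")),
    ],
)

chosen = select_preselection(manifest_adaptation_set, "de")
rendered_objects = objects_to_render(manifest_adaptation_set, chosen)
```

    Here all three preselections refer to the same bitstream, so switching between them changes only which objects are rendered, not which stream is fetched.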

    Upsampling using oversampled SBR
    Type: Granted invention patent (in force)

    Publication No.: US09530424B2

    Publication date: 2016-12-27

    Application No.: US14357188

    Filing date: 2012-11-12

    Abstract: An encoder (250) comprises a core encoder (252) for encoding a low frequency component of the audio signal at the signal sampling rate (fs_in) and a spectral band replication (referred to as SBR) encoding unit (153, 254) for determining a plurality of SBR parameters. The plurality of SBR parameters is determined such that a high frequency component of the audio signal can be approximated based on the low frequency component of the audio signal and the plurality of SBR parameters. A multiplexer (155) is adapted to generate an overall bitstream comprising the core encoded bitstream, the plurality of SBR parameters and an indication of one or more SBR encoder settings applied by the SBR encoder (153, 254); wherein the generated overall bitstream does not indicate that the core encoded bitstream has been determined by encoding the low frequency component at the signal sampling rate (fs_in).

    Seamless playback of successive multimedia files
    Type: Granted invention patent (in force)

    Publication No.: US09111524B2

    Publication date: 2015-08-18

    Application No.: US13688682

    Filing date: 2012-11-29

    Inventor: Holger Hoerich

    IPC classification: G10L19/00 G10L19/16

    CPC classification: G10L19/00 G10L19/167

    Abstract: The present document relates to methods and systems for encoding and decoding multimedia files. In particular, the present document relates to methods and systems for encoding and decoding a plurality of audio tracks for seamless playback of the plurality of audio tracks. A method for encoding an audio signal comprising a first and a directly following second audio track for seamless and individual playback of the first and second audio tracks is described. The first and second audio tracks comprise a first and second plurality of audio frames, respectively. The method comprises jointly encoding the audio signal using a frame-based audio encoder, thereby yielding a continuous sequence of encoded frames; extracting a first plurality of encoded frames from the continuous sequence of encoded frames; extracting a second plurality of encoded frames from the continuous sequence of encoded frames; appending one or more rear extension frames to an end of the first plurality of encoded frames; and appending one or more front extension frames to the beginning of the second plurality of encoded frames.
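
    The extract-and-append steps of this abstract can be illustrated with a short sketch. It starts from an already jointly encoded frame sequence and shows only the splitting. The function name and parameters are hypothetical, and the choice of extension frames is an assumption: here the rear extension frames are copies of the frames that directly follow the first track, and the front extension frames are copies of the frames that directly precede the second track. The abstract itself does not state where the extension frames come from.

```python
def split_with_extensions(encoded_frames, first_track_len, n_ext=1):
    """Split a jointly encoded frame sequence into two padded track files.

    encoded_frames  : continuous sequence of encoded frames (any objects)
    first_track_len : number of frames belonging to the first track
    n_ext           : number of extension frames per side (assumed parameter)
    """
    if n_ext < 1:
        raise ValueError("n_ext must be at least 1")
    if not 0 < first_track_len < len(encoded_frames):
        raise ValueError("first_track_len must split the sequence in two")

    # Extract the two pluralities of encoded frames.
    first = list(encoded_frames[:first_track_len])
    second = list(encoded_frames[first_track_len:])

    # Assumption: extension frames are taken from the neighbouring track.
    rear_extension = second[:n_ext]
    front_extension = first[-n_ext:]

    # Append rear extension to the end of the first track and
    # front extension to the beginning of the second track.
    return first + rear_extension, front_extension + second


frames = ["f0", "f1", "f2", "f3", "f4", "f5"]
first_file, second_file = split_with_extensions(frames, first_track_len=4)
```

    Under this assumption each file carries enough neighbouring context to be decoded on its own, while a player that plays both files back to back can drop the extension frames and join the tracks at the original frame boundary.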

    Efficient DRC profile transmission

    Publication No.: US12112766B2

    Publication date: 2024-10-08

    Application No.: US18233330

    Filing date: 2023-08-14

    Abstract: A method (600) for decoding an encoded audio signal (102) is described. The encoded audio signal (102) comprises a sequence of frames. Furthermore, the encoded audio signal (102) is indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles from the plurality of DRC profiles are comprised within different frames of the sequence of frames, such that two or more frames of the sequence of frames jointly comprise the plurality of DRC profiles. The method (600) comprises determining a first rendering mode from the plurality of different rendering modes; determining (609, 610) one or more DRC profiles from a subset of DRC profiles comprised within a current frame of the sequence of frames; determining (611) whether at least one of the one or more DRC profiles is applicable to the first rendering mode; selecting (604) a default DRC profile as a current DRC profile, if none of the one or more DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder (100) for decoding the encoded audio signal (102); and decoding the current frame using the current DRC profile.
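
    The per-frame profile selection with a default fallback can be sketched as below. This is an illustration under stated assumptions, not the decoder described in the patent: the `DrcProfile` type, the rendering-mode strings, and the rule of taking the first applicable profile are hypothetical, and the actual decoding of the frame is left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DrcProfile:
    """Hypothetical DRC profile: a name plus the rendering modes it applies to."""
    name: str
    rendering_modes: frozenset


# Assumption: the default profile's definition is built into the decoder.
DEFAULT_PROFILE = DrcProfile("default", frozenset())


def select_current_profile(frame_profiles, rendering_mode, default=DEFAULT_PROFILE):
    """Return a profile from the current frame applicable to the rendering mode.

    Falls back to the decoder-known default profile when the subset of
    profiles carried in this frame contains no applicable one.
    """
    for profile in frame_profiles:
        if rendering_mode in profile.rendering_modes:
            return profile
    return default


# Two frames that jointly carry all profiles, each carrying only a subset.
home = DrcProfile("home_theater", frozenset({"home_theater"}))
phones = DrcProfile("headphones", frozenset({"headphones"}))
frames = [[home], [phones]]

selected = [select_current_profile(f, "headphones").name for f in frames]
```

    In this example the first frame carries no profile for the headphone mode, so the default is used for that frame; the second frame carries the matching profile, which is then selected.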