-
公开(公告)号:US20230075445A1
公开(公告)日:2023-03-09
申请号:US17898358
申请日:2022-08-29
发明人: Jeroen Koppens , Jeffrey Riedmiller , Kristofer Kjoerling , Alexander Stahlmann , Holger Hoerich , Alan J. Seefeldt
IPC分类号: G06F3/16 , H04S7/00 , H03G3/30 , G10L19/16 , H03G9/18 , H03G9/00 , H03G7/00 , G10L21/0324 , H03G9/02 , H04S1/00 , H03G9/12
摘要: In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
-
公开(公告)号:US20220417585A1
公开(公告)日:2022-12-29
申请号:US17778295
申请日:2020-11-18
发明人: Malte Schmidt , Holger Hoerich
IPC分类号: H04N21/439 , H04N21/81 , H04N21/462
摘要: The present document describes a method (400) for personalizing audio content. The method (400) comprises receiving (401) a manifest file (140) for the audio content. The manifest file (140) comprises at least one adaptation set (281, 282) referencing an audio bitstream (121), where the audio bitstream (121) comprises a plurality of audio objects (181), and a plurality of different preselection elements (291, 292, 293) for the adaptation set (281, 282), wherein the different preselection elements (291, 292, 293) specify different combinations of the plurality of audio objects (181). The method (400) further comprises selecting (402) a preselection element (291) from the plurality of different preselection elements (291, 292, 293), and causing (403) rendering of an audio signal which depends on the selected preselection element (291).
-
公开(公告)号:US10606550B2
公开(公告)日:2020-03-31
申请号:US16387073
申请日:2019-04-17
发明人: Jeroen Koppens , Jeffrey Riedmiller , Kristofer Kjoerling , Alexander Stahlmann , Holger Hoerich , Alan J. Seefeldt
IPC分类号: G06F3/16 , H03G7/00 , H03G9/00 , G10L19/16 , H03G9/02 , H03G9/18 , H03G3/30 , H03G9/12 , H04S1/00 , H04S7/00 , G10L21/0324
摘要: In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
-
公开(公告)号:US09990935B2
公开(公告)日:2018-06-05
申请号:US14916545
申请日:2014-09-10
发明人: Alexander Groeschel , Jens Popp , Martin Wolters , Jeffrey Riedmiller , Michael Ward , Karl Jonas Roeden , Holger Hoerich , Karsten Linzmeier , Tobias Friedrich
IPC分类号: G06F17/00 , G10L19/24 , G10L19/008 , H03H17/02 , H03H17/04 , H04S3/02 , G10L19/16 , G10L25/18 , G10L19/005
CPC分类号: G10L19/24 , G10L19/005 , G10L19/008 , G10L19/167 , G10L25/18 , H03H17/0275 , H03H17/0277 , H03H17/0455 , H04S3/02 , H04S2400/03 , H04S2400/05
摘要: The present document relates to the design of anti-aliasing and/or anti-imaging filters for resamplers using rational resampling factors. In particular, the present document relates to a method for implementing such anti-aliasing and/or anti-imaging filters with reduced computational complexity. In addition, the present document relates to further aspects of an audio encoding and decoding system, such as the phase relation between the channels of a multi-channel audio signal and/or the structure of the bitstream of an encoded audio signal.
-
公开(公告)号:US09761236B2
公开(公告)日:2017-09-12
申请号:US15452948
申请日:2017-03-08
IPC分类号: G10L19/00 , G10L19/093 , G10L19/07 , G10L19/26 , G10L19/02 , G10L19/028 , G10L19/16
CPC分类号: G10L19/0208 , G10L19/0017 , G10L19/0204 , G10L19/028 , G10L19/06 , G10L19/07 , G10L19/093 , G10L19/167 , G10L19/24 , G10L19/26 , G10L19/265 , G10L21/038
摘要: A method performed in an audio decoder for reconstructing an original audio signal having a lowband portion and a highband portion is disclosed. The method includes receiving an encoded audio signal and extracting reconstruction parameters from the encoded audio signal. The method further includes decoding the encoded audio signal with a core audio decoder to obtain a decoded lowband portion and regenerating the highband portion based at least in part on a cross over frequency and the decoded lowband portion to obtain a regenerated highband portion. The method also includes creating a synthetic sinusoid with a level based at least in part on a spectral envelope value for the particular subband and a noise floor value for the particular subband and adding the synthetic sinusoid to the regenerated highband portion in the particular frequency band specified by the location information. Finally, the method includes combining the lowband portion and the regenerated highband portion to obtain a full bandwidth audio signal.
-
公开(公告)号:US09626149B2
公开(公告)日:2017-04-18
申请号:US15098530
申请日:2016-04-14
发明人: Jeroen Koppens , Jeffrey Riedmiller , Kristofer Kjoerling , Alexander Groeschel , Holger Hoerich , Alan J. Seefeldt
IPC分类号: H03G7/00 , G06F3/16 , H03G9/00 , G10L19/16 , H03G3/30 , H03G9/12 , H04S1/00 , H04S7/00 , G10L21/0324
CPC分类号: G06F3/165 , G10L19/167 , G10L21/0324 , H03G3/3005 , H03G3/3089 , H03G7/002 , H03G7/007 , H03G9/005 , H03G9/12 , H04S1/00 , H04S7/30 , H04S2400/13
摘要: In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
-
公开(公告)号:US09530424B2
公开(公告)日:2016-12-27
申请号:US14357188
申请日:2012-11-12
发明人: Holger Hoerich , Tobias Friedrich
IPC分类号: G10L19/032 , G10L19/24 , G10L19/02 , G10L19/26 , G10L19/16
CPC分类号: G10L19/032 , G10L19/0204 , G10L19/167 , G10L19/24 , G10L19/265
摘要: An encoder (250) comprises a core encoder (252) for encoding a low frequency component of the audio signal at the signal sampling rate (fs_in) and a spectral band replication-referred to as SBR-encoding unit (153, 254) for determining a plurality of SBR parameters. A plurality of the SBR parameters is determined such that a high frequency component of the audio signal can be approximated based on the low frequency component of the audio signal and the plurality of SBR parameters. A multiplexer (155) is adapted to generate an overall bitstream comprising the core encoded bitstream, the plurality of SBR parameters and an indication of one or more SBR encoder settings applied by the SBR encoder (153, 254); wherein the generated overall bitstream does not indicate that the core encoded bitstream has been determined by encoding the low frequency component at the signal sampling rate (fs_in).
摘要翻译: 编码器(250)包括用于以信号采样率(fs_in)对音频信号的低频分量进行编码的核心编码器(252)和被称为SBR编码单元(153,254)的频谱带复制,用于确定 多个SBR参数。 确定多个SBR参数,使得音频信号的高频分量可以基于音频信号的低频分量和多个SBR参数来近似。 复用器(155)适于生成包括核心编码比特流,多个SBR参数和由SBR编码器(153,254)应用的一个或多个SBR编码器设置的指示的总比特流; 其中所生成的总比特流不指示通过以信号采样率(fs_in)对低频分量进行编码来确定核心编码比特流。
-
公开(公告)号:US09111524B2
公开(公告)日:2015-08-18
申请号:US13688682
申请日:2012-11-29
发明人: Holger Hoerich
CPC分类号: G10L19/00 , G10L19/167
摘要: The present document relates to methods and systems for encoding and decoding multimedia files. In particular, the present document relates to methods and systems for encoding and decoding a plurality of audio tracks for seamless playback of the plurality of audio tracks. A method for encoding an audio signal comprising a first and a directly following second audio track for seamless and individual playback of the first and second audio tracks is described. The first and second audio tracks comprise a first and second plurality of audio frames, respectively. The method comprises jointly encoding the audio signal using a frame based audio encoder, thereby yielding a continuous sequence of encoded frames; extracting a first plurality of encoded frames from the continuous sequence of encoded frames; extracting a second plurality of encoded frames from the continuous sequence of encoded frames; appending one or more rear extension frames to an end of the first plurality of encoded frames; and appending one or more front extension frames to the beginning of the second plurality of encoded frames.
摘要翻译: 本文件涉及用于对多媒体文件进行编码和解码的方法和系统。 特别地,本文件涉及用于编码和解码多个音频轨道以用于多个音频轨道的无缝重放的方法和系统。 描述了一种用于对包括第一和直接跟随的第二音频轨道的音频信号进行编码的方法,用于第一和第二音轨的无缝和单独重放。 第一和第二音轨分别包括第一和第二多个音频帧。 该方法包括使用基于帧的音频编码器对音频信号进行联合编码,从而产生编码帧的连续序列; 从编码帧的连续序列中提取第一多个编码帧; 从编码帧的连续序列中提取第二多个编码帧; 将一个或多个后扩展帧附加到所述第一多个编码帧的末尾; 以及将一个或多个前扩展帧附加到第二多个编码帧的开头。
-
公开(公告)号:US12112766B2
公开(公告)日:2024-10-08
申请号:US18233330
申请日:2023-08-14
发明人: Holger Hoerich , Jeroen Koppens
IPC分类号: G10L19/16 , G10L19/26 , G10L21/0364 , H03G7/00 , H03G9/00 , H04N21/233
CPC分类号: G10L19/167 , G10L19/26 , G10L21/0364 , H03G7/002 , H03G7/007 , H03G9/005 , H04N21/233
摘要: A method (600) for decoding an encoded audio signal (102) is described. The encoded audio signal (102) comprises a sequence of frames. Furthermore, the encoded audio signal (102) is indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles from the plurality of DRC profiles are comprised within different frames of the sequence of frames, such that two or more frames of the sequence of frames jointly comprise the plurality of DRC profiles. The method (600) comprises determining a first rendering mode from the plurality of different rendering modes; determining (609, 610) one or more DRC profiles from a subset of DRC profiles comprised within a current frame of the sequence of frames; determining (611) whether at least one of the one or more DRC profiles is applicable to the first rendering mode; selecting (604) a default DRC profile as a current DRC profile, if none of the one or more DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder (100) for decoding the encoded audio signal (102); and decoding the current frame using the current DRC profile.
-
公开(公告)号:US12087317B2
公开(公告)日:2024-09-10
申请号:US17603149
申请日:2020-04-15
IPC分类号: G10L21/0308 , G10L19/008 , H03G5/16 , H03G9/18
CPC分类号: G10L21/0308 , G10L19/008 , H03G5/165 , H03G9/18
摘要: Dialogue enhancement of an audio signal, comprising obtaining a set of time-varying parameters configured to estimate a dialogue component present in said audio signal, estimating the dialogue component from the audio signal, applying a compressor only to the estimated dialogue component, to generate a processed dialogue component, applying a user-determined gain to the processed dialogue component, to provide an enhanced dialogue component. The processing of the estimated dialogue may be performed on the decoder side or encoder side. The invention enables an improved dialogue enhancement.
-
-
-
-
-
-
-
-
-