-
公开(公告)号:US12300263B2
公开(公告)日:2025-05-13
申请号:US18982152
申请日:2024-12-16
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer Kjoerling , Lars Villemoes , Heiko Purnhagen , Per Ekstrand
IPC: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
2.
公开(公告)号:US12296028B2
公开(公告)日:2025-05-13
申请号:US18982254
申请日:2024-12-16
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer Kjoerling , Lars Villemoes , Heiko Purnhagen , Per Ekstrand
IPC: G10L19/18 , A61B5/055 , A61K49/10 , G01R33/56 , G01R33/563 , G10L21/038
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
公开(公告)号:US20250142285A1
公开(公告)日:2025-05-01
申请号:US19012706
申请日:2025-01-07
Inventor: Dirk Jeroen BREEBAART , Lie LU , Nicolas R. TSINGOS , Antonio MATEOS SOLE
IPC: H04S7/00 , G10L19/00 , G10L19/008 , G10L19/018 , G10L19/20 , H04S3/00
Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
-
公开(公告)号:US20250142280A1
公开(公告)日:2025-05-01
申请号:US18942909
申请日:2024-11-11
Applicant: Dolby International AB
Inventor: Leon Terentiv , Christof Fersch , Daniel Fischer
Abstract: A method (900) for rendering audio in a virtual reality rendering environment (180) is described. The method (900) comprises rendering (901) an origin audio signal of an origin audio source (113) of an origin audio scene (111) from an origin source position on a sphere (114) around a listening position (201) of a listener (181). Furthermore, the method (900) comprises determining (902) that the listener (181) moves from the listening position (201) within the origin audio scene (111) to a listening position (202) within a different destination audio scene (112). In addition, the method (900) comprises applying (903) a fade-out gain to the origin audio signal to determine a modified origin audio signal, and rendering (903) the modified origin audio signal of the origin audio source (113) from the origin source position on the sphere (114) around the listening position (201, 202).
-
公开(公告)号:US20250142276A1
公开(公告)日:2025-05-01
申请号:US18835523
申请日:2023-02-03
Inventor: Saketh SATHUVALLI , Christof Joseph FERSCH , Panji SETIAWAN , Tripti TIWARI , Reshma RAI
IPC: H04S3/00
Abstract: The present document describes a method (400) for rendering an ambisonics signal using a loudspeaker arrangement comprising S loudspeakers. The method (400) comprises converting (401) a set of N ambisonics channel signals (111) into a set of unfiltered pre-rendered signals (211), with N>1 and S>1. Furthermore, the method (400) comprises performing (402) near field compensation, referred to as NFC, filtering of M unfiltered pre-rendered signals (211) of the set of unfiltered pre-rendered signals (211) to provide a set of S filtered loudspeaker channel signals (114) for rendering using the corresponding S loudspeakers.
-
6.
公开(公告)号:US12288564B1
公开(公告)日:2025-04-29
申请号:US18982478
申请日:2024-12-16
Applicant: Dolby International AB
Inventor: Kristofer Kjoerling , Lars Villemoes , Heiko Purnhagen , Per Ekstrand
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
-
公开(公告)号:US20250126427A1
公开(公告)日:2025-04-17
申请号:US18984971
申请日:2024-12-17
Inventor: Michael C. Ward , Jeffrey Riedmiller , Scott Gregory Norcross , Alexander Stahlmann
IPC: H04S7/00 , G10L19/008 , G10L21/0324 , H04S1/00
Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.
-
公开(公告)号:US20250124933A1
公开(公告)日:2025-04-17
申请号:US19000460
申请日:2024-12-23
Inventor: Sripal S. Mehta , Thomas Ziegler , Giles Baker , Jeffrey Riedmiller , Prinyar Saungsomboon
Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
-
公开(公告)号:US20250119698A1
公开(公告)日:2025-04-10
申请号:US18925693
申请日:2024-10-24
Inventor: Stefan BRUHN
IPC: G10L19/008 , H04R1/32 , H04S7/00
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
-
公开(公告)号:US20250118325A1
公开(公告)日:2025-04-10
申请号:US18982152
申请日:2024-12-16
Applicant: DOLBY INTERNATIONAL AB
Inventor: Kristofer KJOERLING , Lars VILLEMOES , Heiko PURNHAGEN , Per EKSTRAND
IPC: G10L21/0388 , G10L19/008 , G10L19/02 , H04S3/00
Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
-
-
-
-
-
-
-
-
-