-
公开(公告)号:US20220293112A1
公开(公告)日:2022-09-15
申请号:US17635795
申请日:2020-09-01
Applicant: DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Rishabh TYAGI , David MCGRATH
IPC: G10L19/02 , G10L25/21 , G10L19/032 , G10L19/008 , G10L19/16 , G10L25/18
Abstract: In some implementations, a method of encoding a low-frequency effect (LFE) channel comprises: receiving a time-domain LFL channel signal; filtering, using a low-pass filter, the time-domain LFE channel signal; converting the filtered time-domain LFE channel signal into a frequency-domain representation of the LFE channel signal that includes a number of coefficients representing a frequency spectrum of the LFL channel signal; arranging coefficients into a number of subband groups corresponding to different frequency bands of the LFE channel signal; quantizing coefficients in each subband group according to a frequency response curve of the low-pass filter; encoding the quantized coefficients in each subband group using an entropy coder tuned for the subband group; and generating a bitstream including the encoded quantized coefficients; and storing the bitstream on a storage device or streaming the bitstream to a downstream device.
-
公开(公告)号:US20220366919A1
公开(公告)日:2022-11-17
申请号:US17762709
申请日:2020-09-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: DIRK JEROEN BREEBAART , ALEX BRANDMEYER , POPPY ANNE CARRIE CRUM , JOYNER STEELE MCGREGOR , David MCGRATH , Andrea FANELLI , Rhonda J. WILSON
IPC: G10L19/008 , H04S7/00
Abstract: Encoding/decoding techniques where multiple transform parameter sets are encoded together with a rendered playback presentation of an input audio content. The multiple transform parameters are used on the decoder side to transform the playback presentation to provide a personalized binaural playback presentation optimized for an individual listener with respect to their hearing profile. This may be achieved by selection or combination of the data present in the metadata streams.
-
公开(公告)号:US20240282321A1
公开(公告)日:2024-08-22
申请号:US18584290
申请日:2024-02-22
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David MCGRATH
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.
-
公开(公告)号:US20220392462A1
公开(公告)日:2022-12-08
申请号:US17771877
申请日:2020-10-29
Applicant: Dolby Laboratories Licensing Corporation
Inventor: David MCGRATH
IPC: G10L19/008
Abstract: The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.
-
-
-