LOW-LATENCY, LOW-FREQUENCY EFFECTS CODEC

    公开(公告)号:US20220293112A1

    公开(公告)日:2022-09-15

    申请号:US17635795

    申请日:2020-09-01

    Abstract: In some implementations, a method of encoding a low-frequency effect (LFE) channel comprises: receiving a time-domain LFL channel signal; filtering, using a low-pass filter, the time-domain LFE channel signal; converting the filtered time-domain LFE channel signal into a frequency-domain representation of the LFE channel signal that includes a number of coefficients representing a frequency spectrum of the LFL channel signal; arranging coefficients into a number of subband groups corresponding to different frequency bands of the LFE channel signal; quantizing coefficients in each subband group according to a frequency response curve of the low-pass filter; encoding the quantized coefficients in each subband group using an entropy coder tuned for the subband group; and generating a bitstream including the encoded quantized coefficients; and storing the bitstream on a storage device or streaming the bitstream to a downstream device.

    MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA

    公开(公告)号:US20240282321A1

    公开(公告)日:2024-08-22

    申请号:US18584290

    申请日:2024-02-22

    Inventor: David MCGRATH

    CPC classification number: G10L19/008

    Abstract: The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.

    MULTICHANNEL AUDIO ENCODE AND DECODE USING DIRECTIONAL METADATA

    公开(公告)号:US20220392462A1

    公开(公告)日:2022-12-08

    申请号:US17771877

    申请日:2020-10-29

    Inventor: David MCGRATH

    Abstract: The disclosure relates to methods of processing a spatial audio signal for generating a compressed representation of the spatial audio signal. The methods include analyzing the spatial audio signal to determine directions of arrival for one or more audio elements; for at least one frequency subband, determining respective indications of signal power associated with the directions of arrival; generating metadata including direction information that includes indications of the directions of arrival of the audio elements, and energy information that includes respective indications of signal power; generating a channel-based audio signal with a predefined number of channels based on the spatial audio signal; and outputting, as the compressed representation, the channel-based audio signal and the metadata. The disclosure further relates to methods of processing a compressed representation of a spatial audio signal for generating a reconstructed representation of the spatial audio signal, and to corresponding apparatus, programs, and storage media.

Patent Agency Ranking