ROTATION OF SOUND COMPONENTS FOR ORIENTATION-DEPENDENT CODING SCHEMES

    Publication No.: US20240013793A1

    Publication date: 2024-01-11

    Application No.: US18255232

    Filing date: 2021-12-02

    CPC classification number: G10L19/008 G10L19/002 G10L19/032

    Abstract: Method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.
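The rotation step in this abstract can be illustrated with a minimal sketch. It assumes a first-order Ambisonics frame (channels W, X, Y, Z), a coding scheme whose direction preference is the front (azimuth 0), and a yaw-only rotation; none of these details come from the abstract. The function names `dominant_azimuth`, `rotate_yaw`, and `align_frame` are hypothetical.

```python
import math

def dominant_azimuth(w, x, y):
    """Estimate the azimuth (radians) of the dominant component
    from the time-averaged horizontal intensity vector (W*X, W*Y)."""
    ix = sum(wi * xi for wi, xi in zip(w, x))
    iy = sum(wi * yi for wi, yi in zip(w, y))
    return math.atan2(iy, ix)

def rotate_yaw(x, y, angle):
    """Rotate the horizontal components by `angle` radians about the vertical axis."""
    c, s = math.cos(angle), math.sin(angle)
    xr = [c * xi - s * yi for xi, yi in zip(x, y)]
    yr = [s * xi + c * yi for xi, yi in zip(x, y)]
    return xr, yr

def align_frame(w, x, y, z, preferred_azimuth=0.0):
    """Rotate one frame so its dominant component points at preferred_azimuth.

    Returns the rotated channels and the rotation angle, which an encoder
    could signal so that a decoder can undo the rotation (assumption).
    """
    rotation = preferred_azimuth - dominant_azimuth(w, x, y)
    xr, yr = rotate_yaw(x, y, rotation)
    return (w, xr, yr, z), rotation
```

W and Z are unchanged by a yaw rotation, so only X and Y are mixed. A full implementation would also need to handle elevation and higher-order components.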

    QUANTIZATION AND ENTROPY CODING OF PARAMETERS FOR A LOW LATENCY AUDIO CODEC

    Publication No.: US20230343346A1

    Publication date: 2023-10-26

    Application No.: US18008445

    Filing date: 2021-06-10

    CPC classification number: G10L19/032 G10L19/008

    Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.
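The looping process can be pictured with a rough sketch: try processing strategies from fine to coarse until the frame's quantized parameters fit a bit budget. Everything specific here is an assumption made for illustration: the `STRATEGIES` table, uniform quantization over [-1, 1], and a fixed-length bit count standing in for an entropy coder. The abstract's per-strategy ordering of parameter calculation is not modeled.

```python
import math

# Hypothetical strategies, ordered fine to coarse: (name, quantization levels).
STRATEGIES = [("fine", 32), ("medium", 16), ("coarse", 8)]

def quantize(values, levels, lo=-1.0, hi=1.0):
    """Uniformly quantize each value in [lo, hi] to an index in [0, levels - 1]."""
    step = (hi - lo) / (levels - 1)
    return [min(levels - 1, max(0, round((v - lo) / step))) for v in values]

def encode_frame(params, bit_budget):
    """Loop over strategies until the frame's parameters fit the bit budget."""
    for name, levels in STRATEGIES:
        indices = quantize(params, levels)
        # Fixed-length cost per index; a real codec would count entropy-coded bits.
        bits = len(indices) * math.ceil(math.log2(levels))
        if bits <= bit_budget:
            break
    # If no strategy fits, the coarsest one is returned as a fallback.
    return name, indices, bits
```

With four parameters and a 16-bit budget, the "fine" strategy needs 20 bits and is rejected, so the loop settles on "medium" at 16 bits.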

    POSITION-BASED GAIN ADJUSTMENT OF OBJECT-BASED AUDIO AND RING-BASED CHANNEL AUDIO
    Invention application (status: pending, published)

    Publication No.: US20160295343A1

    Publication date: 2016-10-06

    Application No.: US15037193

    Filing date: 2014-11-21

    CPC classification number: H04S7/308 H04S2400/11 H04S2400/13

    Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. Gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. Gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at a media consumption site.

    METHODS AND DEVICES FOR ENCODING AND/OR DECODING IMMERSIVE AUDIO SIGNALS

    Publication No.: US20240005933A1

    Publication date: 2024-01-04

    Application No.: US18349427

    Filing date: 2023-07-10

    CPC classification number: G10L19/167 G10L19/008 G10L19/18

    Abstract: The present document describes a method (700) for encoding a multi-channel input signal (201). The method (700) comprises determining (701) a plurality of downmix channel signals (203) from the multi-channel input signal (201) and performing (702) energy compaction of the plurality of downmix channel signals (203) to provide a plurality of compacted channel signals (404). Furthermore, the method (700) comprises determining (703) joint coding metadata (205) based on the plurality of compacted channel signals (404) and based on the multi-channel input signal (201), wherein the joint coding metadata (205) is such that it allows upmixing of the plurality of compacted channel signals (404) to an approximation of the multi-channel input signal (201). In addition, the method (700) comprises encoding (704) the plurality of compacted channel signals (404) and the joint coding metadata (205).
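One common way to perform energy compaction on a pair of channels is a 2x2 Karhunen-Loève (principal-axis) rotation. The sketch below uses that technique as an assumption; the abstract does not say which compaction transform the method uses, and the names `compaction_angle` and `compact` are hypothetical.

```python
import math

def compaction_angle(a, b):
    """Rotation angle of the principal axis of the 2x2 channel covariance."""
    caa = sum(v * v for v in a)
    cbb = sum(v * v for v in b)
    cab = sum(u * v for u, v in zip(a, b))
    return 0.5 * math.atan2(2.0 * cab, caa - cbb)

def compact(a, b):
    """Rotate two channels so that most energy lands in the first output."""
    theta = compaction_angle(a, b)
    c, s = math.cos(theta), math.sin(theta)
    primary = [c * u + s * v for u, v in zip(a, b)]
    residual = [-s * u + c * v for u, v in zip(a, b)]
    return primary, residual, theta
```

Because the rotation is orthogonal, the total energy of the two outputs equals that of the inputs. For fully correlated inputs the residual channel is (numerically) zero.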

    ENHANCEMENT OF SPATIAL AUDIO SIGNALS BY MODULATED DECORRELATION

    Publication No.: US20230230600A1

    Publication date: 2023-07-20

    Application No.: US18158032

    Filing date: 2023-01-23

    Inventor: David S. MCGRATH

    CPC classification number: G10L19/008 H04S3/008

    Abstract: Some methods involve receiving an input audio signal that includes N input audio channels, the input audio signal representing a first soundfield format having a first soundfield format resolution, N being an integer ≥2. A first decorrelation process may be applied to two or more of the input audio channels to produce a first set of decorrelated channels, the first decorrelation process maintaining an inter-channel correlation of the set of input audio channels. A first modulation process may be applied to the first set of decorrelated channels to produce a first set of decorrelated and modulated output channels. The first set of decorrelated and modulated output channels may be combined with two or more undecorrelated output channels to produce an output audio signal that includes O output audio channels representing a second and relatively higher-resolution soundfield format than the first soundfield format, O being an integer ≥3.
