Creating Spatial Audio Stream from Audio Objects with Spatial Extent

    公开(公告)号:US20240355341A1

    公开(公告)日:2024-10-24

    申请号:US18574918

    申请日:2022-06-16

    IPC分类号: G10L19/008

    CPC分类号: G10L19/008

    摘要: An apparatus for spatial audio encoding including circuitry configured to: obtain a first spatial audio stream of a first spatial audio format configured to be encoded with a low bitrate, wherein the first spatial audio stream includes an audio signal and a first metadata; obtain a second and different spatial audio stream of a second spatial audio format, wherein the second spatial audio stream includes a second audio signal and a second metadata; convert the second spatial audio format into the first spatial audio format to encode a converted second spatial audio stream with the low bitrate, wherein the converted spatial audio stream represents spatial audio properties of the second spatial audio stream; combine the first spatial audio stream and the converted second spatial audio stream to generate a combined spatial audio stream; and encode the combined spatial audio stream.

    Method and system for decoding left and right channels of a stereo sound signal

    公开(公告)号:US12125492B2

    公开(公告)日:2024-10-22

    申请号:US17071299

    申请日:2020-10-15

    IPC分类号: G10L19/008

    CPC分类号: G10L19/008

    摘要: A stereo sound decoding method and system decode left and right channels of a stereo sound signal, using received encoding parameters comprising encoding parameters of a primary channel, encoding parameters of a secondary channel, and a factor β. The primary channel encoding parameters comprise LP filter coefficients of the primary channel. The primary channel is decoded in response to the primary channel encoding parameters. The secondary channel is decoded using one of a plurality of coding models, wherein at least one of the coding models uses the primary channel LP filter coefficients to decode the secondary channel. The decoded primary and secondary channels are time domain up-mixed using the factor β to produce the decoded left and right channels of the stereo sound signal, wherein the factor β determines respective contributions of the primary and secondary channels upon production of the left and right channels.

    METHOD AND DEVICE FOR UNIFIED TIME-DOMAIN / FREQUENCY DOMAIN CODING OF A SOUND SIGNAL

    公开(公告)号:US20240321285A1

    公开(公告)日:2024-09-26

    申请号:US18259971

    申请日:2022-01-05

    摘要: A unified time-domain/frequency-domain coding method and device for coding an input sound signal comprise a classifier of the input sound signal into one of a plurality of sound signal categories comprising an unclear signal type category showing that the nature of the input sound signal is unclear. One of a plurality of coding sub-modes is selected for coding the input sound signal if the input sound signal is classified in the unclear signal type category. A mixed time-domain/frequency-domain encoder codes the input sound signal using the selected coding sub-mode. The mixed time-domain/frequency-domain encoder comprises a selector of frequency bands and allocator of bits for selecting frequency bands to quantize and for distributing a bit budget available to quantization between the selected frequency bands. Corresponding sound signal decoder and decoding method are also provided.