AUDIO ENCODING AND DECODING METHOD AND APPARATUS

    Publication number: US20230298601A1

    Publication date: 2023-09-21

    Application number: US18202930

    Filing date: 2023-05-28

    CPC classification number: G10L19/008

    Abstract: Audio encoding and decoding methods and apparatuses are disclosed, to reduce an amount of encoded and decoded data, so as to improve encoding and decoding efficiency. The method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a first scene audio signal; generating a first virtual speaker signal based on the first scene audio signal and attribute information of the first target virtual speaker; obtaining a second scene audio signal using the attribute information of the first target virtual speaker and the first virtual speaker signal; generating a residual signal based on the first scene audio signal and the second scene audio signal; and encoding the first virtual speaker signal and the residual signal, to produce encoded signals, and writing the encoded signals into a bitstream.
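
    The encoding steps in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the scene audio signal is a list of channels, that each virtual speaker's attribute information is a plain coefficient vector, and that the target speaker is the one capturing the most signal energy. The names `project` and `encode_frame` are hypothetical.

```python
def project(scene, coeffs):
    """Least-squares gain of each time sample of `scene` along `coeffs`."""
    norm = sum(c * c for c in coeffs)
    return [
        sum(c * channel[t] for c, channel in zip(coeffs, scene)) / norm
        for t in range(len(scene[0]))
    ]


def encode_frame(scene, speaker_set):
    """Return (target index, virtual speaker signal, residual signal)."""

    def captured_energy(coeffs):
        # Energy of the scene explained by this speaker (assumed criterion).
        norm = sum(c * c for c in coeffs)
        return norm * sum(s * s for s in project(scene, coeffs))

    target = max(range(len(speaker_set)),
                 key=lambda i: captured_energy(speaker_set[i]))
    coeffs = speaker_set[target]
    speaker_signal = project(scene, coeffs)
    # "Second scene audio signal": the scene rebuilt from the speaker signal.
    rebuilt = [[c * s for s in speaker_signal] for c in coeffs]
    residual = [[x - y for x, y in zip(ch, rb)]
                for ch, rb in zip(scene, rebuilt)]
    return target, speaker_signal, residual
```

    In this sketch the virtual speaker signal and the residual are what a downstream core codec would encode and write into the bitstream; that stage is omitted here.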

    METHOD AND APPARATUS FOR ENCODING THREE-DIMENSIONAL AUDIO SIGNAL, ENCODER, AND SYSTEM

    Publication number: US20240119950A1

    Publication date: 2024-04-11

    Application number: US18538708

    Filing date: 2023-12-13

    CPC classification number: G10L19/008 G10L25/21 H04S7/30

    Abstract: A method for encoding a three-dimensional audio signal is provided. In the method, an encoder obtains a current frame of a three-dimensional audio signal and obtains the coding efficiency of an initial virtual speaker for the current frame based on that frame. When the coding efficiency meets a preset condition, the encoder determines an updated virtual speaker for the current frame from a set of candidate virtual speakers and encodes the current frame based on the updated virtual speaker, to obtain a first bitstream. When the coding efficiency does not meet the preset condition, the encoder encodes the current frame based on the initial virtual speaker, to obtain a second bitstream.

    THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    Publication number: US20240087579A1

    Publication date: 2024-03-14

    Application number: US18511061

    Filing date: 2023-11-16

    CPC classification number: G10L19/008 H04S7/30 H04S2420/11

    Abstract: This application discloses a three-dimensional audio signal coding method and apparatus, and an encoder, and relates to the multimedia field. In the method, the encoder determines a first quantity of virtual speakers and a first quantity of vote values based on a current frame of a three-dimensional audio signal, a candidate virtual speaker set, and a voting round quantity. The encoder then selects a second quantity of representative virtual speakers for the current frame from the first quantity of virtual speakers based on the first quantity of vote values, and encodes the current frame based on the second quantity of representative virtual speakers to obtain a bitstream. This achieves efficient data compression.
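
    One way to picture voting over several rounds is the sketch below. The abstract does not say how a vote value is computed, so this assumes a matching-pursuit-style scheme: in each round the candidate most correlated with the remaining signal wins, receives a vote equal to that correlation, and has its contribution removed. The function names are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def collect_votes(frame, candidates, rounds):
    """Map candidate index -> accumulated vote value over `rounds` rounds."""
    remaining = list(frame)
    votes = {}
    for _ in range(rounds):
        scores = [abs(dot(remaining, c)) / math.sqrt(dot(c, c))
                  for c in candidates]
        winner = max(range(len(candidates)), key=lambda i: scores[i])
        if scores[winner] == 0.0:
            break  # nothing left to explain
        votes[winner] = votes.get(winner, 0.0) + scores[winner]
        # Remove the winner's contribution before the next round.
        c = candidates[winner]
        gain = dot(remaining, c) / dot(c, c)
        remaining = [r - gain * x for r, x in zip(remaining, c)]
    return votes


def select_representatives(votes, count):
    """Pick the `count` speakers with the largest vote values."""
    return sorted(votes, key=lambda i: (-votes[i], i))[:count]
```

    Here the keys of `votes` play the role of the "first quantity" of virtual speakers, and `count` is the "second quantity" of representative speakers.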

    THREE-DIMENSIONAL AUDIO SIGNAL PROCESSING METHOD AND APPARATUS

    Publication number: US20240105187A1

    Publication date: 2024-03-28

    Application number: US18521944

    Filing date: 2023-11-28

    CPC classification number: G10L19/008 G10L19/02 H04S7/30 H04S2420/11

    Abstract: Embodiments of this application disclose a three-dimensional audio signal processing method and apparatus, to implement sound field classification of a three-dimensional audio signal, to accurately identify the three-dimensional audio signal. An embodiment of this application provides a three-dimensional audio signal processing method, including: performing linear decomposition on a current frame of a three-dimensional audio signal, to obtain a linear decomposition result; obtaining, based on the linear decomposition result, a sound field classification parameter corresponding to the current frame; and determining a sound field classification result of the current frame based on the sound field classification parameter.

    THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    Publication number: US20240087578A1

    Publication date: 2024-03-14

    Application number: US18511025

    Filing date: 2023-11-16

    CPC classification number: G10L19/008 G10L19/167 H04S7/00 H04S2420/11

    Abstract: A three-dimensional audio signal coding method, apparatus, and encoder are described. In the method, after obtaining a first correlation between a current frame of a three-dimensional audio signal and a representative virtual speaker set for a previous frame, the encoder determines whether the first correlation satisfies a reuse condition. The first correlation is used to determine whether to reuse the representative virtual speaker set for the previous frame when the current frame is encoded. When the first correlation satisfies the reuse condition, the encoder encodes the current frame based on the representative virtual speaker set for the previous frame, to obtain a bitstream. A virtual speaker in the representative virtual speaker set for the previous frame is a virtual speaker used for encoding the previous frame of the three-dimensional audio signal.
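
    The reuse decision can be sketched as follows. The abstract does not define the correlation measure or the reuse condition, so this assumes the correlation is the largest normalized absolute correlation between the frame and any previous-frame speaker, and that the condition is a fixed threshold. The threshold of 0.8 and the `search` callback are hypothetical.

```python
import math


def first_correlation(frame, previous_set):
    """Largest |cosine similarity| between the frame and a previous speaker."""
    frame_norm = math.sqrt(sum(x * x for x in frame))
    best = 0.0
    for coeffs in previous_set:
        norm = frame_norm * math.sqrt(sum(c * c for c in coeffs))
        if norm > 0.0:
            dot = sum(x * c for x, c in zip(frame, coeffs))
            best = max(best, abs(dot) / norm)
    return best


def choose_speaker_set(frame, previous_set, search, threshold=0.8):
    """Return (speaker set, reused flag) for the current frame."""
    if first_correlation(frame, previous_set) >= threshold:
        return previous_set, True   # skip the speaker search entirely
    return search(frame), False     # fall back to a fresh search
```

    The point of the design is that the correlation check is much cheaper than a full search over the candidate set, so frames that resemble their predecessor avoid that cost.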

    THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    Publication number: US20240079017A1

    Publication date: 2024-03-07

    Application number: US18509653

    Filing date: 2023-11-15

    Abstract: A three-dimensional audio signal encoding method and apparatus, and an encoder are provided, and relate to the multimedia field. In the method, the encoder obtains a first quantity of current-frame initial vote values for a current frame of a three-dimensional audio signal. Then, the encoder obtains, based on the first quantity of current-frame initial vote values and a sixth quantity of previous-frame final vote values, a seventh quantity of current-frame final vote values that are of a seventh quantity of virtual loudspeakers and that correspond to the current frame. Further, the encoder selects a second quantity of current-frame representative virtual loudspeakers from the seventh quantity of virtual loudspeakers based on the seventh quantity of current-frame final vote values. The encoder encodes the current frame based on the second quantity of current-frame representative virtual loudspeakers, to obtain a bitstream.
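
    A minimal sketch of combining current-frame and previous-frame votes is given below. The abstract does not state the combination rule, so this assumes a weighted sum over the union of loudspeakers that received votes in either frame. The weight of 0.5 and the function names are hypothetical.

```python
def final_votes(initial, previous_final, weight=0.5):
    """Combine current initial votes with previous-frame final votes.

    Both arguments map a virtual loudspeaker index to a vote value.
    """
    speakers = sorted(set(initial) | set(previous_final))
    return {
        s: initial.get(s, 0.0) + weight * previous_final.get(s, 0.0)
        for s in speakers
    }


def pick_representatives(votes, count):
    """Select the `count` loudspeakers with the highest final votes."""
    return sorted(votes, key=lambda s: (-votes[s], s))[:count]
```

    Carrying vote values across frames in this way tends to keep the selected loudspeakers stable from one frame to the next.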

    METHOD AND APPARATUS FOR DETERMINING VIRTUAL SPEAKER SET

    Publication number: US20230412981A1

    Publication date: 2023-12-21

    Application number: US18241698

    Filing date: 2023-09-01

    CPC classification number: H04R5/02 H04S2420/11 H04R2205/024

    Abstract: This application provides a method and an apparatus for determining a virtual speaker set. The method for determining a virtual speaker set includes: determining a target virtual speaker from F preset virtual speakers based on a to-be-processed audio signal, where each of the F virtual speakers corresponds to S virtual speakers, F is a positive integer, and S is a positive integer greater than 1; and obtaining, from a preset virtual speaker distribution table, respective position information of S virtual speakers corresponding to the target virtual speaker, where the virtual speaker distribution table includes position information of K virtual speakers, the position information includes an elevation angle index and an azimuth angle index, K is a positive integer greater than 1, F≤K, and F×S≥K. This application can improve audio signal playback effect.

    THREE-DIMENSIONAL AUDIO SIGNAL PROCESSING METHOD AND APPARATUS

    Publication number: US20240112684A1

    Publication date: 2024-04-04

    Application number: US18532085

    Filing date: 2023-12-07

    CPC classification number: G10L19/002 G10L19/008 H04S7/00

    Abstract: Embodiments of this application disclose a three-dimensional audio signal processing method and apparatus, to implement bit allocation of a signal. The method includes: performing spatial coding on a to-be-coded three-dimensional audio signal, to obtain a transmission channel signal and transmission channel attribute information, where the transmission channel signal includes at least one virtual speaker signal group and at least one residual signal group; and determining a bit allocation ratio of the virtual speaker signal group and a bit allocation ratio of the residual signal group based on the transmission channel attribute information.
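
    The bit allocation step can be illustrated with the sketch below. The abstract does not say what the transmission channel attribute information contains, so this assumes it is the energy of each signal group and that bits are split in proportion to energy, with a floor so that neither group is starved. The floor of 0.1 and the function names are hypothetical.

```python
def allocation_ratios(speaker_energy, residual_energy, floor=0.1):
    """Return (virtual speaker ratio, residual ratio), summing to 1."""
    total = speaker_energy + residual_energy
    if total <= 0.0:
        return 0.5, 0.5  # silent frame: split evenly
    ratio = min(max(speaker_energy / total, floor), 1.0 - floor)
    return ratio, 1.0 - ratio


def split_bits(total_bits, ratios):
    """Turn the ratios into integer bit budgets for the two groups."""
    speaker_bits = round(total_bits * ratios[0])
    return speaker_bits, total_bits - speaker_bits
```

    For example, a frame whose virtual speaker group carries three times the energy of its residual group would give the speaker group three quarters of the bit budget under these assumptions.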

    THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER

    Publication number: US20240087580A1

    Publication date: 2024-03-14

    Application number: US18511191

    Filing date: 2023-11-16

    Abstract: This application discloses a three-dimensional audio signal coding method. After obtaining a fourth quantity of coefficients for a current frame of a three-dimensional audio signal and frequency domain feature values of the fourth quantity of coefficients, an encoder selects a third quantity of representative coefficients from the fourth quantity of coefficients based on the frequency domain feature values of the fourth quantity of coefficients, and selects a second quantity of representative virtual speakers for the current frame from a candidate virtual speaker set based on the third quantity of representative coefficients, and then encodes the current frame based on the second quantity of representative virtual speakers for the current frame to obtain a bitstream. The encoder selects the representative virtual speakers from the candidate virtual speaker set by using a small quantity of representative coefficients to represent all coefficients.

    AUDIO ENCODING AND DECODING METHOD AND APPARATUS

    Publication number: US20230298600A1

    Publication date: 2023-09-21

    Application number: US18202553

    Filing date: 2023-05-26

    CPC classification number: G10L19/008

    Abstract: An audio encoding and decoding method and apparatus, and a non-transitory readable storage medium are provided. The encoding method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a current scene audio signal; generating a first virtual speaker signal based on the current scene audio signal and attribute information of the first target virtual speaker; and encoding the first virtual speaker signal to obtain a bitstream. According to the encoding method, an amount of encoded data is reduced, to improve encoding efficiency.
