Patent search ap:("HUAWEI TECHNOLOGIES CO. Page LTD.") AND inv:"Shuai LIU"

1.

发明公开
AUDIO ENCODING AND DECODING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20230298601A1

公开(公告)日：2023-09-21

申请号：US18202930

申请日：2023-05-28

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: G10L19/008

CPC classification number: G10L19/008

Abstract: Audio encoding and decoding methods and apparatuses are disclosed, to reduce an amount of encoded and decoded data, so as to improve encoding and decoding efficiency. The method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a first scene audio signal; generating a first virtual speaker signal based on the first scene audio signal and attribute information of the first target virtual speaker; obtaining a second scene audio signal using the attribute information of the first target virtual speaker and the first virtual speaker signal; generating a residual signal based on the first scene audio signal and the second scene audio signal; and encoding the first virtual speaker signal and the residual signal, to produce encoded signals, and writing the encoded signals into a bitstream.

2.

发明公开
METHOD AND APPARATUS FOR ENCODING THREE-DIMENSIONAL AUDIO SIGNAL, ENCODER, AND SYSTEM 审中-公开

公开(公告)号：US20240119950A1

公开(公告)日：2024-04-11

申请号：US18538708

申请日：2023-12-13

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bingyin XIA , Bin WANG , Zhe WANG

IPC: G10L19/008 , G10L25/21 , H04S7/00

CPC classification number: G10L19/008 , G10L25/21 , H04S7/30

Abstract: A method for encoding a three-dimensional audio signal is provided. The method includes: An encoder obtains a current frame of a three-dimensional audio signal; obtains coding efficiency of an initial virtual speaker for the current frame based on the current frame of the three-dimensional audio signal; and when the coding efficiency of the initial virtual speaker for the current frame meets a preset condition, determines an updated virtual speaker for the current frame from a set of candidate virtual speakers; encodes the current frame based on the updated virtual speaker for the current frame, to obtain a first bitstream; or when the coding efficiency of the initial virtual speaker for the current frame does not meet the preset condition, encodes the current frame based on the initial virtual speaker for the current frame, to obtain a second bitstream.

3.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER 审中-公开

公开(公告)号：US20240087579A1

公开(公告)日：2024-03-14

申请号：US18511061

申请日：2023-11-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: G10L19/008 , H04S7/00

CPC classification number: G10L19/008 , H04S7/30 , H04S2420/11

Abstract: This application discloses a three-dimensional audio signal coding method and apparatus, and an encoder, and relates to the multimedia field. The method includes: After determining a first quantity of virtual speakers and a first quantity of vote values based on a current frame of a three-dimensional audio signal, a candidate virtual speaker set, and a voting round quantity, the encoder selects a second quantity of representative virtual speakers for the current frame from the first quantity of virtual speakers based on the first quantity of vote values, and further encodes the current frame based on the second quantity of representative virtual speakers for the current frame to obtain a bitstream. This achieves efficient data compression.

4.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL PROCESSING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20240105187A1

公开(公告)日：2024-03-28

申请号：US18521944

申请日：2023-11-28

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: G10L19/008 , G10L19/02 , H04S7/00

CPC classification number: G10L19/008 , G10L19/02 , H04S7/30 , H04S2420/11

Abstract: Embodiments of this application disclose a three-dimensional audio signal processing method and apparatus, to implement sound field classification of a three-dimensional audio signal, to accurately identify the three-dimensional audio signal. An embodiment of this application provides a three-dimensional audio signal processing method, including: performing linear decomposition on a current frame of a three-dimensional audio signal, to obtain a linear decomposition result; obtaining, based on the linear decomposition result, a sound field classification parameter corresponding to the current frame; and determining a sound field classification result of the current frame based on the sound field classification parameter.

5.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER 审中-公开

公开(公告)号：US20240087578A1

公开(公告)日：2024-03-14

申请号：US18511025

申请日：2023-11-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: G10L19/008 , G10L19/16 , H04S7/00

CPC classification number: G10L19/008 , G10L19/167 , H04S7/00 , H04S2420/11

Abstract: A three-dimensional audio signal coding method, apparatus, and encoder are described. The method includes, after obtaining a first correlation between a current frame of a three-dimensional audio signal and a representative virtual speaker set for a previous frame, the encoder determines whether the first correlation satisfies a reuse condition, where the first correlation is used to determine whether to reuse the representative virtual speaker set for the previous frame when the current frame is encoded. The method further encodes the current frame based on the representative virtual speaker set for the previous frame when the first correlation satisfies the reuse condition, to obtain a bitstream. A virtual speaker in the representative virtual speaker set for the previous frame is a virtual speaker used for encoding the previous frame of the three-dimensional audio signal.

6.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER 审中-公开

公开(公告)号：US20240079017A1

公开(公告)日：2024-03-07

申请号：US18509653

申请日：2023-11-15

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG

IPC: G10L19/008 , G10L19/16 , H04S7/00

CPC classification number: G10L19/008 , G10L19/167 , H04S7/302 , H04S2400/11 , H04S2420/11

Abstract: A three-dimensional audio signal encoding method and apparatus, and an encoder are provided, and relate to the multimedia field. The method includes: The encoder obtains a first quantity of current-frame initial vote values for a current frame of a three-dimensional audio signal. Then, the encoder obtains, based on the first quantity of current-frame initial vote values and a sixth quantity of previous-frame final vote values, a seventh quantity of current-frame final vote values that are of a seventh quantity of virtual loudspeakers and that correspond to the current frame. Further, the encoder selects a second quantity of current-frame representative virtual loudspeakers from the seventh quantity of virtual loudspeakers based on the seventh quantity of current-frame final vote values. The encoder encodes the current frame based on the second quantity of current-frame representative virtual loudspeakers, to obtain a bitstream.

7.

发明公开
METHOD AND APPARATUS FOR DETERMINING VIRTUAL SPEAKER SET 审中-公开

公开(公告)号：US20230412981A1

公开(公告)日：2023-12-21

申请号：US18241698

申请日：2023-09-01

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: H04R5/02

CPC classification number: H04R5/02 , H04S2420/11 , H04R2205/024

Abstract: This application provides a method and an apparatus for determining a virtual speaker set. The method for determining a virtual speaker set includes: determining a target virtual speaker from F preset virtual speakers based on a to-be-processed audio signal, where each of the F virtual speakers corresponds to S virtual speakers, F is a positive integer, and S is a positive integer greater than 1; and obtaining, from a preset virtual speaker distribution table, respective position information of S virtual speakers corresponding to the target virtual speaker, where the virtual speaker distribution table includes position information of K virtual speakers, the position information includes an elevation angle index and an azimuth angle index, K is a positive integer greater than 1, F≤K, and F×S≥K. This application can improve audio signal playback effect.

8.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL PROCESSING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20240112684A1

公开(公告)日：2024-04-04

申请号：US18532085

申请日：2023-12-07

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Shuai LIU , Yuan GAO , Bingyin XIA , Bin WANG , Zhe WANG

IPC: G10L19/002 , G10L19/008 , H04S7/00

CPC classification number: G10L19/002 , G10L19/008 , H04S7/00

Abstract: Embodiments of this application disclose a three-dimensional audio signal processing method and apparatus, to implement bit allocation of a signal. The method includes: performing spatial coding on a to-be-coded three-dimensional audio signal, to obtain a transmission channel signal and transmission channel attribute information, where the transmission channel signal includes at least one virtual speaker signal group and at least one residual signal group; and determining a bit allocation ratio of the virtual speaker signal group and a bit allocation ratio of the residual signal group based on the transmission channel attribute information.

9.

发明公开
THREE-DIMENSIONAL AUDIO SIGNAL CODING METHOD AND APPARATUS, AND ENCODER 审中-公开

公开(公告)号：US20240087580A1

公开(公告)日：2024-03-14

申请号：US18511191

申请日：2023-11-16

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG

IPC: G10L19/008 , G10L19/02 , G10L19/16 , H04S7/00

CPC classification number: G10L19/008 , G10L19/0204 , G10L19/167 , H04S7/00 , H04S2420/11

Abstract: This application discloses a three-dimensional audio signal coding method. After obtaining a fourth quantity of coefficients for a current frame of a three-dimensional audio signal and frequency domain feature values of the fourth quantity of coefficients, an encoder selects a third quantity of representative coefficients from the fourth quantity of coefficients based on the frequency domain feature values of the fourth quantity of coefficients, and selects a second quantity of representative virtual speakers for the current frame from a candidate virtual speaker set based on the third quantity of representative coefficients, and then encodes the current frame based on the second quantity of representative virtual speakers for the current frame to obtain a bitstream. The encoder selects the representative virtual speakers from the candidate virtual speaker set by using a small quantity of representative coefficients to represent all coefficients.

10.

发明公开
AUDIO ENCODING AND DECODING METHOD AND APPARATUS 审中-公开

公开(公告)号：US20230298600A1

公开(公告)日：2023-09-21

申请号：US18202553

申请日：2023-05-26

Applicant: HUAWEI TECHNOLOGIES CO., LTD.

Inventor： Yuan GAO , Shuai LIU , Bin WANG , Zhe WANG , Tianshu QU , Jiahao XU

IPC: G10L19/008

CPC classification number: G10L19/008

Abstract: An audio encoding and decoding method and apparatus, and a non-transitory readable storage medium are provided. The encoding method includes: selecting a first target virtual speaker from a preset virtual speaker set based on a current scene audio signal; generating a first virtual speaker signal based on the current scene audio signal and attribute information of the first target virtual speaker; and encoding the first virtual speaker signal to obtain a bitstream. According to the encoding method, an amount of encoded data is reduced, to improve encoding efficiency.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification