QUANTIZING SPATIAL AUDIO PARAMETERS
    1.
    发明公开

    公开(公告)号:US20230335143A1

    公开(公告)日:2023-10-19

    申请号:US18044666

    申请日:2021-08-19

    CPC classification number: G10L19/008 H04S7/305

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to convert two or more energy ratios associated with a time frequency tile of one or more audio signals to a further energy ratio parameter which is related to the two or more energy ratios; quantize the further energy ratio parameter using a first quantizer; determine a distribution factor of energy ratios dependent on a ratio of a first of the two or more energy ratios to the sum of the two or more energy ratios; select a further quantizer from a plurality of further quantizers using the quantized further energy ratio parameter; and quantize the distribution factor of energy ratios using the selected further quantizer.

    SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230047237A1

    公开(公告)日:2023-02-16

    申请号:US17791115

    申请日:2020-12-07

    Abstract: An apparatus comprising means configured to obtain direction parameter values (108) associated with at least two time-frequency parts (202) of at least one audio signal (102); and encode the obtained direction parameter values based on a codebook (206), wherein the codebook comprises two or more quantization levels arranged such that a first quantization level comprises a first set of quantization values, and a second or succeeding quantization level comprises a second or further set of quantization values and preceding quantization level quantization values.

    DETERMINATION OF SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20220343928A1

    公开(公告)日:2022-10-27

    申请号:US17642288

    申请日:2020-09-09

    Abstract: An apparatus comprising means configured to: generate spatial audio signal directional metadata parameters for a block of time-frequencies; generate encoded spatial audio signal directional metadata parameters (108) for a block of time-frequencies based on a first quantization resolution (203); compare a number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution against a determined number of bits; output or store the encoded spatial audio signal directional metadata parameters for a block of time-frequencies (108) based on a first quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is less than a determined number of bits (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a second quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is more than the determined number of bits and a difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is less than a determined number of bits is within a determined threshold (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a third quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is more than the determined number of bits and the difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is greater than the determined threshold, wherein the third quantization resolution is determined such that a number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies based on the third quantization resolution is always equal to or less than the determined number of bits (217).

    QUANTIZATION OF SPATIAL AUDIO DIRECTION PARAMETERS

    公开(公告)号:US20220335956A1

    公开(公告)日:2022-10-20

    申请号:US17635593

    申请日:2020-07-27

    Abstract: A method for spatial audio signal encoding comprising: obtaining, for a first frame, a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; determining whether, for a preceding frame, any of the plurality of audio direction parameters was differentially encoded based on a difference between the preceding frame parameter elevation value and a further preceding frame parameter elevation value and the preceding frame parameter azimuth value and a further preceding frame parameter azimuth value; generating, for any audio direction parameter which was not differentially encoded in the considered preceding frame, a differential parameter value based on a difference between the frame parameter elevation value and a preceding frame parameter elevation value and a difference between the frame parameter azimuth value and a preceding frame parameter azimuth value; generating for each of the plurality of audio direction parameters a difference parameter value based on a difference between the audio direction parameter and a rotated derived audio direction parameter; quantizing the difference between the audio direction parameter and a rotated derived audio direction parameter and the differential parameter value; and selecting for each of the plurality of audio direction parameters, either of the quantized difference or differential parameter value.

    QUANTIZATION OF SPATIAL AUDIO DIRECTION PARAMETERS

    公开(公告)号:US20220279299A1

    公开(公告)日:2022-09-01

    申请号:US17628792

    申请日:2020-06-15

    Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding configured to derive for each of a plurality of audio direction parameters a corresponding derived audio direction parameter comprising an elevation value and an azimuth value. Each derived audio direction parameter is rotated by the azimuth value of an audio direction parameter in the first position of the plurality of audio direction parameters. The position of some of the audio direction parameters are changed followed by determining for each of the plurality audio direction parameters a difference between each audio direction parameter and a corresponding rotated derived audio direction parameter. The difference for each of the plurality of audio direction parameters is then quantised.

    AUDIO PARAMETER QUANTIZATION
    6.
    发明申请

    公开(公告)号:US20180226082A1

    公开(公告)日:2018-08-09

    申请号:US15506416

    申请日:2014-08-28

    CPC classification number: G10L19/032 G10L19/008 G10L19/04

    Abstract: A technique for audio encoding is provided. According to an example embodiment, the technique comprises deriving a first quantization error that is descriptive of an error resulting with a non-predictive quantization of an audio parameter of an audio signal segment, deriving a second quantization error that is descriptive of an error resulting with a predictive quantization of said audio parameter of said audio signal segment, determining whether said second quantization error exceeds said first quantization error by at least an adaptive margin that is dependent on the number of consecutive audio signal segments that precede said audio signal segment in which said audio parameter is provided quantized with said predictive quantization, providing said audio parameter of said audio segment quantized with said non-predictive quantization as part of an encoded audio signal at least in case the outcome of quantization quantization said determination is affirmative, and providing otherwise said audio parameter of said audio segment quantized with said predictive quantization as part of an encoded audio signal.

    AUDIO CODEC MODE SELECTOR
    7.
    发明申请
    AUDIO CODEC MODE SELECTOR 审中-公开
    音频编解码模式选择器

    公开(公告)号:US20150332677A1

    公开(公告)日:2015-11-19

    申请号:US14710284

    申请日:2015-05-12

    CPC classification number: G10L19/24 G10L19/002 G10L19/02 G10L19/22 G10L25/78

    Abstract: There is inter alia a method comprising: receiving a request to change the coding rate of a multimode audio codec; determining that the request corresponds to a coding rate of another mode of operation of the multimode audio codec; determining a frame of an input audio signal of the multimode audio codec to be an active region of the audio signal; maintaining a current operating mode of the multimode audio codec; and reducing the coding rate of the multimode audio codec to a coding rate lower than the requested coding rate.

    Abstract translation: 尤其涉及一种方法,包括:接收改变多模音频编解码器的编码率的请求; 确定所述请求对应于所述多模式音频编解码器的另一操作模式的编码率; 将所述多模式音频编解码器的输入音频信号的帧确定为所述音频信号的有效区域; 保持多模式音频编解码器的当前操作模式; 并将多模式音频编解码器的编码率降低到低于所请求编码速率的编码速率。

    SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230335141A1

    公开(公告)日:2023-10-19

    申请号:US17788155

    申请日:2020-12-11

    CPC classification number: G10L19/008 G10L19/032

    Abstract: An apparatus comprising means configured to: obtain at least one parameter value (106) associated with at least two time-frequency parts of at least one audio signal (104); obtain at least one similarity value based on the at least one parameter value (106) associated with the at least two time-frequency parts of at least one audio signal (104); determine at least one group of time-frequency parts from the at least two time-frequency parts of at least one audio signal (104), the at least one group of time-frequency parts based on the at least one similarity value; and generate for the at least one group of time-frequency parts at least one associated group parameter (204), the at least one group parameter (204) based on the at least one parameter value (106) associated with the time-frequency parts.

    SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230197087A1

    公开(公告)日:2023-06-22

    申请号:US17998866

    申请日:2021-04-15

    CPC classification number: G10L19/008 G10L19/032 G10L25/21 G10L2019/0004

    Abstract: An apparatus comprising means configured to: obtain at least one direction parameter value for a time-frequency part of at least one audio signal (301); obtain at least one energy ratio for the time-frequency part (301), wherein each energy ratio is associated with a respective direction parameter value; generate respective at least one modified energy ratio from the at least one energy ratio for the time-frequency part (304); determine a quantization spatial resolution for encoding the at least one obtained direction parameter value based on the at least one modified energy ratio (305); and encode the obtained direction parameter values based on the quantization spatial resolution (306).

    THE MERGING OF SPATIAL AUDIO PARAMETERS
    10.
    发明公开

    公开(公告)号:US20230197086A1

    公开(公告)日:2023-06-22

    申请号:US17786088

    申请日:2020-11-13

    CPC classification number: G10L19/008 H04S7/302 H04S2420/03

    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.

Patent Agency Ranking