-
Publication number: US20230335143A1
Publication date: 2023-10-19
Application number: US18044666
Filing date: 2021-08-19
Applicant: Nokia Technologies OY
Inventor: Tapani PIHLAJAKUJA , Adriana VASILACHE , Mikko-Ville LAITINEN
IPC: G10L19/008
CPC classification number: G10L19/008 , H04S7/305
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to convert two or more energy ratios associated with a time frequency tile of one or more audio signals to a further energy ratio parameter which is related to the two or more energy ratios; quantize the further energy ratio parameter using a first quantizer; determine a distribution factor of energy ratios dependent on a ratio of a first of the two or more energy ratios to the sum of the two or more energy ratios; select a further quantizer from a plurality of further quantizers using the quantized further energy ratio parameter; and quantize the distribution factor of energy ratios using the selected further quantizer.
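The two-stage scheme in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the "further energy ratio parameter" is the sum of two ratios, that the ratios sum to at most 1, and that both quantizers are uniform. The names `quantize_uniform`, `encode_two_ratios`, `TOTAL_BITS` and the table `DIST_BITS_BY_TOTAL_INDEX` are hypothetical.

```python
TOTAL_BITS = 3  # assumed bit width of the first quantizer

# Hypothetical table: quantized total-ratio index -> bits spent on the
# distribution factor. A small total ratio gets a coarser second quantizer.
DIST_BITS_BY_TOTAL_INDEX = [0, 1, 1, 2, 2, 3, 3, 3]


def quantize_uniform(value, bits, lo=0.0, hi=1.0):
    """Uniform scalar quantizer on [lo, hi]; returns (index, reconstruction)."""
    steps = (1 << bits) - 1
    if steps == 0:
        # Zero bits: a single reconstruction point in the middle of the range.
        return 0, (lo + hi) / 2
    clipped = min(max(value, lo), hi)
    index = round((clipped - lo) / (hi - lo) * steps)
    return index, lo + index * (hi - lo) / steps


def encode_two_ratios(ratio_1, ratio_2):
    """Return (total_index, distribution_index, distribution_bits)."""
    # Step 1: combine the ratios (assumed: their sum) and quantize the result.
    total = ratio_1 + ratio_2
    total_index, _ = quantize_uniform(total, TOTAL_BITS)
    # Step 2: distribution factor = first ratio / sum of ratios.
    distribution = ratio_1 / total if total > 0 else 0.5
    # Step 3: the quantized total selects the quantizer for the factor.
    dist_bits = DIST_BITS_BY_TOTAL_INDEX[total_index]
    dist_index, _ = quantize_uniform(distribution, dist_bits)
    return total_index, dist_index, dist_bits
```

Because the second quantizer is chosen from the quantized total, a decoder can pick the same quantizer after decoding the first index, so no extra signalling is needed in this sketch.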
-
Publication number: US20230047237A1
Publication date: 2023-02-16
Application number: US17791115
Filing date: 2020-12-07
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: G10L19/008 , G10L19/002
Abstract: An apparatus comprising means configured to obtain direction parameter values (108) associated with at least two time-frequency parts (202) of at least one audio signal (102); and encode the obtained direction parameter values based on a codebook (206), wherein the codebook comprises two or more quantization levels arranged such that a first quantization level comprises a first set of quantization values, and a second or succeeding quantization level comprises a second or further set of quantization values and preceding quantization level quantization values.
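A codebook whose later levels contain every value of the earlier levels can be sketched as follows. This is an illustration under assumptions: the azimuth grids in degrees and the helper names `build_embedded_codebook` and `quantize_direction` are hypothetical, and azimuth wrap-around at ±180° is ignored for brevity.

```python
def build_embedded_codebook(level_sets):
    """Level k holds its own new values plus every value of the levels before it."""
    levels, accumulated = [], []
    for new_values in level_sets:
        accumulated = sorted(set(accumulated) | set(new_values))
        levels.append(accumulated)
    return levels


def quantize_direction(value, codebook, level):
    """Return (index, value) of the nearest codebook entry at the given level."""
    entries = codebook[level]
    index = min(range(len(entries)), key=lambda i: abs(entries[i] - value))
    return index, entries[index]


# Hypothetical azimuth values in degrees; the second level refines the first.
CODEBOOK = build_embedded_codebook([
    [-90.0, 0.0, 90.0, 180.0],
    [-135.0, -45.0, 45.0, 135.0],
])
```

For example, `quantize_direction(50.0, CODEBOOK, 0)` picks 90.0 from the coarse level, while level 1 picks 45.0 from the refined level, which still contains all coarse values.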
-
Publication number: US20220343928A1
Publication date: 2022-10-27
Application number: US17642288
Filing date: 2020-09-09
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: G10L19/032 , G10L19/008 , G10L19/00
Abstract: An apparatus comprising means configured to: generate spatial audio signal directional metadata parameters for a block of time-frequencies; generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a first quantization resolution (203); compare a number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution against a determined number of bits; output or store the encoded spatial audio signal directional metadata parameters for the block of time-frequencies (108) based on the first quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is less than the determined number of bits (217); generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a second quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies (108) based on the first quantization resolution is more than the determined number of bits and a difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is within a determined threshold (217); and generate encoded spatial audio signal directional metadata parameters (108) for the block of time-frequencies based on a third quantization resolution when the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is more than the determined number of bits and the difference between the determined number of bits and the number of bits used for the encoded spatial audio signal directional parameters (108) for the block of time-frequencies based on the first quantization resolution is greater than the determined threshold, wherein the third quantization resolution is determined such that a number of bits used for the encoded spatial audio signal directional parameters for the block of time-frequencies based on the third quantization resolution is always equal to or less than the determined number of bits (217).
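The three-way choice of quantization resolution against a bit budget can be sketched as below. This is a simplified illustration, not the claimed encoder: it assumes a toy fixed-rate encoder over values normalized to [0, 1), takes the second resolution to be one bit coarser than the first, and treats a frame that exactly meets the budget as fitting. `encode_fixed`, `select_resolution`, `first_bits` and `threshold_bits` are hypothetical names.

```python
def encode_fixed(values, bits_per_value):
    """Toy fixed-rate encoder for values in [0, 1); returns (indices, bits used)."""
    levels = 1 << bits_per_value
    indices = [min(int(v * levels), levels - 1) for v in values]
    return indices, bits_per_value * len(values)


def select_resolution(values, budget_bits, first_bits=5, threshold_bits=8):
    """Return (bits_per_value, indices, bits_used) for a non-empty block."""
    indices, used = encode_fixed(values, first_bits)
    if used <= budget_bits:
        # First resolution fits the budget: output it as is.
        return first_bits, indices, used
    if used - budget_bits <= threshold_bits:
        # Slightly over budget: fall back to the second, coarser resolution.
        second_bits = first_bits - 1
        indices, used = encode_fixed(values, second_bits)
        return second_bits, indices, used
    # Far over budget: derive a third resolution that always fits.
    third_bits = budget_bits // len(values)
    indices, used = encode_fixed(values, third_bits)
    return third_bits, indices, used
```

With four values and `first_bits=5` the first pass costs 20 bits, so a 24-bit budget keeps the first resolution, a 16-bit budget triggers the second, and a 10-bit budget forces the third at 2 bits per value (8 bits in total).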
-
Publication number: US20220335956A1
Publication date: 2022-10-20
Application number: US17635593
Filing date: 2020-07-27
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: G10L19/008 , G10L19/035
Abstract: A method for spatial audio signal encoding comprising: obtaining, for a first frame, a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; determining whether, for a preceding frame, any of the plurality of audio direction parameters was differentially encoded based on a difference between the preceding frame parameter elevation value and a further preceding frame parameter elevation value and the preceding frame parameter azimuth value and a further preceding frame parameter azimuth value; generating, for any audio direction parameter which was not differentially encoded in the considered preceding frame, a differential parameter value based on a difference between the frame parameter elevation value and a preceding frame parameter elevation value and a difference between the frame parameter azimuth value and a preceding frame parameter azimuth value; generating for each of the plurality of audio direction parameters a difference parameter value based on a difference between the audio direction parameter and a rotated derived audio direction parameter; quantizing the difference between the audio direction parameter and a rotated derived audio direction parameter and the differential parameter value; and selecting for each of the plurality of audio direction parameters, either of the quantized difference or differential parameter value.
-
Publication number: US20220279299A1
Publication date: 2022-09-01
Application number: US17628792
Filing date: 2020-06-15
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE
IPC: H04S7/00 , G10L19/035 , G10L19/008 , H04S3/00
Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding configured to derive for each of a plurality of audio direction parameters a corresponding derived audio direction parameter comprising an elevation value and an azimuth value. Each derived audio direction parameter is rotated by the azimuth value of an audio direction parameter in the first position of the plurality of audio direction parameters. The positions of some of the audio direction parameters are changed, followed by determining, for each of the plurality of audio direction parameters, a difference between the audio direction parameter and a corresponding rotated derived audio direction parameter. The difference for each of the plurality of audio direction parameters is then quantised.
-
Publication number: US20180226082A1
Publication date: 2018-08-09
Application number: US15506416
Filing date: 2014-08-28
Applicant: Nokia Technologies Oy
Inventor: Anssi RAMO , Adriana VASILACHE , Lasse Juhani LAAKSONEN
IPC: G10L19/032 , G10L19/008 , G10L19/04
CPC classification number: G10L19/032 , G10L19/008 , G10L19/04
Abstract: A technique for audio encoding is provided. According to an example embodiment, the technique comprises deriving a first quantization error that is descriptive of an error resulting from a non-predictive quantization of an audio parameter of an audio signal segment, deriving a second quantization error that is descriptive of an error resulting from a predictive quantization of said audio parameter of said audio signal segment, determining whether said second quantization error exceeds said first quantization error by at least an adaptive margin that is dependent on the number of consecutive audio signal segments that precede said audio signal segment in which said audio parameter is provided quantized with said predictive quantization, providing said audio parameter of said audio segment quantized with said non-predictive quantization as part of an encoded audio signal at least in case the outcome of said determination is affirmative, and providing otherwise said audio parameter of said audio segment quantized with said predictive quantization as part of an encoded audio signal.
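The selection rule between predictive and non-predictive quantization can be sketched as follows. The abstract only states that the margin depends on the length of the preceding predictive streak; the linearly shrinking margin used here, its `base` and `step` values, and the names `adaptive_margin` and `choose_quantizer` are assumptions for illustration.

```python
def adaptive_margin(predictive_streak, base=1.0, step=0.25):
    """Assumed margin: shrinks as more consecutive segments used prediction."""
    return max(base - step * predictive_streak, 0.0)


def choose_quantizer(err_non_predictive, err_predictive, predictive_streak):
    """Return (chosen mode, updated predictive streak length)."""
    margin = adaptive_margin(predictive_streak)
    # Affirmative outcome: the predictive error exceeds the non-predictive
    # error by at least the margin, so the non-predictive result is provided.
    if err_predictive >= err_non_predictive + margin:
        return "non_predictive", 0
    return "predictive", predictive_streak + 1
```

Under this assumed margin, a long predictive streak makes the switch to non-predictive quantization easier, which limits how far a frame loss could propagate through the prediction chain.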
-
Publication number: US20150332677A1
Publication date: 2015-11-19
Application number: US14710284
Filing date: 2015-05-12
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE , Lasse Juhani LAAKSONEN , Anssi Sakari RÄMÖ
IPC: G10L19/002 , G10L19/02
CPC classification number: G10L19/24 , G10L19/002 , G10L19/02 , G10L19/22 , G10L25/78
Abstract: There is inter alia a method comprising: receiving a request to change the coding rate of a multimode audio codec; determining that the request corresponds to a coding rate of another mode of operation of the multimode audio codec; determining a frame of an input audio signal of the multimode audio codec to be an active region of the audio signal; maintaining a current operating mode of the multimode audio codec; and reducing the coding rate of the multimode audio codec to a coding rate lower than the requested coding rate.
-
Publication number: US20230335141A1
Publication date: 2023-10-19
Application number: US17788155
Filing date: 2020-12-11
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Adriana VASILACHE , Mikko-Ville LAITINEN , Anssi RÄMÖ , Lasse Juhani LAAKSONEN
IPC: G10L19/008 , G10L19/032
CPC classification number: G10L19/008 , G10L19/032
Abstract: An apparatus comprising means configured to: obtain at least one parameter value (106) associated with at least two time-frequency parts of at least one audio signal (104); obtain at least one similarity value based on the at least one parameter value (106) associated with the at least two time-frequency parts of at least one audio signal (104); determine at least one group of time-frequency parts from the at least two time-frequency parts of at least one audio signal (104), the at least one group of time-frequency parts based on the at least one similarity value; and generate for the at least one group of time-frequency parts at least one associated group parameter (204), the at least one group parameter (204) based on the at least one parameter value (106) associated with the time-frequency parts.
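Grouping time-frequency parts by similarity and deriving one parameter per group can be sketched as below. This is an illustration under assumptions: similarity is taken to be the absolute difference between neighbouring parameter values, groups are contiguous runs, and the group parameter is the plain mean. `group_tiles` and `max_difference` are hypothetical names.

```python
def group_tiles(values, max_difference=10.0):
    """Group neighbouring tiles with similar values.

    Returns a list of (tile indices, group parameter) pairs.
    """
    if not values:
        return []
    groups = [[0]]
    for i in range(1, len(values)):
        # Assumed similarity measure: distance to the previous tile's value.
        if abs(values[i] - values[i - 1]) <= max_difference:
            groups[-1].append(i)
        else:
            groups.append([i])
    # Assumed group parameter: the mean of the member values.
    return [(g, sum(values[i] for i in g) / len(g)) for g in groups]
```

For the values `[10.0, 14.0, 60.0, 62.0, 64.0]` this yields two groups, so two group parameters would be transmitted instead of five per-tile values.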
-
Publication number: US20230197087A1
Publication date: 2023-06-22
Application number: US17998866
Filing date: 2021-04-15
Applicant: Nokia Technologies Oy
Inventor: Tapani Johannes PIHLAJAKUJA , Adriana VASILACHE
IPC: G10L19/008 , G10L19/032 , G10L25/21
CPC classification number: G10L19/008 , G10L19/032 , G10L25/21 , G10L2019/0004
Abstract: An apparatus comprising means configured to: obtain at least one direction parameter value for a time-frequency part of at least one audio signal (301); obtain at least one energy ratio for the time-frequency part (301), wherein each energy ratio is associated with a respective direction parameter value; generate respective at least one modified energy ratio from the at least one energy ratio for the time-frequency part (304); determine a quantization spatial resolution for encoding the at least one obtained direction parameter value based on the at least one modified energy ratio (305); and encode the obtained direction parameter values based on the quantization spatial resolution (306).
-
Publication number: US20230197086A1
Publication date: 2023-06-22
Application number: US17786088
Filing date: 2020-11-13
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Lasse LAAKSONEN , Adriana VASILACHE , Tapani PIHLAJAKUJA , Anssi RÄMÖ
IPC: G10L19/008 , H04S7/00
CPC classification number: G10L19/008 , H04S7/302 , H04S2420/03
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.