-
Publication number: US20240127828A1
Publication date: 2024-04-18
Application number: US18263494
Filing date: 2021-01-29
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Adriana VASILACHE , Anssi RÄMÖ , Lasse LAAKSONEN , Tapani PIHLAJAKUJA , Mikko-Ville LAITINEN
IPC: G10L19/002 , G10L19/035 , G10L25/21
CPC classification number: G10L19/002 , G10L19/035 , G10L25/21
Abstract: An apparatus comprising means for: obtaining values for parameters representing an audio signal, the values comprising at least one directional value and at least one energy ratio value for each sub-band of at least two sub-bands of a frame of the audio signal; determining a penalty value for each sub-band; and on a sub-band by sub-band basis: selecting a sub-band based on the penalty value; and encoding, for the selected sub-band, the at least one directional value for each sub-band; distributing any bits allocated for encoding the selected sub-band at least one directional value which are not used in the encoding of the at least one directional value to succeeding selections of sub-bands.
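Illustrative sketch (not taken from the patent text): the sub-band-by-sub-band loop in this abstract can be pictured as below. The selection order (highest penalty first), the even split of unused bits across the sub-bands still to be encoded, and the `bits_needed` list standing in for a real direction encoder are all assumptions made for the example.

```python
def encode_by_penalty(penalties, allocated_bits, bits_needed):
    """Visit sub-bands in penalty order and pass unused bits to later ones.

    bits_needed[i] is a hypothetical stand-in for the number of bits a real
    direction encoder would consume for sub-band i.
    Returns (visit order, bits actually spent per sub-band).
    """
    budget = list(allocated_bits)
    # Assumption: the sub-band with the highest penalty is selected first.
    order = sorted(range(len(penalties)), key=lambda i: penalties[i], reverse=True)
    used = [0] * len(penalties)
    for pos, band in enumerate(order):
        used[band] = min(bits_needed[band], budget[band])
        spare = budget[band] - used[band]
        remaining = order[pos + 1:]
        if spare and remaining:
            # Assumption: spare bits are split as evenly as possible
            # among the sub-bands that have not been encoded yet.
            share, extra = divmod(spare, len(remaining))
            for k, later in enumerate(remaining):
                budget[later] += share + (1 if k < extra else 0)
    return order, used


order, used = encode_by_penalty([0.2, 0.9, 0.5], [10, 10, 10], [14, 6, 11])
print(order, used)
```

In this toy run the second sub-band needs fewer bits than it was given, so the later selections inherit the surplus and the total spent stays within the frame budget.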
-
Publication number: US20240079014A1
Publication date: 2024-03-07
Application number: US18261783
Filing date: 2021-01-18
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Adriana VASILACHE
IPC: G10L19/008 , G10L19/032 , H04S3/00 , H04S7/00
CPC classification number: G10L19/008 , G10L19/032 , H04S3/008 , H04S7/30 , H04S2400/01 , H04S2400/11 , H04S2420/03
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding configured to: determine, for two or more audio signals, a first spatial audio direction parameter and a second spatial audio direction parameter for providing spatial audio reproduction; quantize the first spatial audio direction parameter (301); transform the second spatial audio direction parameter to have an opposite spatial audio direction (303); determine a difference between the transformed second spatial audio direction parameter and the quantized first spatial audio direction parameter (305); and quantize the difference (307).
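Illustrative sketch (not taken from the patent text): the four steps (301 to 307) can be tried out on azimuth angles alone. Restricting the direction to azimuth in degrees, the uniform scalar quantizer, and the step sizes `step1` and `step2` are assumptions made for the example; the abstract does not specify them.

```python
def wrap_deg(angle):
    """Wrap an angle in degrees to the interval [-180, 180)."""
    return (angle + 180.0) % 360.0 - 180.0


def quantize(angle, step):
    """Uniform scalar quantizer (an assumption, not the patent's quantizer)."""
    return wrap_deg(round(angle / step) * step)


def encode_pair(az1, az2, step1=10.0, step2=5.0):
    q1 = quantize(az1, step1)          # quantize the first direction
    flipped = wrap_deg(az2 + 180.0)    # turn the second direction to its opposite
    diff = wrap_deg(flipped - q1)      # difference to the quantized first direction
    return q1, quantize(diff, step2)   # quantize the difference


def decode_pair(q1, qdiff):
    flipped = wrap_deg(q1 + qdiff)
    return q1, wrap_deg(flipped - 180.0)


q1, qdiff = encode_pair(32.0, -137.0)
print(q1, qdiff, decode_pair(q1, qdiff))
```

When the two directions point roughly opposite to each other, the difference after flipping is small, which is presumably what makes it cheap to quantize.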
-
Publication number: US20230402053A1
Publication date: 2023-12-14
Application number: US17783735
Filing date: 2020-11-13
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Lasse LAAKSONEN , Anssi RÄMÖ , Tapani PIHLAJAKUJA , Adriana VASILACHE
IPC: G10L25/03
CPC classification number: G10L25/03
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining a first spatial audio parameter of a frequency sub band of one or more audio signals and a second spatial audio parameter of the frequency sub band of the one or more audio signals; and means for combining the first spatial audio parameter and the second spatial audio parameter to provide a combined spatial audio parameter for the frequency sub band.
-
Publication number: US20220386056A1
Publication date: 2022-12-01
Application number: US17634108
Filing date: 2020-07-27
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE , Mikko-Ville LAITINEN
IPC: H04S7/00 , H04S3/00 , G10L19/008 , G10L19/035
Abstract: A method for spatial audio signal encoding comprising: obtaining a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; deriving for each of the plurality of audio direction parameters a corresponding derived audio direction parameter (SP) comprising an elevation and an azimuth value, corresponding derived audio direction parameters (SP) being arranged in a manner determined by a spatial utilization defined by the elevation values and the azimuth values of the plurality of audio direction parameters; rotating each derived audio direction parameter (SP) by the azimuth value (φ0) of an audio direction parameter in the first position of the plurality of audio direction parameters and quantizing the rotation to determine for each a corresponding quantized rotated derived audio direction parameter; changing the ordered position of an audio direction parameter to a further position coinciding with a position of a rotated derived audio direction parameter when the azimuth value of the audio direction parameter is closest to the azimuth value of the further rotated derived audio direction parameter compared to the azimuth values of other rotated derived audio direction parameters, followed by determining for each of the plurality audio direction parameters a difference between each audio direction parameter and their corresponding quantized rotated derived audio direction parameter; and quantizing a difference for each of the plurality of audio direction parameters, wherein a difference quantization resolution for each of the plurality of audio direction parameters is defined based on a spatial extent of the audio direction parameters.
-
Publication number: US20220189494A1
Publication date: 2022-06-16
Application number: US17441829
Filing date: 2020-03-26
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Adriana VASILACHE
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding which can receive or determine for one or more audio signals (102), spatial audio parameters (106) on a sub band basis for providing spatial audio reproduction, the spatial audio parameters can comprise a coherence value (112) for each sub band of a plurality of subbands (202) of a frame. The apparatus then determines a significance measure for the coherence values (401) of the plurality of sub bands of the frame and uses the significance measure to determine whether to encode (403) the coherence values of the plurality of sub bands of the frame.
-
Publication number: US20190096410A1
Publication date: 2019-03-28
Application number: US16080339
Filing date: 2016-03-03
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE , Lasse Juhani LAAKSONEN , Anssi Sakari RAMO , Antti HURMALAINEN
IPC: G10L19/008 , G10L19/02 , G10L25/21
Abstract: A method including determining a plurality of band energy scale values for a pair of audio signals; transforming the band energy scale values using a discrete cosine transform to generate a plurality of coefficient values; and selecting a sub-set of the plurality of coefficient values to generate a representation of a level difference between the pair of audio signals.
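Illustrative sketch (not taken from the patent text): the transform-then-select idea can be shown with an unnormalized DCT-II written out directly. Keeping the first `keep` low-order coefficients as the selected sub-set is an assumption made for the example; the abstract does not say which coefficients are chosen.

```python
import math


def dct2(values):
    """Unnormalized DCT-II of a list of numbers."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * (i + 0.5) * k / n) for i, v in enumerate(values))
        for k in range(n)
    ]


def level_difference_repr(scale_values, keep=3):
    """Transform band energy scale values and keep a sub-set of coefficients.

    Assumption: the sub-set is the first `keep` low-order coefficients.
    """
    return dct2(scale_values)[:keep]


# Hypothetical per-band scale values for a pair of channels.
print(level_difference_repr([2.0, 2.0, 2.0, 2.0], keep=2))
```

For a flat set of scale values all the energy lands in the first coefficient, so a short sub-set already describes the level difference across bands.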
-
Publication number: US20170148455A1
Publication date: 2017-05-25
Application number: US15424698
Filing date: 2017-02-03
Applicant: Nokia Technologies Oy
Inventor: Adriana VASILACHE , Anssi Sakari RAMO , Lasse Juhani LAAKSONEN
IPC: G10L19/038 , G10L19/22
CPC classification number: G10L19/038 , G10L19/012 , G10L19/07 , G10L19/22 , G10L25/93 , G10L2019/0001 , G10L2019/0005 , H03M7/3082 , H04N19/124 , H04N19/194 , H04N19/94
Abstract: It is inter alia disclosed to determine a first quantized representation of an input vector, and to determine a second quantized representation of the input vector based on a codebook depending on the first quantized representation.
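Illustrative sketch (not taken from the patent text): one way to read this abstract is as a two-stage vector quantizer in which the second-stage codebook is picked by the first-stage index. Quantizing the residual in the second stage, the nearest-neighbour search, and the toy codebooks are all assumptions made for the example.

```python
def nearest(vec, codebook):
    """Index of the codebook entry with the smallest squared error to vec."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(vec, codebook[i])),
    )


def two_stage_quantize(vec, first_cb, second_cbs):
    """second_cbs[i] is the (hypothetical) codebook used when stage one picks i."""
    i = nearest(vec, first_cb)
    # Assumption: the second stage quantizes the first-stage residual.
    residual = [a - b for a, b in zip(vec, first_cb[i])]
    j = nearest(residual, second_cbs[i])
    recon = [b + c for b, c in zip(first_cb[i], second_cbs[i][j])]
    return i, j, recon


first_cb = [[0, 0], [10, 10]]
second_cbs = [[[0, 0], [1, 1]], [[0, 0], [-2, -2]]]
print(two_stage_quantize([8.5, 8.5], first_cb, second_cbs))
```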
-
Publication number: US20170103769A1
Publication date: 2017-04-13
Application number: US15127143
Filing date: 2015-03-13
Applicant: Nokia Technologies Oy
Inventor: Lasse LAAKSONEN , Anssi RÄMÖ , Adriana VASILACHE
IPC: G10L19/24 , G10L19/16 , G10L19/008
CPC classification number: G10L19/24 , G10L19/008 , G10L19/167 , G10L19/18
Abstract: It is disclosed inter alia a method for forming an audio payload frame, wherein the audio payload frame comprises: an encoded audio data frame with a first marker bit at the front of the encoded audio data frame, wherein the first marker is set to a first value, and wherein the first value denotes a type of encoded audio data in the encoded audio data frame; an extension encoded audio data frame; and a second marker bit in front of the first marker bit, wherein the second marker bit is set to a second value; and wherein the second value denotes a type of encoded audio data other than the type of encoded audio data in the encoded audio data frame.
-
Publication number: US20160078877A1
Publication date: 2016-03-17
Application number: US14785518
Filing date: 2013-04-26
Applicant: Nokia Technologies OY
Inventor: Adriana VASILACHE , Lasse Juhani LAAKSONEN , Anssi Sakari RÄMÖ
IPC: G10L19/22 , G10L19/008
CPC classification number: G10L19/22 , G10L19/008
Abstract: An apparatus comprising: a channel analyser configured to determine for a first frame of at least one audio signal a set of first frame audio signal multi-channel parameters; a multichannel difference selector configured to select for the first frame groups of elements of the set of first frame audio signal multi-channel parameters based on a value associated with the first frame; and a multichannel parameter encoder configured to generate an encoded first frame audio signal multi-channel parameter based on the selected groups of elements of the set of first frame audio signal multi-channel parameters.
-