-
Publication number: US20240312469A1
Publication date: 2024-09-19
Application number: US18678716
Application date: 2024-05-30
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Lasse Laaksonen , Antti Eronen , Arto Lehtiniemi
IPC: G10L19/008 , G10L19/00
CPC classification number: G10L19/008 , G10L2019/0001
Abstract: An apparatus configured to: obtain spatial audio content; decode encoded spatial metadata associated with the spatial audio content based, at least partially, on a configuration parameter indicative of a codec configuration used to encode, at least in part, spatial metadata; determine at least one prototype audio signal based, at least partially, on the spatial audio content and a configuration of at least one output device; and determine one or more spatial audio signals based, at least partially, on the at least one prototype audio signal and the decoded spatial metadata; and provide, to the at least one output device, the one or more spatial audio signals.
-
Publication number: US20220366918A1
Publication date: 2022-11-17
Application number: US17642500
Application date: 2020-09-09
Applicant: Nokia Technologies Oy
Inventor: Jussi LEPPÄNEN , Tapani PIHLAJAKUJA , Kari JARVINEN , Adriana VASILACHE
IPC: G10L19/008 , G10L19/02 , G10L19/032 , H04S3/00
Abstract: A method comprising: obtaining a first audio direction parameter value for each sub-band of a sub-frame of a frame of an audio signal; obtaining a second audio direction parameter value for the sub-frame of the frame of the audio signal for one or more audio objects associated with said audio signal; and determining a bit-efficient encoding for each first audio direction parameter value of the sub-frame based on a similarity between the first audio direction parameter value for each sub-band and the second audio direction parameter values for the one or more audio objects.
-
Publication number: US20210219084A1
Publication date: 2021-07-15
Application number: US17058742
Application date: 2019-05-29
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Lasse LAAKSONEN , Juha VILKAMO , Tapani PIHLAJAKUJA
IPC: H04S3/02 , H04S7/00 , H04R5/02 , G10L19/008
Abstract: Apparatus including circuitry configured for: determining, for two or more speaker channel audio signals, at least one spatial audio parameter for providing spatial audio reproduction; determining between the two or more speaker channel audio signals at least one audio signal relationship parameter, the at least one audio signal relationship parameter being associated with at least one coherence parameter, in such a way that the at least one coherence parameter provides at least one interchannel coherence information for at least two frequency bands, to reproduce the two or more speaker channel audio signals based on the at least one spatial audio parameter and the at least one audio signal relationship parameter; and transmitting using at least one determined value.
-
Publication number: US20240029745A1
Publication date: 2024-01-25
Application number: US18245789
Application date: 2021-08-25
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Mikko-Ville LAITINEN
IPC: G10L19/008 , G10L19/025 , G10L19/032
CPC classification number: G10L19/008 , G10L19/025 , G10L19/032
Abstract: An apparatus comprising means configured to: obtain at least one audio signal; obtain, for the at least one audio signal, spatial audio signal parameter values, the spatial audio signal parameter values distributed within a time-frequency domain (106); determine a merge metric to control a merging of the spatial audio signal parameter values over the time-frequency domain (201); and merge (203), based on the merge metric (202), the spatial audio signal parameter values to a smaller number of spatial audio signal parameter values over time and/or frequency within the time-frequency domain.
-
Publication number: US20230178085A1
Publication date: 2023-06-08
Application number: US17998992
Application date: 2021-04-15
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Mikko-Ville LAITINEN , Lasse Juhani LAAKSONEN , Adriana VASILACHE , Anssi RÄMÖ
IPC: G10L19/008 , G10L25/21 , H04S7/00 , G10L19/02 , G10L25/18
CPC classification number: G10L19/008 , G10L25/21 , H04S7/30 , G10L19/0204 , G10L25/18 , H04S2420/07 , H04S2420/03 , H04S2400/11 , H04S3/008
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for analysing a plurality of spatial audio parameter sets associated with a frame of one or more audio signals, wherein the plurality of spatial audio parameter sets are associated with a plurality of subframes, a plurality of frequency sub bands and a plurality of sound source directions for the frame of the one or more audio signals; and means for determining from the analysis of the plurality of spatial audio parameter sets at least one spatial audio parameter set for subframes of the frame of the one or more audio signals.
-
Publication number: US20210271315A1
Publication date: 2021-09-02
Application number: US17258829
Application date: 2019-07-05
Applicant: Nokia Technologies Oy
Inventor: Kari Juhani JARVINEN , Jussi LEPPANEN , Tapani PIHLAJAKUJA , Adriana VASILACHE
IPC: G06F3/01 , G10L19/008 , H04S7/00 , G06F3/16 , G06T19/00
Abstract: An apparatus including circuitry configured for: obtaining media content, wherein the media content includes at least one object data; obtaining priority content information, the priority content information including a priority identification identifying and classifying the at least one object; rendering the at least one object based on the priority content information.
-
Publication number: US20210160642A1
Publication date: 2021-05-27
Application number: US16613467
Application date: 2018-05-08
Applicant: Nokia Technologies Oy
Inventor: Antti ERONEN , Jussi LEPPANEN , Tapani PIHLAJAKUJA , Arto LEHTINIEMI
IPC: H04S7/00 , G10L21/0216 , G10L19/008
Abstract: According to an example embodiment, a technique for spatial audio processing on basis of two or more input audio signals that represent an audio scene and at least one further input audio signal that represents at least part of the audio scene is provided, the technique including identifying a portion of interest (POI) in the audio scene; processing the two or more input audio signals into a spatial audio signal where the POI in the audio scene is suppressed; generating, on basis of the at least one further input audio signal, a complementary audio signal that represents the POI in the audio scene; and combining the complementary audio signal with the spatial audio signal to create a reconstructed spatial audio signal.
-
Publication number: US20240355341A1
Publication date: 2024-10-24
Application number: US18574918
Application date: 2022-06-16
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Tapani PIHLAJAKUJA
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: An apparatus for spatial audio encoding including circuitry configured to: obtain a first spatial audio stream of a first spatial audio format configured to be encoded with a low bitrate, wherein the first spatial audio stream includes an audio signal and a first metadata; obtain a second and different spatial audio stream of a second spatial audio format, wherein the second spatial audio stream includes a second audio signal and a second metadata; convert the second spatial audio format into the first spatial audio format to encode a converted second spatial audio stream with the low bitrate, wherein the converted spatial audio stream represents spatial audio properties of the second spatial audio stream; combine the first spatial audio stream and the converted second spatial audio stream to generate a combined spatial audio stream; and encode the combined spatial audio stream.
-
Publication number: US20230335141A1
Publication date: 2023-10-19
Application number: US17788155
Application date: 2020-12-11
Applicant: Nokia Technologies Oy
Inventor: Tapani PIHLAJAKUJA , Adriana VASILACHE , Mikko-Ville LAITINEN , Anssi RÄMÖ , Lasse Juhani LAAKSONEN
IPC: G10L19/008 , G10L19/032
CPC classification number: G10L19/008 , G10L19/032
Abstract: An apparatus comprising means configured to: obtain at least one parameter value (106) associated with at least two time-frequency parts of at least one audio signal (104); obtain at least one similarity value based on the at least one parameter value (106) associated with the at least two time-frequency parts of at least one audio signal (104); determine at least one group of time-frequency parts from the at least two time-frequency parts of at least one audio signal (104), the at least one group of time-frequency parts based on the at least one similarity value; and generate for the at least one group of time-frequency parts at least one associated group parameter (204), the at least one group parameter (204) based on the at least one parameter value (106) associated with the time-frequency parts.
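Illustrative note (not part of the patent record): the abstract above describes grouping time-frequency parts by a similarity value and deriving one parameter per group. The sketch below shows one possible reading of that idea, not the claimed method. It assumes scalar parameter values (for example energy ratios), uses absolute difference from the running group mean as the similarity value, and uses the group mean as the group parameter. The function name `group_by_similarity` and the `threshold` argument are hypothetical.

```python
def group_by_similarity(values, threshold):
    """Greedily group consecutive time-frequency parts with similar values.

    values:    one scalar parameter value per time-frequency part (assumed).
    threshold: largest allowed distance from the running group mean (assumed).
    Returns a list of (indices, group_parameter) pairs.
    """
    if not values:
        return []

    groups = []
    current = [0]  # indices of the parts in the group being built
    for index in range(1, len(values)):
        mean = sum(values[i] for i in current) / len(current)
        if abs(values[index] - mean) <= threshold:
            # Similar enough: the part joins the current group.
            current.append(index)
        else:
            # Too different: close the group and start a new one.
            groups.append(current)
            current = [index]
    groups.append(current)

    # One group parameter per group: here simply the mean of its members.
    return [(g, sum(values[i] for i in g) / len(g)) for g in groups]


ratios = [0.10, 0.12, 0.50, 0.52, 0.90]
grouped = group_by_similarity(ratios, threshold=0.05)
for indices, parameter in grouped:
    print(indices, round(parameter, 3))
```

With this choice, five per-part values are reduced to three group parameters, which is the kind of reduction the abstract aims at; a real codec would likely use a perceptually motivated similarity measure instead.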
-
Publication number: US20230197086A1
Publication date: 2023-06-22
Application number: US17786088
Application date: 2020-11-13
Applicant: Nokia Technologies Oy
Inventor: Mikko-Ville LAITINEN , Lasse LAAKSONEN , Adriana VASILACHE , Tapani PIHLAJAKUJA , Anssi RÄMÖ
IPC: G10L19/008 , H04S7/00
CPC classification number: G10L19/008 , H04S7/302 , H04S2420/03
Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.
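Illustrative note (not part of the patent record): the abstract above speaks of merging two spatial audio parameters of the same type into one merged parameter. As a minimal sketch, assuming the parameter is a direction given as azimuth and elevation in degrees, one common way to merge directions is an energy-weighted sum of unit vectors. This is a generic technique chosen for illustration, not necessarily what the patent claims; the function names and the energy weights are hypothetical.

```python
import math


def direction_to_vector(azimuth_deg, elevation_deg):
    """Convert a direction in degrees to a Cartesian unit vector."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az),
            math.cos(el) * math.sin(az),
            math.sin(el))


def merge_directions(first, second, first_energy=1.0, second_energy=1.0):
    """Merge two (azimuth, elevation) pairs into one direction.

    Each direction is weighted by the (assumed) energy of its group of
    samples, summed as a vector, and converted back to angles. Averaging
    vectors rather than raw angles avoids wrap-around errors near 180 degrees.
    """
    x1, y1, z1 = direction_to_vector(*first)
    x2, y2, z2 = direction_to_vector(*second)
    x = first_energy * x1 + second_energy * x2
    y = first_energy * y1 + second_energy * y2
    z = first_energy * z1 + second_energy * z2
    azimuth = math.degrees(math.atan2(y, x))
    elevation = math.degrees(math.atan2(z, math.hypot(x, y)))
    return azimuth, elevation


merged = merge_directions((30.0, 0.0), (60.0, 0.0))
print(merged)
```

With equal energies the merged direction lies midway between the two inputs; raising one energy pulls the result toward that input.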