Abstract:
An apparatus comprising: a perception sorter configured to perceptually order at least two object orientated audio signal channels; and a selective channel processor configured to process at least one of the at least two object orientated audio signal channels based on the order of the at least two object orientated audio signal channels.
Abstract:
An apparatus comprising means configured to: obtain a multichannel audio signal; obtain direction parameter values associated with at least two time-frequency parts of the multichannel audio signal (301), the direction parameter values associated with at least two time-frequency parts comprising an elevation element and an azimuth element associated with at least two time-frequency parts; and compand encode the obtained direction parameter values (305), the means configured to compand encode the obtained direction parameter values is further configured to: quantize the elevation element; determine a companding function based on the quantized elevation element and/or multichannel audio signal format; generate a companded azimuth element based on the companding function applied to the azimuth element; and quantize the companded azimuth element.
Abstract:
There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for analysing a plurality of spatial audio parameter sets associated with a frame of one or more audio signals, wherein the plurality of spatial audio parameter sets are associated with a plurality of subframes, a plurality of frequency sub bands and a plurality of sound source directions for the frame of the one or more audio signals; and means for determining from the analysis of the plurality of spatial audio parameter sets at least one spatial audio parameter set for subframes of the frame of the one or more audio signals.
Abstract:
There is disclosed inter alia an apparatus for spatial audio signal encoding comprising means for receiving for each time frequency block of a sub band of an audio frame a spatial audio parameter comprising an azimuth and an elevation; determining a first distortion measure for the audio frame by determining a first distance measure for each time frequency block and summing the first distance measure for each time frequency block; determining a second distortion measure for the audio frame by determining a second distance measure for each time frequency block and summing the second distance measure for each time frequency block, and selecting either the first quantization scheme or the second quantization scheme for quantising the elevation and the azimuth for all time frequency blocks of the sub band of the audio frame, wherein the selecting is dependent on the first and second distortion measures.
Abstract:
An apparatus including circuitry configured for: obtaining media content, wherein the media content includes at least one object data; obtaining priority content information, the priority content information including a priority identification identifying and classifying the at least one object; rendering the at least one object based on the priority content information.
Abstract:
An apparatus configured to: determine a viewing angle associated with at least one apparatus camera; determine from at least two audio signals at least one audio source orientation relative to an apparatus; and generate at least one spatial filter including at least a first orientation range associated with the viewing angle and a second orientation range relative to the apparatus.
Abstract:
It is inter alia disclosed a method comprising: determining an indication of similarity between a first audio frame of a multiple channel input audio signal and a second audio frame of the multiple channel input audio signal; and determining a coding mode for a multiple channel audio spatial encoder dependent on each of: data indicating a coding mode of a mono audio encoder for the first audio frame of the multiple channel input audio signal; a coding mode of the multichannel spatial audio encoder for the first audio frame of the multiple channel input audio signal; and the indication of similarity.
Abstract:
An approach is provided for efficiently capturing, processing, presenting, and/or associating audio objects with content items and geo-locations. A processing platform may determine a viewpoint of a viewer of at least one content item associated with a geo-location. Further, the processing platform and/or a content provider may determine at least one audio object associated with the at least one content item, the geo-location, or a combination thereof. Furthermore, the processing platform may process the at least one audio object for rendering one or more elements of the at least one audio object based, at least in part, on the viewpoint.
Abstract:
An apparatus for decoding a spatial audio signal direction index to a direction value, the direction index representing a point in a spherical grid generated by covering a sphere with smaller spheres, wherein the centres of the smaller spheres define points of the spherical grid the points arranged substantially equidistant from each other on circles of constant elevation, the apparatus comprising means for: obtaining a spatial audio signal direction index value (306); estimating, by application of a defined polynomial comprising the spatial audio signal direction index value, a grid circle index value (502); determining from the grid circle index value a low direction index value (505) and a high direction index value (507); and determining an elevation index value and an azimuth index value based on the grid circle index value, the low direction index value, the high direction index value and the spatial audio signal direction index value (509).
Abstract:
There is inter alia disclosed an apparatus for spatial audio encoding configured to determining an audio scene separation metric between an input audio signal and a further input audio signal. and using the audio scene separation metric for quantizing of at least one spatial audio parameter of the input audio signal.