SPATIAL AUDIO PARAMETER ENCODING AND ASSOCIATED DECODING

    公开(公告)号:US20230410823A1

    公开(公告)日:2023-12-21

    申请号:US18245028

    申请日:2021-08-18

    CPC classification number: G10L19/032 G10L19/008

    Abstract: An apparatus comprising means configured to: obtain a multichannel audio signal; obtain direction parameter values associated with at least two time-frequency parts of the multichannel audio signal (301), the direction parameter values associated with at least two time-frequency parts comprising an elevation element and an azimuth element associated with at least two time-frequency parts; and compand encode the obtained direction parameter values (305), the means configured to compand encode the obtained direction parameter values is further configured to: quantize the elevation element; determine a companding function based on the quantized elevation element and/or multichannel audio signal format; generate a companded azimuth element based on the companding function applied to the azimuth element; and quantize the companded azimuth element.

    SELECTION OF QUANTISATION SCHEMES FOR SPATIAL AUDIO PARAMETER ENCODING

    公开(公告)号:US20230129520A1

    公开(公告)日:2023-04-27

    申请号:US18146151

    申请日:2022-12-23

    Abstract: There is disclosed inter alia an apparatus for spatial audio signal encoding comprising means for receiving for each time frequency block of a sub band of an audio frame a spatial audio parameter comprising an azimuth and an elevation; determining a first distortion measure for the audio frame by determining a first distance measure for each time frequency block and summing the first distance measure for each time frequency block; determining a second distortion measure for the audio frame by determining a second distance measure for each time frequency block and summing the second distance measure for each time frequency block, and selecting either the first quantization scheme or the second quantization scheme for quantising the elevation and the azimuth for all time frequency blocks of the sub band of the audio frame, wherein the selecting is dependent on the first and second distortion measures.

    MULTIPLE CHANNEL AUDIO SIGNAL ENCODER MODE DETERMINER
    27.
    发明申请
    MULTIPLE CHANNEL AUDIO SIGNAL ENCODER MODE DETERMINER 审中-公开
    多通道音频信号编码器模式确定器

    公开(公告)号:US20160064004A1

    公开(公告)日:2016-03-03

    申请号:US14783487

    申请日:2013-04-15

    CPC classification number: G10L19/008 G10L19/22 G10L19/24

    Abstract: It is inter alia disclosed a method comprising: determining an indication of similarity between a first audio frame of a multiple channel input audio signal and a second audio frame of the multiple channel input audio signal; and determining a coding mode for a multiple channel audio spatial encoder dependent on each of: data indicating a coding mode of a mono audio encoder for the first audio frame of the multiple channel input audio signal; a coding mode of the multichannel spatial audio encoder for the first audio frame of the multiple channel input audio signal; and the indication of similarity.

    Abstract translation: 特别地,公开了一种方法,包括:确定多声道输入音频信号的第一音频帧与多声道输入音频信号的第二音频帧之间的相似度的指示; 以及根据以下各项确定多通道音频空间编码器的编码模式:指示多声道输入音频信号的第一音频帧的单声道音频编码器的编码模式的数据; 用于多声道输入音频信号的第一音频帧的多声道空间音频编码器的编码模式; 和相似性的指示。

    METHOD AND APPARATUS FOR ASSOCIATING AUDIO OBJECTS WITH CONTENT AND GEO-LOCATION
    28.
    发明申请
    METHOD AND APPARATUS FOR ASSOCIATING AUDIO OBJECTS WITH CONTENT AND GEO-LOCATION 有权
    用于与内容和地理位置相关的音频对象的方法和装置

    公开(公告)号:US20150316640A1

    公开(公告)日:2015-11-05

    申请号:US14797820

    申请日:2015-07-13

    Abstract: An approach is provided for efficiently capturing, processing, presenting, and/or associating audio objects with content items and geo-locations. A processing platform may determine a viewpoint of a viewer of at least one content item associated with a geo-location. Further, the processing platform and/or a content provider may determine at least one audio object associated with the at least one content item, the geo-location, or a combination thereof. Furthermore, the processing platform may process the at least one audio object for rendering one or more elements of the at least one audio object based, at least in part, on the viewpoint.

    Abstract translation: 提供了一种用于有效地捕获,处理,呈现和/或将音频对象与内容项目和地理位置相关联的方法。 处理平台可以确定与地理位置相关联的至少一个内容项目的观看者的观点。 此外,处理平台和/或内容提供商可以确定与至少一个内容项目,地理位置或其组合相关联的至少一个音频对象。 此外,处理平台可以处理至少一个音频对象,用于至少部分地基于视点来渲染至少一个音频对象的一个​​或多个元素。

    SPATIAL AUDIO PARAMETER DECODING
    29.
    发明申请

    公开(公告)号:US20250029620A1

    公开(公告)日:2025-01-23

    申请号:US18707301

    申请日:2022-09-23

    Abstract: An apparatus for decoding a spatial audio signal direction index to a direction value, the direction index representing a point in a spherical grid generated by covering a sphere with smaller spheres, wherein the centres of the smaller spheres define points of the spherical grid the points arranged substantially equidistant from each other on circles of constant elevation, the apparatus comprising means for: obtaining a spatial audio signal direction index value (306); estimating, by application of a defined polynomial comprising the spatial audio signal direction index value, a grid circle index value (502); determining from the grid circle index value a low direction index value (505) and a high direction index value (507); and determining an elevation index value and an azimuth index value based on the grid circle index value, the low direction index value, the high direction index value and the spatial audio signal direction index value (509).

Patent Agency Ranking