-
公开(公告)号:US20240163529A1
公开(公告)日:2024-05-16
申请号:US18281535
申请日:2022-03-24
发明人: Dirk Jeroen BREEBAART , Brett G. Crockett , Ryan Michael Friedrich , Jordan Robert Glasgow , Derek Christian Jones , Eric Whelan Yeargan
IPC分类号: H04N21/845 , H04N21/854
CPC分类号: H04N21/8456 , H04N21/854
摘要: The present disclosure relates to a method and audio processing system for performing dynamic range adjustment of spatial audio objects. The method comprises obtaining (step S1) a plurality of spatial audio objects (10), obtaining (step S2) at least one rendered audio presentation of the spatial audio objects (10) and determining (step S3) signal level data associated with each presentation audio channel in said set of presentation audio channels. The method further comprises obtaining (step S31) a threshold value and, for each time segment, selecting (step S4) a selected presentation audio channel which is associated with a highest or a lowest signal level, determining (step S5) a gain based on the threshold value and the representation of the signal level of the selected audio channel, and applying (step S6) the gain of each time segment to corresponding time segments of the spatial audio objects.
-
42.
公开(公告)号:US20230262409A1
公开(公告)日:2023-08-17
申请号:US18106261
申请日:2023-02-06
IPC分类号: H04S7/00
CPC分类号: H04S7/304 , H04S7/306 , H04S2420/01 , H04S2400/03 , H04S2420/07
摘要: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
-
公开(公告)号:US20220399027A1
公开(公告)日:2022-12-15
申请号:US17887429
申请日:2022-08-13
IPC分类号: G10L19/02 , H04S7/00 , G10L19/008
摘要: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
-
44.
公开(公告)号:US20210051435A1
公开(公告)日:2021-02-18
申请号:US17012076
申请日:2020-09-04
发明人: Kuan-Chieh YEN , Dirk Jeroen BREEBAART , Grant A. DAVIDSON , Rhonda WILSON , David M. COOPER , Zhiwei SHUANG
IPC分类号: H04S7/00 , G10L19/008 , H04S3/00
摘要: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
-
45.
公开(公告)号:US20190373397A1
公开(公告)日:2019-12-05
申请号:US16541079
申请日:2019-08-14
发明人: Kuan-Chieh YEN , Dirk Jeroen BREEBAART , Grant A. DAVIDSON , Rhonda WILSON , David M. Cooper , Zhiwei SHUANG
IPC分类号: H04S7/00 , G10L19/008
摘要: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
-
公开(公告)号:US20190035410A1
公开(公告)日:2019-01-31
申请号:US16073132
申请日:2017-01-23
IPC分类号: G10L19/008 , G10L19/02
摘要: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.
-
公开(公告)号:US20160210972A1
公开(公告)日:2016-07-21
申请号:US14916029
申请日:2014-09-09
IPC分类号: G10L19/018 , G10L19/008
CPC分类号: G10L19/018 , G10L19/008
摘要: A method for selecting a subset of channels of (e.g., determined from) at least a segment of a multichannel audio program for watermarking and watermarking the selected subset of channels, and a system or device configured to implement any embodiment of the method, or including a buffer which stores at least one frame or other segment of a multichannel audio program generated by any embodiment of the method or steps thereof. Some embodiments generate watermarking metadata during program creation including by analyzing audio content to be included in segments of a multichannel program, determining at least one watermark suitability value for each channel of each of the segments, and including the watermark suitability values (or watermarking data determined therefrom) as metadata in the program. Some embodiments are implemented by a playback system which determines the selected subset of channels to be watermarked.
摘要翻译: 一种用于选择多通道音频节目的至少一个段的信道子集的子集的方法,用于对选定的信道子集进行水印和水印,以及被配置为实现该方法的任何实施例的系统或设备,或包括 存储由该方法或其任何实施方式生成的多声道音频节目的至少一个帧或其他片段的缓冲器。 一些实施例在程序创建期间产生水印元数据,包括通过分析要包括在多通道程序的段中的音频内容,为每个段的每个通道确定至少一个水印适用性值,并且包括水印适用性值(或确定的水印数据 作为程序中的元数据。 一些实施例由确定要被加水印的所选择的信道子集的重放系统来实现。
-
公开(公告)号:US20160150343A1
公开(公告)日:2016-05-26
申请号:US14900117
申请日:2014-06-17
发明人: Jun WANG , Lie LU , Mingqing HU , Dirk Jeroen BREEBAART , Nicolas R. TSINGOS
IPC分类号: H04S7/00 , G10L19/008 , G10L19/02
CPC分类号: H04S7/30 , G10L19/008 , G10L19/0204 , G10L19/20 , G10L21/0272 , H04S3/002 , H04S5/005 , H04S2400/11 , H04S2400/13 , H04S2400/15 , H04S2420/07
摘要: Embodiments of the present invention relate to adaptive audio content generation. Specifically, a method for generating adaptive audio content is provided. The method comprises extracting at least one audio object from channel-based source audio content, and generating the adaptive audio content at least partially based on the at least one audio object. Corresponding system and computer program product are also disclosed.
摘要翻译: 本发明的实施例涉及自适应音频内容生成。 具体地,提供了一种用于产生自适应音频内容的方法。 所述方法包括从基于频道的源音频内容中提取至少一个音频对象,以及至少部分地基于所述至少一个音频对象生成所述自适应音频内容。 还公开了相应的系统和计算机程序产品。
-
公开(公告)号:US20160125887A1
公开(公告)日:2016-05-05
申请号:US14893485
申请日:2014-05-23
发明人: Heiko PURNHAGEN , Kristofer KJOERLING , Toni HIRVONEN , Lars VILLEMOES , Dirk Jeroen BREEBAART , Leif Jonas SAMUELSSON
IPC分类号: G10L19/008 , G10L19/018 , H04S3/00
CPC分类号: G10L19/008 , G10L19/018 , H04S3/008 , H04S2400/03 , H04S2400/11
摘要: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≦N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
-
公开(公告)号:US20220189493A1
公开(公告)日:2022-06-16
申请号:US17687956
申请日:2022-03-07
IPC分类号: G10L19/008 , H04S3/00
摘要: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
-
-
-
-
-
-
-
-
-