Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network
    22.
    Invention application
    Status: granted

    Publication No.: US20160345116A1

    Publication date: 2016-11-24

    Application No.: US15109541

    Filing date: 2014-12-18

    Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel, including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
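The second processing path in this abstract (a downmix fed into a feedback delay network) can be illustrated with a minimal sketch. It is not the patented implementation: the delay lengths, the feedback gain, the scaled Hadamard feedback matrix, and the function names `downmix` and `fdn_reverb` are all assumptions chosen for illustration, and the output is a single mono wet signal rather than a binaural pair.

```python
from collections import deque

def downmix(channels):
    """Average equal-length channels sample by sample into one mono signal."""
    n = len(channels)
    return [sum(samples) / n for samples in zip(*channels)]

# 4x4 Hadamard matrix scaled by 1/2 so that it is orthogonal.
# Assumption: the abstract does not specify a feedback matrix.
HADAMARD = [
    [0.5, 0.5, 0.5, 0.5],
    [0.5, -0.5, 0.5, -0.5],
    [0.5, 0.5, -0.5, -0.5],
    [0.5, -0.5, -0.5, 0.5],
]

def fdn_reverb(signal, delays=(7, 11, 13, 17), gain=0.8, tail=0):
    """Run a mono signal through a 4-line FDN and return the wet output.

    delays and gain are illustrative values; gain < 1 keeps the loop stable
    because the feedback matrix is orthogonal.
    """
    lines = [deque([0.0] * d, maxlen=d) for d in delays]
    out = []
    for x in list(signal) + [0.0] * tail:
        taps = [line[0] for line in lines]  # oldest sample of each delay line
        out.append(sum(taps))
        for i, line in enumerate(lines):
            fb = sum(HADAMARD[i][j] * taps[j] for j in range(len(taps)))
            line.append(x + gain * fb)  # appending to a full deque drops the oldest
    return out
```

Calling `fdn_reverb(downmix(channels), tail=2000)` would apply one shared reverberant tail to all channels, which is the cost-saving idea the abstract describes: the per-channel filters only need to cover the direct response and early reflections.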

    METHODS AND SYSTEMS FOR DESIGNING AND APPLYING NUMERICALLY OPTIMIZED BINAURAL ROOM IMPULSE RESPONSES
    23.
    Invention application
    Status: under examination (published)

    Publication No.: US20160337779A1

    Publication date: 2016-11-17

    Application No.: US15109557

    Filing date: 2014-12-23

    Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
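The two ideas in this abstract, applying a BRIR to each channel and summing, and choosing the best candidate BRIR by an objective function, can be sketched as follows. This is a rough illustration only: the helper names `convolve`, `binauralize`, and `best_candidate` are hypothetical, all channels and all BRIRs are assumed to have equal lengths, and "best" is assumed to mean the lowest objective score.

```python
def convolve(x, h):
    """Direct-form linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xi in enumerate(x):
        for j, hj in enumerate(h):
            y[i + j] += xi * hj
    return y

def binauralize(channels, brirs):
    """Filter each channel with its (left, right) BRIR pair and sum per ear.

    Assumes equal-length channels and equal-length BRIRs.
    """
    left = right = None
    for x, (h_left, h_right) in zip(channels, brirs):
        yl, yr = convolve(x, h_left), convolve(x, h_right)
        left = yl if left is None else [a + b for a, b in zip(left, yl)]
        right = yr if right is None else [a + b for a, b in zip(right, yr)]
    return left, right

def best_candidate(candidates, objective):
    """Return the candidate with the lowest objective score (an assumption:
    the abstract does not say whether the metric is minimized or maximized)."""
    return min(candidates, key=objective)
```

In a fuller design loop, a simulation model would generate the `candidates` list and several objective functions could be combined into one score before the selection step.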

    SPATIAL ERROR METRICS OF AUDIO CONTENT
    24.
    Invention application
    Status: under examination (published)

    Publication No.: US20160337776A1

    Publication date: 2016-11-17

    Application No.: US15110371

    Filing date: 2015-01-05

    Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.
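One way to picture a spatial error metric built from positional metadata is a weighted mean distance between each audio object and the output cluster it was converted to. The sketch below is an assumption, not the metric defined in the application: the function name `spatial_error`, the one-cluster-per-object `assignment` list, and the optional per-object `weights` are all hypothetical.

```python
import math

def spatial_error(objects, clusters, assignment, weights=None):
    """Weighted mean Euclidean distance between objects and assigned clusters.

    objects, clusters: lists of (x, y, z) positions for one frame.
    assignment[i]: index of the cluster that object i was converted to.
    weights: optional per-object importance (for example, loudness).
    """
    if weights is None:
        weights = [1.0] * len(objects)
    total = sum(
        w * math.dist(obj, clusters[c])
        for obj, c, w in zip(objects, assignment, weights)
    )
    return total / sum(weights)
```

A value of zero means every object sits exactly at its cluster position; larger values indicate that clustering moved objects further from where they were authored.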

    DOLBY ATMOS MASTER COMPRESSOR/LIMITER
    27.
    Invention publication

    Publication No.: US20240163529A1

    Publication date: 2024-05-16

    Application No.: US18281535

    Filing date: 2022-03-24

    CPC classification number: H04N21/8456 H04N21/854

    Abstract: The present disclosure relates to a method and audio processing system for performing dynamic range adjustment of spatial audio objects. The method comprises obtaining (step S1) a plurality of spatial audio objects (10), obtaining (step S2) at least one rendered audio presentation of the spatial audio objects (10) and determining (step S3) signal level data associated with each presentation audio channel in said set of presentation audio channels. The method further comprises obtaining (step S31) a threshold value and, for each time segment, selecting (step S4) a selected presentation audio channel which is associated with a highest or a lowest signal level, determining (step S5) a gain based on the threshold value and the representation of the signal level of the selected audio channel, and applying (step S6) the gain of each time segment to corresponding time segments of the spatial audio objects.
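Steps S3 to S6 of this abstract can be sketched as a simple peak limiter driven by the rendered presentation and applied to the objects. The sketch makes several assumptions that the abstract does not state: signal level is measured as the absolute peak per segment, the highest-level channel is selected, the gain is `threshold / peak` when the peak exceeds the threshold, and the function names and `seg_len` parameter are hypothetical.

```python
def segment_gains(presentation, threshold, seg_len):
    """Compute one gain per time segment from the loudest presentation channel.

    presentation: list of equal-length channels of the rendered presentation.
    """
    n = len(presentation[0])
    gains = []
    for start in range(0, n, seg_len):
        # Highest level across all presentation channels in this segment.
        peak = max(abs(s) for ch in presentation for s in ch[start:start + seg_len])
        gains.append(1.0 if peak <= threshold else threshold / peak)
    return gains

def apply_gains(objects, gains, seg_len):
    """Apply each segment's gain to the matching segment of every audio object."""
    return [[s * gains[i // seg_len] for i, s in enumerate(obj)] for obj in objects]
```

Because the gain is derived from the rendering but applied to the objects, every later rendering of those objects inherits the same dynamic range adjustment.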

    METHODS AND SYSTEMS FOR DESIGNING AND APPLYING NUMERICALLY OPTIMIZED BINAURAL ROOM IMPULSE RESPONSES

    Publication No.: US20230262409A1

    Publication date: 2023-08-17

    Application No.: US18106261

    Filing date: 2023-02-06

    Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

    AUDIO DECODER AND DECODING METHOD
    29.
    Invention application

    Publication No.: US20220399027A1

    Publication date: 2022-12-15

    Application No.: US17887429

    Filing date: 2022-08-13

    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
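The multi-tap convolution matrix mentioned in this abstract generalizes an ordinary mixing matrix: each entry is a short filter rather than a single gain. The sketch below applies such a matrix to the base signals of one frequency band. It is an illustration under assumptions: the band splitting itself is omitted, and the function name `apply_convolution_matrix` and the `taps[o][i]` layout are hypothetical.

```python
def apply_convolution_matrix(base, taps):
    """Transform base signals into output signals with a matrix of short filters.

    base: list of equal-length base signals for one frequency band.
    taps[o][i]: list of filter taps from base signal i to output signal o.
    A one-element tap list reduces to an ordinary matrix gain.
    """
    n = len(base[0])
    out = []
    for row in taps:
        y = [0.0] * n
        for x, h in zip(base, row):
            for t in range(n):
                for k, hk in enumerate(h):
                    if t - k >= 0:
                        y[t] += hk * x[t - k]  # causal convolution, truncated to n
        out.append(y)
    return out
```

A decoder following this scheme would receive the base signals (the first presentation) plus the tap values as side information, and reconstruct the second presentation band by band.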

    PRESENTATION INDEPENDENT MASTERING OF AUDIO CONTENT

    Publication No.: US20220295207A1

    Publication date: 2022-09-15

    Application No.: US17625720

    Filing date: 2020-07-07

    Abstract: A method for generating mastered audio content, the method comprising obtaining an input audio content comprising a number, M1, of audio signals, obtaining a rendered presentation of the input audio content, the rendered presentation comprising a number, M2, of audio signals, obtaining a mastered presentation generated by mastering the rendered presentation, comparing the mastered presentation with the rendered presentation to determine one or more indications of differences between the mastered presentation and the rendered presentation, and modifying one or more of the audio signals of the input audio content based on the indications of differences to generate the mastered audio content. With this approach, conventional, typically stereo, channel-based mastering tools can be used to provide a mastered version of any input audio content, including object-based immersive audio content.
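The compare-then-modify step in this abstract can be reduced to its simplest possible form: measure one broadband level difference between the mastered and rendered presentations, then apply it to every input signal. This is a deliberately crude sketch. A single RMS gain is an assumption standing in for the "indications of differences" (which could equally be per-band or time-varying), and the names `rms` and `master_input` are hypothetical.

```python
import math

def rms(signals):
    """Root-mean-square level over all samples of all signals."""
    total = sum(s * s for sig in signals for s in sig)
    count = sum(len(sig) for sig in signals)
    return math.sqrt(total / count)

def master_input(input_signals, rendered, mastered):
    """Scale the M1 input signals by the level change that mastering applied
    to the M2-signal rendered presentation."""
    reference = rms(rendered)
    gain = rms(mastered) / reference if reference > 0 else 1.0
    return [[s * gain for s in sig] for sig in input_signals]
```

The point of the approach is visible even here: the mastering tool only ever sees the M2 rendered signals, yet the result is transferred back to all M1 input signals.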
