SPATIAL AUDIO SIGNAL MANIPULATION
    1.
    Invention publication

    Publication number: US20240305945A1

    Publication date: 2024-09-12

    Application number: US18607347

    Application date: 2024-03-15

    Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
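The rendering flow in this abstract (object position, loudspeaker layout and control data in; rendering modification data out) can be sketched as follows. This is only an illustration under stated assumptions, not the patented method: the layout is reduced to sorted positions on a line, the control data is a simple position offset, and constant-power pairwise panning is swapped in to produce the gains. The names `pan_gains`, `render` and `position_offset` are hypothetical.

```python
import math
from bisect import bisect_right


def pan_gains(layout, position):
    """Constant-power gains for a position on a sorted 1-D loudspeaker layout.

    Assumption: `layout` holds at least two distinct, ascending coordinates.
    """
    # Clamp the position so it always lies between the outermost loudspeakers.
    position = min(max(position, layout[0]), layout[-1])
    # Find the pair of adjacent loudspeakers that encloses the position.
    hi = min(max(bisect_right(layout, position), 1), len(layout) - 1)
    lo = hi - 1
    frac = (position - layout[lo]) / (layout[hi] - layout[lo])
    gains = [0.0] * len(layout)
    gains[lo] = math.cos(frac * math.pi / 2)
    gains[hi] = math.sin(frac * math.pi / 2)
    return gains


def render(samples, object_position, layout, position_offset):
    """Return one feed per loudspeaker for a single audio object.

    `position_offset` stands in for the control data (hypothetical form);
    the per-loudspeaker gains stand in for the rendering modification data.
    """
    modified_position = object_position + position_offset
    gains = pan_gains(layout, modified_position)
    return [[g * s for s in samples] for g in gains]


if __name__ == "__main__":
    layout = [-1.0, 0.0, 1.0]  # left, centre, right
    feeds = render([1.0, 0.5], object_position=-1.0, layout=layout,
                   position_offset=0.5)
    for speaker, feed in zip(layout, feeds):
        print(speaker, feed)
```

With the offset applied, the object moves from the left loudspeaker to a point halfway between left and centre, so both of those loudspeakers receive signal and the right one stays silent.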

    ACOUSTIC ENVIRONMENT SIMULATION
    2.
    Invention publication

    Publication number: US20240038248A1

    Publication date: 2024-02-01

    Application number: US18366385

    Application date: 2023-08-07

    CPC classification number: G10L19/008 G10L19/012 G10L19/00 G10L19/0212

    Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (β2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (α) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (β2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.

    SPATIAL AUDIO SIGNAL MANIPULATION
    3.

    Publication number: US20220272479A1

    Publication date: 2022-08-25

    Application number: US17694506

    Application date: 2022-03-14

    Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).

    SPATIAL AUDIO SIGNAL MANIPULATION
    4.

    Publication number: US20210014628A1

    Publication date: 2021-01-14

    Application number: US16938561

    Application date: 2020-07-24

    Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).

    METHODS AND SYSTEMS FOR DESIGNING AND APPLYING NUMERICALLY OPTIMIZED BINAURAL ROOM IMPULSE RESPONSES
    5.

    Publication number: US20200162835A1

    Publication date: 2020-05-21

    Application number: US16749494

    Application date: 2020-01-22

    Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.
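The signal path named in this abstract (apply a BRIR to each channel, then combine the filtered signals into a binaural signal) can be sketched as below. This is a minimal illustration, assuming each BRIR is given as a pair of left-ear and right-ear impulse responses and using direct time-domain convolution; it says nothing about how the BRIRs themselves are designed. The function names are hypothetical.

```python
def convolve(signal, impulse):
    """Direct-form linear convolution of two non-empty sample lists."""
    out = [0.0] * (len(signal) + len(impulse) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse):
            out[i + j] += s * h
    return out


def binauralize(channels, brirs):
    """Filter each channel with its BRIR pair and sum the results per ear.

    channels: list of sample lists, one per loudspeaker channel.
    brirs:    list of (left_ir, right_ir) pairs, one per channel.
    Returns (left, right) sample lists of equal length.
    """
    length = max(
        len(channel) + max(len(ir_left), len(ir_right)) - 1
        for channel, (ir_left, ir_right) in zip(channels, brirs)
    )
    left = [0.0] * length
    right = [0.0] * length
    for channel, (ir_left, ir_right) in zip(channels, brirs):
        for n, value in enumerate(convolve(channel, ir_left)):
            left[n] += value
        for n, value in enumerate(convolve(channel, ir_right)):
            right[n] += value
    return left, right


if __name__ == "__main__":
    # Two toy channels and two toy BRIR pairs (made-up values).
    channels = [[1.0, 0.0], [0.0, 1.0]]
    brirs = [([1.0, 0.5], [0.2]), ([0.3], [1.0, 0.25])]
    left, right = binauralize(channels, brirs)
    print(left)
    print(right)
```

Real BRIRs run to thousands of taps, so a practical virtualizer would normally use FFT-based convolution instead of the nested loops shown here.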

    SPATIAL AUDIO SIGNAL MANIPULATION
    6.
    Invention application

    Publication number: US20190230461A1

    Publication date: 2019-07-25

    Application number: US16374520

    Application date: 2019-04-03

    Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).

    AUDIO OBJECT CLUSTERING BY UTILIZING TEMPORAL VARIATIONS OF AUDIO OBJECTS
    7.
    Invention application
    Status: granted

    Publication number: US20160358618A1

    Publication date: 2016-12-08

    Application number: US15117647

    Application date: 2015-02-23

    Abstract: Embodiments of the present invention relate to audio object clustering by utilizing temporal variation of audio objects. There is provided a method of estimating temporal variation of an audio object for use in audio object clustering. The method comprises obtaining at least one segment of an audio track associated with the audio object, the at least one segment containing the audio object; estimating variation of the audio object over a time duration of the at least one segment based on at least one property of the audio object and adjusting, at least partially based on the estimated variation of the audio object, a contribution of the audio object to the determination of a centroid in the audio object clustering. Corresponding system and computer program product are disclosed.
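The idea of adjusting an object's contribution to a cluster centroid according to its estimated temporal variation can be sketched as below. The abstract does not state how variation is measured or in which direction the contribution is adjusted, so both choices here are assumptions made purely for illustration: variation is taken as the mean absolute frame-to-frame change of one property over the segment, and the contribution weight is `importance / (1 + variation)`. All names are hypothetical.

```python
def temporal_variation(track):
    """Mean absolute frame-to-frame change of one property over a segment."""
    if len(track) < 2:
        return 0.0
    steps = [abs(b - a) for a, b in zip(track, track[1:])]
    return sum(steps) / len(steps)


def weighted_centroid(objects):
    """Centroid of object positions with variation-adjusted contributions.

    objects: list of (position, importance, property_track) tuples, where
    position is an (x, y) pair and property_track is a list of per-frame
    values of some object property within the segment.
    Assumed rule (not from the source): weight = importance / (1 + variation).
    """
    weights = [
        importance / (1.0 + temporal_variation(track))
        for _, importance, track in objects
    ]
    total = sum(weights)
    x = sum(w * pos[0] for w, (pos, _, _) in zip(weights, objects)) / total
    y = sum(w * pos[1] for w, (pos, _, _) in zip(weights, objects)) / total
    return (x, y)


if __name__ == "__main__":
    objects = [
        ((0.0, 0.0), 1.0, [1.0, 1.0, 1.0]),  # steady object
        ((1.0, 1.0), 1.0, [0.0, 1.0, 0.0]),  # strongly varying object
    ]
    print(weighted_centroid(objects))
```

Under this assumed rule the steady object pulls the centroid towards itself, because the varying object's weight is reduced.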


    DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL
    8.

    Publication number: US20240071411A1

    Publication date: 2024-02-29

    Application number: US18259848

    Application date: 2022-01-04

    CPC classification number: G10L25/60 G10L21/0272

    Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.
