METHOD FOR NEURAL NETWORK TRAINING WITH MULTIPLE SUPERVISORS

    Publication No.: US20250045585A1

    Publication date: 2025-02-06

    Application No.: US18716895

    Filing date: 2022-12-08

    Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n−1) to a supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n−1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.
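
The loss bookkeeping described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed method: modules are modeled as plain scalar functions, the squared-error loss is an assumption, and the names `supervised_losses` and `total_loss` are hypothetical. The gradient-based training step and the removal, replacement, or addition of processor modules are not shown.

```python
from typing import Callable, List, Sequence, Tuple

# Assumption: each processor or supervisor module maps one float to one float.
Module = Callable[[float], float]


def squared_error(prediction: float, target: float) -> float:
    """Loss measure comparing a prediction with the ground truth (assumed form)."""
    return (prediction - target) ** 2


def supervised_losses(
    x: float,
    target: float,
    processor_modules: Sequence[Module],
    supervisor_modules: Sequence[Module],
) -> Tuple[float, List[float]]:
    """Return the processor loss and one loss per supervisor module."""
    latent = x
    supervisor_losses: List[float] = []
    for index, module in enumerate(processor_modules):
        latent = module(latent)
        # A supervisor, where one exists at this depth, predicts the target
        # from the latent variables (n-1 supervisors for n processor modules).
        if index < len(supervisor_modules):
            prediction = supervisor_modules[index](latent)
            supervisor_losses.append(squared_error(prediction, target))
    # The output of the last processor module is the first prediction.
    first_loss = squared_error(latent, target)
    return first_loss, supervisor_losses


def total_loss(
    first_loss: float, supervisor_losses: Sequence[float], weight: float = 1.0
) -> float:
    """Combine both loss measures; the weighted sum is an assumption."""
    return first_loss + weight * sum(supervisor_losses)
```

Comparing the per-supervisor losses with the first loss is one conceivable way to judge whether later processor modules contribute, but the abstract does not state the criterion actually used.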

    PROCESSING OBJECT-BASED AUDIO SIGNALS
    Invention application

    Publication No.: US20200288260A1

    Publication date: 2020-09-10

    Application No.: US16825776

    Filing date: 2020-03-20

    Abstract: An audio processing system and method are disclosed. Based on spatial metadata of each audio object, a panning coefficient is calculated for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each of the submixes indicating a sum of components of the plurality of audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
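
A rough sketch of the submix and object-gain steps is given below. The panning coefficients are taken as given rather than derived from spatial metadata, and the specific object-gain rule (a panning-weighted average of the submix gains) is an assumption chosen for illustration; the abstract only says the object gain is a function of both quantities. The function names are hypothetical.

```python
from typing import List, Sequence


def build_submixes(
    objects: Sequence[Sequence[float]], panning: Sequence[Sequence[float]]
) -> List[List[float]]:
    """Sum each object's samples into every zone, scaled by panning[i][zone]."""
    num_zones = len(panning[0])
    num_samples = len(objects[0])
    submixes = [[0.0] * num_samples for _ in range(num_zones)]
    for signal, coeffs in zip(objects, panning):
        for zone, coeff in enumerate(coeffs):
            for n, sample in enumerate(signal):
                submixes[zone][n] += coeff * sample
    return submixes


def object_gains(
    panning: Sequence[Sequence[float]], submix_gains: Sequence[float]
) -> List[float]:
    """Map per-zone submix gains back to per-object gains.

    Assumption: each object gain is the panning-weighted average of the
    submix gains. Objects with no panning weight are left at unity gain.
    """
    gains = []
    for coeffs in panning:
        weight = sum(coeffs)
        if weight == 0.0:
            gains.append(1.0)
        else:
            weighted = sum(c * g for c, g in zip(coeffs, submix_gains))
            gains.append(weighted / weight)
    return gains
```

For example, an object panned entirely into one zone simply inherits that zone's submix gain, while an object spread evenly over two zones receives the mean of the two gains.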

    Content-Adaptive Surround Sound Virtualization

    Publication No.: US20180324540A1

    Publication date: 2018-11-08

    Application No.: US15772383

    Filing date: 2016-11-02

    Abstract: Example embodiments disclosed herein relate to content-adaptive surround sound virtualization. A method of virtualizing surround sound is disclosed. The method includes receiving a set of input audio signals, each of the input audio signals being indicative of sound from one of different sound sources, and determining a probability of the set of input audio signals belonging to a predefined audio content category. The method also includes determining a virtualization amount based on the determined probability, the virtualization amount indicating the extent to which the set of input audio signals is virtualized as surround sound. The method further includes performing surround sound virtualization on two or more input audio signals in the set based on the determined virtualization amount and generating output audio signals based on the virtualized input audio signals and other input audio signals in the set. A corresponding system and computer program product for virtualizing surround sound are also disclosed.
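
The step from probability to virtualization amount might look like the sketch below. The abstract does not give the mapping, so the clamped linear form, its default direction (higher probability, less virtualization), and the crossfade used to apply the amount are all assumptions; `virtualization_amount` and `blend` are hypothetical names.

```python
from typing import List, Sequence


def virtualization_amount(
    probability: float, slope: float = -1.0, offset: float = 1.0
) -> float:
    """Map a content-category probability to an amount in [0, 1].

    Assumption: a clamped linear mapping. The default slope lowers the
    amount as the probability rises; a positive slope reverses that.
    """
    return min(1.0, max(0.0, slope * probability + offset))


def blend(
    dry: Sequence[float], virtualized: Sequence[float], amount: float
) -> List[float]:
    """Crossfade between the unprocessed and the virtualized signal."""
    return [(1.0 - amount) * d + amount * v for d, v in zip(dry, virtualized)]
```

An amount of 0 leaves the input untouched and an amount of 1 passes only the virtualized signal, so the classifier output steers the effect smoothly rather than switching it on and off.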

    DECOMPOSING AUDIO SIGNALS
    Invention application

    Publication No.: US20170206907A1

    Publication date: 2017-07-20

    Application No.: US15326378

    Filing date: 2015-07-14

    Inventors: Jun WANG, Lie LU

    Abstract: Example embodiments disclosed herein relate to signal processing. A method for decomposing a plurality of audio signals from at least two different channels is disclosed. The method comprises obtaining a set of components that are weakly correlated, the set of components generated based on the plurality of audio signals. The method comprises extracting a feature from the set of components, and determining a set of gains associated with the set of components at least in part based on the extracted feature, each of the gains indicating a proportion of a diffuse part in the associated component. The method further comprises decomposing the plurality of audio signals by applying the set of gains to the set of components. A corresponding system and computer program product are also disclosed.
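
The final step, applying the gains to the components, can be illustrated as below. The sketch assumes the weakly correlated components and their gains have already been obtained, treats each component as a single sample value, and assumes that the remainder `1 - gain` is the direct part; the name `split_diffuse_direct` is hypothetical.

```python
from typing import List, Sequence, Tuple


def split_diffuse_direct(
    components: Sequence[float], gains: Sequence[float]
) -> Tuple[List[float], List[float]]:
    """Split each component into a diffuse and a direct part.

    Each gain is the proportion of the diffuse part in its component.
    Assumption: the remaining proportion (1 - gain) is the direct part.
    """
    if len(components) != len(gains):
        raise ValueError("one gain is required per component")
    diffuse = [g * c for c, g in zip(components, gains)]
    direct = [(1.0 - g) * c for c, g in zip(components, gains)]
    return diffuse, direct
```

Under this assumption the two parts always sum back to the original component, so the decomposition loses no signal energy by construction.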

    SPATIAL ERROR METRICS OF AUDIO CONTENT
    Invention application
    Status: Under examination (published)

    Publication No.: US20160337776A1

    Publication date: 2016-11-17

    Application No.: US15110371

    Filing date: 2015-01-05

    Abstract: Audio objects that are present in input audio content in one or more frames are determined. Output clusters that are present in output audio content in the one or more frames are also determined. Here, the audio objects in the input audio content are converted to the output clusters in the output audio content. One or more spatial error metrics are computed based at least in part on positional metadata of the audio objects and positional metadata of the output clusters.
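
One conceivable spatial error metric built from positional metadata is sketched below. The abstract does not define its metrics, so this particular choice (distance between an object's position and the gain-weighted centroid of the clusters it is rendered to) is purely an assumption, as are the name `position_error` and the use of 3-D Cartesian coordinates.

```python
import math
from typing import Sequence, Tuple

Position = Tuple[float, float, float]


def position_error(
    object_position: Position,
    cluster_positions: Sequence[Position],
    gains: Sequence[float],
) -> float:
    """Distance from an object to the gain-weighted centroid of its clusters."""
    total = sum(gains)
    if total <= 0.0:
        raise ValueError("at least one cluster gain must be positive")
    centroid = tuple(
        sum(g * p[axis] for g, p in zip(gains, cluster_positions)) / total
        for axis in range(3)
    )
    # math.dist requires Python 3.8 or later.
    return math.dist(object_position, centroid)
```

An object distributed equally over two clusters placed symmetrically around it scores zero error under this metric, while an object assigned to a single distant cluster scores the full distance.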


    EQUALIZER CONTROLLER AND CONTROLLING METHOD
    Invention application
    Status: Granted

    Publication No.: US20160056787A1

    Publication date: 2016-02-25

    Application No.: US14780485

    Filing date: 2014-03-17

    CPC classification numbers: H03G5/165, H03G5/005, H04R3/04

    Abstract: An equalizer controller and a controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time, and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the identified audio type.
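
Continuous adjustment driven by a confidence value could be realized as in the sketch below. The one-pole smoothing and the linear scaling of a preset's band gains are assumptions made for illustration, not details taken from the abstract; `update_eq_level` and `scaled_band_gains` are hypothetical names.

```python
from typing import List, Sequence


def update_eq_level(
    previous_level: float, confidence: float, smoothing: float = 0.9
) -> float:
    """Move the equalization level toward the classifier confidence.

    Assumption: one-pole smoothing, so the level changes gradually
    instead of jumping when the classifier output changes.
    """
    return smoothing * previous_level + (1.0 - smoothing) * confidence


def scaled_band_gains(preset_db: Sequence[float], level: float) -> List[float]:
    """Scale a preset's per-band gains (in dB) by the current level."""
    return [gain * level for gain in preset_db]
```

Calling `update_eq_level` once per audio frame and feeding the result to `scaled_band_gains` yields an equalizer that fades in as confidence in the identified audio type grows and fades out as it drops.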

