Apparatus for determining spatial positions of multiple audio sources

    公开(公告)号:US11921198B2

    公开(公告)日:2024-03-05

    申请号:US17582837

    申请日:2022-01-24

    CPC classification number: G01S15/46 H04R1/406 H04R3/005 G01S2015/465

    Abstract: An apparatus determines a spatial position of an audio source in multi moving audio sources scenarios. The apparatus receives audio signal versions as local sound waves. The apparatus determines first and second probabilities for a direction of arrival of the audio signal version based on the audio signal versions received within a first time interval; determines third and fourth probabilities for the direction of arrival of the audio signal version based on the audio signal versions received within a second time interval; determines a first probability difference between the first and third probabilities; determines a second probability difference between the second and fourth probabilities; combines the third probability and the first probability difference to obtain an updated third probability; combines the fourth probability with the second probability difference to obtain an updated fourth probability; and determines the spatial position based on the updated third and fourth probabilities.

    Audio processing apparatus and method for denoising a multi-channel audio signal

    公开(公告)号:US11889292B2

    公开(公告)日:2024-01-30

    申请号:US17581527

    申请日:2022-01-21

    Abstract: The disclosure relates to an audio processing apparatus, comprising: a plurality of audio sensors, each audio sensor configured to receive a respective plurality of audio frames of an audio signal from an audio source, wherein the respective plurality of audio frames defines an audio channel of the audio signal; and a processing circuitry configured to: determine a respective feature set having at least one feature for each audio frame of each of the plurality of audio frames, wherein the plurality of features define a three-dimensional feature array; process the three-dimensional feature array using a neural network, wherein the neural network comprises a self-attention layer configured to process a plurality of two-dimensional sub-arrays of the three-dimensional feature array; and generate an output signal on the basis of the plurality of processed two-dimensional sub-arrays. Moreover, the disclosure relates to a corresponding audio processing method.

    Device and method for estimating direction of arrival

    公开(公告)号:US11567162B2

    公开(公告)日:2023-01-31

    申请号:US16664373

    申请日:2019-10-25

    Abstract: A device for estimating Direction of Arrival (DOA) of sound from Q≥1 sound sources is provided. The device is configured to obtain a phase difference matrix, which includes measured phase difference values, each of the measured phase difference values being a measured value of a phase difference between two microphone units for a frequency bin in a range of frequencies of the sound. The device is further configured to generate a replicated phase difference matrix by replicating the measured phase difference values to other potential sinusoidal periods, calculate a DOA value for each phase difference value in the replicated phase difference matrix, and determine, as Q DOA results, the Q most prominent peak values in a histogram generated based on the calculated DOA values.

Patent Agency Ranking