CONTEXT AWARE SOUNDSCAPE CONTROL
    Published invention application

    Publication No.: US20240155289A1

    Publication date: 2024-05-09

    Application No.: US18548791

    Filing date: 2022-04-28

    Abstract: Embodiments are disclosed for context aware soundscape control. In an embodiment, an audio processing method comprises: capturing, using a first set of microphones on a mobile device, a first audio signal from an audio scene; capturing, using a second set of microphones on a pair of earbuds, a second audio signal from the audio scene; capturing, using a camera on the mobile device, a video signal from a video scene; generating, with at least one processor, a processed audio signal from the first audio signal and the second audio signal, the processed audio signal generated with adaptive soundscape control based on context information; and combining, with the at least one processor, the processed audio signal and the captured video signal as multimedia output.

    METHOD AND APPARATUS FOR SPEECH SOURCE SEPARATION BASED ON A CONVOLUTIONAL NEURAL NETWORK

    Publication No.: US20220223144A1

    Publication date: 2022-07-14

    Application No.: US17611121

    Filing date: 2020-05-13

    Abstract: Described herein is a method for Convolutional Neural Network (CNN) based speech source separation, wherein the method includes the steps of: (a) providing multiple frames of a time-frequency transform of an original noisy speech signal; (b) inputting the time-frequency transform of said multiple frames into an aggregated multi-scale CNN having a plurality of parallel convolution paths; (c) extracting and outputting, by each parallel convolution path, features from the input time-frequency transform of said multiple frames; (d) obtaining an aggregated output of the outputs of the parallel convolution paths; and (e) generating an output mask for extracting speech from the original noisy speech signal based on the aggregated output. Described herein are further an apparatus for CNN based speech source separation as well as a respective computer program product comprising a computer-readable storage medium with instructions adapted to carry out said method when executed by a device having processing capability.
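Steps (a) to (e) of the abstract above can be illustrated with a small, untrained sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the three kernel sizes standing in for "multi-scale" paths, summation as the aggregation, a sigmoid to form the mask, the random weights, and the helper names `conv2d_same`, `random_kernel` and `multiscale_mask`.

```python
import math
import random


def conv2d_same(x, kernel):
    """2-D correlation with zero padding, so the output keeps the input shape."""
    rows, cols = len(x), len(x[0])
    k = len(kernel)  # square kernel with odd side length
    pad = k // 2
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            acc = 0.0
            for i in range(k):
                for j in range(k):
                    rr, cc = r + i - pad, c + j - pad
                    if 0 <= rr < rows and 0 <= cc < cols:
                        acc += x[rr][cc] * kernel[i][j]
            out[r][c] = acc
    return out


def random_kernel(size, rng):
    """Hypothetical stand-in for trained weights (small random values)."""
    scale = 1.0 / (size * size)
    return [[rng.uniform(-scale, scale) for _ in range(size)] for _ in range(size)]


def multiscale_mask(spec, kernels):
    """Run one convolution path per kernel, aggregate by summation (assumed),
    and squash with a sigmoid to obtain a mask with values in (0, 1)."""
    paths = [conv2d_same(spec, k) for k in kernels]
    rows, cols = len(spec), len(spec[0])
    mask = [[0.0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            agg = sum(p[r][c] for p in paths)
            mask[r][c] = 1.0 / (1.0 + math.exp(-agg))
    return mask


rng = random.Random(0)
# (a) toy magnitude spectrogram: 6 frames x 8 frequency bins
spec = [[rng.random() for _ in range(8)] for _ in range(6)]
# (b), (c) three parallel paths with different receptive fields
kernels = [random_kernel(size, rng) for size in (1, 3, 5)]
# (d), (e) aggregate the paths and derive the mask
mask = multiscale_mask(spec, kernels)
# apply the mask to the noisy spectrogram to estimate the speech magnitudes
speech_estimate = [[m * s for m, s in zip(mrow, srow)]
                   for mrow, srow in zip(mask, spec)]
```

A real system would learn the kernels from data and would typically stack several layers per path; the sketch only shows the data flow from spectrogram to mask.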

    Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network

    Publication No.: US20190373397A1

    Publication date: 2019-12-05

    Application No.: US16541079

    Filing date: 2019-08-14

    IPC classification: H04S7/00 G10L19/008

    Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
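The two processing paths described in the abstract above can be sketched as follows. This is a toy illustration under stated assumptions, not the patented design: the four delay lengths, the Hadamard feedback matrix, the feedback gain, the three-tap "early" filters, the equal-weight downmix, and the function names `fir`, `fdn_reverb` and `virtualize` are all hypothetical. For brevity the same late reverberation is added to both ears, whereas a practical virtualizer would normally shape it separately per ear.

```python
def fir(signal, taps):
    """Direct-form FIR filter: stands in for the direct/early part of a BRIR."""
    out = [0.0] * len(signal)
    for n in range(len(signal)):
        for k, t in enumerate(taps):
            if n - k >= 0:
                out[n] += t * signal[n - k]
    return out


def fdn_reverb(signal, delays=(7, 11, 13, 17), gain=0.7):
    """Four-line feedback delay network. The 4x4 Hadamard matrix scaled by 0.5
    is orthogonal, so a gain below 1 keeps the feedback loop stable."""
    h = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]
    lines = [[0.0] * d for d in delays]  # circular buffers, one per delay line
    idx = [0] * 4
    out = []
    for x in signal:
        # oldest sample in each buffer = output of that delay line
        taps = [lines[i][idx[i]] for i in range(4)]
        out.append(0.25 * sum(taps))
        for i in range(4):
            feedback = gain * 0.5 * sum(h[i][j] * taps[j] for j in range(4))
            lines[i][idx[i]] = x + feedback
            idx[i] = (idx[i] + 1) % delays[i]
    return out


def virtualize(channels, early_left, early_right):
    """First path: per-channel early response. Second path: FDN on the downmix."""
    n = len(channels[0])
    downmix = [sum(ch[t] for ch in channels) / len(channels) for t in range(n)]
    late = fdn_reverb(downmix)  # common late reverberation
    left, right = [0.0] * n, [0.0] * n
    for ch, tl, tr in zip(channels, early_left, early_right):
        fl, fr = fir(ch, tl), fir(ch, tr)
        for t in range(n):
            left[t] += fl[t]
            right[t] += fr[t]
    left = [a + b for a, b in zip(left, late)]
    right = [a + b for a, b in zip(right, late)]
    return left, right


# Two input channels; only the first carries a unit impulse.
ch0 = [1.0] + [0.0] * 63
ch1 = [0.0] * 64
left, right = virtualize(
    [ch0, ch1],
    early_left=[[0.9, 0.3, 0.1], [0.4, 0.2, 0.1]],
    early_right=[[0.4, 0.2, 0.1], [0.9, 0.3, 0.1]],
)
```

Running the FDN once on the downmix, instead of convolving every channel with a full-length BRIR, is what keeps the cost of the late part independent of the channel count.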

    STEERING OF BINAURALIZATION OF AUDIO

    Publication No.: US20220279300A1

    Publication date: 2022-09-01

    Application No.: US17637446

    Filing date: 2020-08-19

    IPC classification: H04S7/00

    Abstract: A method for steering binauralization of audio is provided. The method comprises steps of: receiving (410) an audio input signal, calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal, based on the first confidence value, the state signal and an energy value of the audio frame; and generating (470) an audio output signal with steered binauralization by processing the audio input signal according to the steering signal.
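One way to picture the chain from confidence value to state signal to steering signal is the sketch below. The abstract does not specify how each quantity is computed, so the hysteresis thresholds, the energy floor, the fixed step size, the crossfade, and the names `steering_signal` and `mix_frame` are assumptions chosen for illustration. In this simplified version the confidence value influences the steering signal only through the state.

```python
def steering_signal(confidences, energies, on=0.7, off=0.3,
                    energy_floor=1e-4, step=0.25):
    """Per-frame steering values: 1.0 = apply the binauralizer fully,
    0.0 = bypass it because the input already looks binauralized."""
    state = False    # True once the input is believed to be binauralized
    steering = 1.0
    out = []
    for conf, energy in zip(confidences, energies):
        # State signal: hysteresis on the confidence value (assumed rule).
        if conf >= on:
            state = True
        elif conf <= off:
            state = False
        target = 0.0 if state else 1.0
        # Only move during frames with audible energy, so the change is
        # not made on silence (assumed rule).
        if energy > energy_floor:
            if steering < target:
                steering = min(target, steering + step)
            else:
                steering = max(target, steering - step)
        out.append(steering)
    return out


def mix_frame(dry, binauralized, s):
    """Crossfade one frame between the untouched and binauralized signals."""
    return [s * b + (1.0 - s) * d for d, b in zip(dry, binauralized)]


confidences = [0.1, 0.9, 0.9, 0.5, 0.9, 0.9, 0.2, 0.2]
energies = [1.0, 1.0, 1.0, 1.0, 0.0, 1.0, 1.0, 1.0]
steer = steering_signal(confidences, energies)
# steer == [1.0, 0.75, 0.5, 0.25, 0.25, 0.0, 0.25, 0.5]
```

The fourth frame (confidence 0.5) shows the hysteresis holding the previous state, and the fifth frame (zero energy) shows the steering value being frozen.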

    Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network

    Publication No.: US20200245094A1

    Publication date: 2020-07-30

    Application No.: US16777599

    Filing date: 2020-01-30

    IPC classification: H04S7/00 G10L19/008 H04S3/00

    Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.

    AUDIO SIGNAL PROCESSING BASED ON REMOTE USER CONTROL

    Publication No.: US20180167515A1

    Publication date: 2018-06-14

    Application No.: US15573049

    Filing date: 2016-05-26

    Abstract: Example embodiments disclosed herein relate to audio signal processing based on remote user control. A method of processing an audio signal in an audio sender device is disclosed. The method includes receiving, at a current device, a control parameter from a remote device, the control parameter being generated based on a user input of the remote device and specifying a user preference for an audio signal to be transmitted to the remote device. The method also includes processing the audio signal based on the received control parameter and transmitting the processed audio signal to the remote device. Corresponding computer program product of processing an audio signal and corresponding device are also disclosed. Corresponding method in an audio receiver device and computer program product of processing an audio signal as well as corresponding device are also disclosed.