USER VOICE ACTIVITY DETECTION USING DYNAMIC CLASSIFIER

    公开(公告)号:WO2022076963A1

    公开(公告)日:2022-04-14

    申请号:PCT/US2021/071503

    申请日:2021-09-17

    Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including first audio data corresponding to a first output of a first microphone and second audio data corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a dynamic classifier. The dynamic classifier is configured to generate a classification output corresponding to the audio data. The one or more processors are further configured to execute the instructions to determine, at least partially based on the classification output, whether the audio data corresponds to user voice activity.

    NOISE SUPPRESSION USING TANDEM NETWORKS
    6.
    发明申请

    公开(公告)号:WO2023004223A1

    公开(公告)日:2023-01-26

    申请号:PCT/US2022/073104

    申请日:2022-06-23

    Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to receive audio data including a first audio frame corresponding to a first output of a first microphone and a second audio frame corresponding to a second output of a second microphone. The one or more processors are also configured to execute the instructions to provide the audio data to a first noisesuppression network and a second noise-suppression network. The first noisesuppression network is configured to generate a first noise-suppressed audio frame and the second noise-suppression network is configured to generate a second noisesuppressed audio frame. The one or more processors are further configured to execute the instructions to provide the noise-suppressed audio frames to an attention-pooling network. The attention-pooling network is configured to generate an output noisesuppressed audio frame.

    ACTIVE SELF-VOICE NATURALIZATION USING A BONE CONDUCTION SENSOR

    公开(公告)号:WO2022076493A1

    公开(公告)日:2022-04-14

    申请号:PCT/US2021/053674

    申请日:2021-10-06

    Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device to receive an input audio signal from one or more outer microphones, an input audio signal from one or more inner microphones, and a bone conduction signal from a bone conduction sensor based on the input audio signals. The wearable device may filter the bone conduction signal based on a set of frequencies of the input audio signals, such as a low frequency portion of the input audio signals. For example, the wearable device may apply a filter to the bone conduction signal that accounts for an error in the input audio signals. The wearable device may add a gain to the filtered bone conduction signal and may equalize the filtered bone conduction signal based on the gain. The wearable device may output an audio signal to a speaker.

Patent Agency Ranking