SCALABLE VOICE SCENE MEDIA SERVER

    公开(公告)号:US20220197592A1

    公开(公告)日:2022-06-23

    申请号:US17601199

    申请日:2020-04-03

    Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.

    DEEP-LEARNING BASED SPEECH ENHANCEMENT
    2.
    发明公开

    公开(公告)号:US20230368807A1

    公开(公告)日:2023-11-16

    申请号:US18250393

    申请日:2021-10-29

    CPC classification number: G10L21/0232 G10L19/022

    Abstract: A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain forming a contracting path. The encoder is followed by a corresponding decoder with steady up-sampling along the frequency domain forming an expanding path. The decoder receives scaled output feature maps from the encoder at a corresponding level. The decoder is followed by a classification block that generates a speech value indicating an amount of speech present for each frequency band of the plurality of frequency bands at each frame of the plurality of frames.

Patent Agency Ranking