METHOD AND SYSTEM FOR SPEECH DETECTION AND SPEECH ENHANCEMENT

    公开(公告)号:US20230005469A1

    公开(公告)日:2023-01-05

    申请号:US17852548

    申请日:2022-06-29

    Applicant: PEXIP AS

    Abstract: A method of speech detection and speech enhancement in a speech detection and speech enhancement unit of Multipoint Conferencing Node (MCN) and a method of training the same. The method comprising receiving input audio segments, and determining an acoustic environment based on input audio auxiliary information, extracting T-F-domain features from the received input audio segments, determining if each of the received input audio segments is speech by inputting the T-F domain features into a speech detection classifier trained for the determined acoustic environment, determining, when one of the received input audio segments is speech, if the received audio segment is noisy speech by inputting the T-F domain features into a noise classifier using a statistical generative model representing the probability distributions of the T-F domain features of noisy speech trained for the determined acoustic environment, and applying a noise reduction mask on the received input audio segments according to the determination of the received audio segment is noisy speech

Patent Agency Ranking