-
公开(公告)号:US20230005469A1
公开(公告)日:2023-01-05
申请号:US17852548
申请日:2022-06-29
Applicant: PEXIP AS
Inventor: Anna Kim , Eamonn Shaw
IPC: G10L15/06 , G10L25/57 , G10L15/02 , G10L25/84 , G10L21/0232 , G10L21/0224
Abstract: A method of speech detection and speech enhancement in a speech detection and speech enhancement unit of Multipoint Conferencing Node (MCN) and a method of training the same. The method comprising receiving input audio segments, and determining an acoustic environment based on input audio auxiliary information, extracting T-F-domain features from the received input audio segments, determining if each of the received input audio segments is speech by inputting the T-F domain features into a speech detection classifier trained for the determined acoustic environment, determining, when one of the received input audio segments is speech, if the received audio segment is noisy speech by inputting the T-F domain features into a noise classifier using a statistical generative model representing the probability distributions of the T-F domain features of noisy speech trained for the determined acoustic environment, and applying a noise reduction mask on the received input audio segments according to the determination of the received audio segment is noisy speech