Publication No.: EP4439558A2
Publication date: 2024-10-02
Application No.: EP24194500.5
Filing date: 2019-06-19
Inventors: HIJAZI, Samer; MAO, Xuehong; CASAS, Raul Alejandro; WOJCICKI, Kamil Krzysztof; MAYDAN, Dror; ROWEN, Christopher
IPC classification: G10L21/0208
CPC classification: H04N21/2335, H04N21/23418, H04N21/4394, H04N21/4398, H04N21/44008, G10L17/18, G10L21/0208, G10L25/30, G10L21/02, G06N3/084, G10L25/81, G10L25/84
Abstract: Systems and methods are disclosed for audio enhancement. For example, methods may include accessing audio data; determining a window of audio samples based on the audio data; inputting the window of audio samples to a classifier to obtain a classification, in which the classifier includes a neural network and the classification takes a value from a set of multiple classes of audio; selecting, based on the classification, an audio enhancement network from a set of multiple audio enhancement networks; applying the selected audio enhancement network to the window of audio samples to obtain an enhanced audio segment, in which the selected audio enhancement network includes a neural network that has been trained using audio signals of a type associated with the classification; and storing, playing, or transmitting an enhanced audio signal based on the enhanced audio segment.
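
The control flow described in the abstract (window, classify, select, enhance) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration and is not taken from the patent: the class labels, the window size and hop, the zero-crossing heuristic standing in for the neural-network classifier, and the fixed gains standing in for the trained enhancement networks. Only the dispatch structure reflects the abstract.

```python
from typing import Callable, Dict, List, Sequence

Window = List[float]
Enhancer = Callable[[Window], Window]


def make_windows(samples: Sequence[float], size: int, hop: int) -> List[Window]:
    """Split audio samples into fixed-size windows (size and hop are assumed values)."""
    return [list(samples[i:i + size]) for i in range(0, len(samples) - size + 1, hop)]


def classify(window: Window) -> str:
    """Hypothetical stand-in for the neural-network classifier.

    Uses the zero-crossing rate only so that the sketch runs without a model;
    the labels "speech", "music", and "noise" are assumed class names.
    """
    crossings = sum(1 for a, b in zip(window, window[1:]) if (a < 0) != (b < 0))
    rate = crossings / max(len(window) - 1, 1)
    if rate < 0.1:
        return "speech"
    if rate < 0.4:
        return "music"
    return "noise"


def scale(gain: float) -> Enhancer:
    """Hypothetical stand-in for a trained enhancement network: a fixed gain."""
    return lambda window: [gain * x for x in window]


# One enhancer per class, mirroring "a set of multiple audio enhancement networks".
ENHANCERS: Dict[str, Enhancer] = {
    "speech": scale(1.0),
    "music": scale(0.8),
    "noise": scale(0.1),
}


def enhance(samples: Sequence[float], size: int = 4, hop: int = 4) -> List[float]:
    """Classify each window, select the matching enhancer, and apply it."""
    enhanced: List[float] = []
    for window in make_windows(samples, size, hop):
        label = classify(window)        # classification from the set of classes
        enhancer = ENHANCERS[label]     # selection based on the classification
        enhanced.extend(enhancer(window))
    return enhanced
```

With non-overlapping windows (`hop == size`), the enhanced segments can simply be concatenated as above; overlapping windows would need an overlap-add step, which this sketch leaves out.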