-
公开(公告)号:US20240161735A1
公开(公告)日:2024-05-16
申请号:US18055739
申请日:2022-11-15
Applicant: Adobe Inc.
Inventor: Justin SALAMON , Juan-Pablo CACERES CHOMALI , Ge ZHU , Nicholas J. BRYAN
Abstract: Embodiments are disclosed for performing a filler word detection process on input audio by a media editing system using trained neural networks. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an audio sequence, analyzing the audio sequence to determine filler word candidates, classifying, by a filler word classification model, each filler word candidate of the filler word candidates into one of a set of categories, and generating an output audio sequence, the output audio sequence including an identification of a subset of the filler word candidates in a filler words category of the set of categories as identified filler words.