DETECTING AND CLASSIFYING FILLER WORDS IN AUDIO USING NEURAL NETWORKS

    公开(公告)号:US20240161735A1

    公开(公告)日:2024-05-16

    申请号:US18055739

    申请日:2022-11-15

    Applicant: Adobe Inc.

    CPC classification number: G10L15/16 G10L15/22 G10L25/78

    Abstract: Embodiments are disclosed for performing a filler word detection process on input audio by a media editing system using trained neural networks. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including an audio sequence, analyzing the audio sequence to determine filler word candidates, classifying, by a filler word classification model, each filler word candidate of the filler word candidates into one of a set of categories, and generating an output audio sequence, the output audio sequence including an identification of a subset of the filler word candidates in a filler words category of the set of categories as identified filler words.

Patent Agency Ranking