PREDICTING VIDEO EDITS FROM TEXT-BASED CONVERSATIONS USING NEURAL NETWORKS

    公开(公告)号:US20240163393A1

    公开(公告)日:2024-05-16

    申请号:US18055301

    申请日:2022-11-14

    Applicant: Adobe Inc.

    CPC classification number: H04N7/002 G06T11/60

    Abstract: Embodiments are disclosed for predicting, using neural networks, editing operations for application to a video sequence based on processing conversational messages by a video editing system. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving an input including a video sequence and text sentences, the text sentences describing a modification to the video sequence, mapping, by a first neural network content of the text sentences describing the modification to the video sequence to a candidate editing operation, processing, by a second neural network, the video sequence to predict parameter values for the candidate editing operation, and generating a modified video sequence by applying the candidate editing operation with the predicted parameter values to the video sequence.

Patent Agency Ranking