MULTITRACK EFFECT VISUALIZATION AND INTERACTION FOR TEXT-BASED VIDEO EDITING

    公开(公告)号:US20240233769A1

    公开(公告)日:2024-07-11

    申请号:US18152328

    申请日:2023-01-10

    申请人: Adobe Inc.

    摘要: Embodiments of the present disclosure provide systems, methods, and computer storage media providing visualizations and mechanisms utilized when performing video edits using wrapped timelines (e.g., effect bars/effect tracks) interspersed between text lines representing video effects being applied to text segments in a transcript. An example embodiment provides a transcript using an audio track from a transcribed video. A transcript interface presents the transcript and accepts an input selecting sentences or words from the transcript. The identified boundaries corresponding to the selected text segment are used as boundaries for a selected video segment. Using the selected text segment, a user selects a video effect in which to apply to the corresponding video segment and within the transcript interface, a wrapped timeline is placed in the transcript along the selected text segment to indicate that the video effect is applied to the corresponding video segment.

    Interacting with hierarchical clusters of video segments using a video timeline

    公开(公告)号:US11630562B2

    公开(公告)日:2023-04-18

    申请号:US17017366

    申请日:2020-09-10

    申请人: ADOBE INC.

    摘要: Embodiments are directed to techniques for interacting with a hierarchical video segmentation using a video timeline. In some embodiments, the finest level of a hierarchical segmentation identifies the smallest interaction unit of a video—semantically defined video segments of unequal duration called clip atoms, and higher levels cluster the clip atoms into coarser sets of video segments. A presented video timeline is segmented based on one of the levels, and one or more segments are selected through interactions with the video timeline. For example, a click or tap on a video segment or a drag operation dragging along the timeline snaps selection boundaries to corresponding segment boundaries defined by the level. Navigating to a different level of the hierarchy transforms the selection into coarser or finer video segments defined by the level. Any operation can be performed on selected video segments, including playing back, trimming, or editing.

    Interacting with hierarchical clusters of video segments using a metadata panel

    公开(公告)号:US11995894B2

    公开(公告)日:2024-05-28

    申请号:US17017353

    申请日:2020-09-10

    申请人: ADOBE INC.

    IPC分类号: G06V20/40 G06T7/10

    摘要: Embodiments are directed to techniques for interacting with a hierarchical video segmentation using a metadata panel with a composite list of video metadata. The composite list is segmented into selectable metadata segments at locations corresponding to boundaries of video segments defined by a hierarchical segmentation. In some embodiments, the finest level of a hierarchical segmentation identifies the smallest interaction unit of a video—semantically defined video segments of unequal duration called clip atoms, and higher levels cluster the clip atoms into coarser sets of video segments. One or more metadata segments can be selected in various ways, such as by clicking or tapping on a metadata segment or by performing a metadata search. When a metadata segment is selected, a corresponding video segment is emphasized on the video timeline, a playback cursor is moved to the first video frame of the video segment, and the first video frame is presented.

    Segmentation and hierarchical clustering of video

    公开(公告)号:US11450112B2

    公开(公告)日:2022-09-20

    申请号:US17017344

    申请日:2020-09-10

    申请人: ADOBE INC.

    摘要: Embodiments are directed to segmentation and hierarchical clustering of video. In an example implementation, a video is ingested to generate a multi-level hierarchical segmentation of the video. In some embodiments, the finest level identifies a smallest interaction unit of the video—semantically defined video segments of unequal duration called clip atoms. Clip atom boundaries are detected in various ways. For example, speech boundaries are detected from audio of the video, and scene boundaries are detected from video frames of the video. The detected boundaries are used to define the clip atoms, which are hierarchically clustered to form a multi-level hierarchical representation of the video. In some cases, the hierarchical segmentation identifies a static, pre-computed, hierarchical set of video segments, where each level of the hierarchical segmentation identifies a complete set (i.e., covering the entire range of the video) of disjoint (i.e., non-overlapping) video segments with a corresponding level of granularity.

    Pose selection and animation of characters using video data and training techniques

    公开(公告)号:US11282257B2

    公开(公告)日:2022-03-22

    申请号:US16692471

    申请日:2019-11-22

    摘要: This disclosure generally relates to character animation. More specifically, but not by way of limitation, this disclosure relates to pose selection using data analytics techniques applied to training data, and generating 2D animations of illustrated characters using performance data and the selected poses. An example process or system includes obtaining a selection of training poses of the subject and a set of character poses, obtaining a performance video of the subject, wherein the performance video includes a plurality of performance frames that include poses performed by the subject, grouping the plurality of performance frames into groups of performance frames, assigning a selected training pose from the selection of training poses to each group of performance frames using the clusters of training frames, generating a sequence of character poses based on the groups of performance frames and their assigned training poses, outputting the sequence of character poses.