-
Publication No.: US20220075820A1
Publication date: 2022-03-10
Application No.: US17017370
Filing date: 2020-09-10
Applicant: ADOBE INC.
Inventor: Seth Walker , Joy Oakyung Kim , Morgan Nicole Evans , Najika Skyler Halsema Yoo , Aseem Agarwala , Joel R. Brandt , Jovan Popovic , Lubomira Dontcheva , Dingzeyu Li , Hijung Shin , Xue Bai
IPC: G06F16/738 , G06T13/80 , G06F3/0482 , G06F3/0484 , G06F16/74 , G06F16/735 , G06F16/75
Abstract: Embodiments are directed to techniques for interacting with a hierarchical video segmentation by performing a metadata search. Generally, various types of metadata can be extracted from a video, such as a transcript of audio, keywords from the transcript, content or action tags visually extracted from video frames, and log event tags extracted from an associated temporal log. The extracted metadata is segmented into metadata segments and associated with corresponding video segments defined by a hierarchical video segmentation. As such, a metadata search can be performed to identify matching metadata segments and corresponding matching video segments defined by a particular level of the hierarchical segmentation. Matching metadata segments are emphasized in a composite list of the extracted metadata, and matching video segments are emphasized on the video timeline. Navigating to a different level of the hierarchy transforms the search results into corresponding coarser or finer segments defined by the level.
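Illustrative sketch (not taken from the patent text): the abstract describes matching metadata segments against a query and then mapping the matches to the video segments of whichever hierarchy level is active. A minimal Python sketch of that idea follows, assuming a plain substring match and time-overlap association. The names `Segment`, `MetadataSegment`, `search_metadata`, and `matching_video_segments`, and the sample data, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    start: float  # seconds
    end: float


@dataclass(frozen=True)
class MetadataSegment:
    start: float
    end: float
    text: str  # e.g. a transcript sentence, a keyword, or a tag


def search_metadata(metadata, query):
    """Return metadata segments whose text contains the query (assumed: substring match)."""
    q = query.lower()
    return [m for m in metadata if q in m.text.lower()]


def matching_video_segments(level, matches):
    """Return the segments of one hierarchy level that overlap any matching metadata segment."""
    return [
        s for s in level
        if any(m.start < s.end and s.start < m.end for m in matches)
    ]


# Two levels of a hypothetical pre-computed hierarchy over a 40-second video.
hierarchy = {
    "coarse": [Segment(0, 20), Segment(20, 40)],
    "fine": [Segment(0, 10), Segment(10, 20), Segment(20, 30), Segment(30, 40)],
}
metadata = [
    MetadataSegment(2, 6, "open the layers panel"),
    MetadataSegment(22, 27, "apply a blur filter"),
    MetadataSegment(31, 35, "export the file"),
]

matches = search_metadata(metadata, "blur")
fine = matching_video_segments(hierarchy["fine"], matches)
coarse = matching_video_segments(hierarchy["coarse"], matches)
print(fine)
print(coarse)
```

Switching from the "fine" to the "coarse" level reuses the same metadata matches and only changes which video segments are reported, which mirrors the abstract's description of search results being transformed when the hierarchy level changes.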
-
Publication No.: US20190279414A1
Publication date: 2019-09-12
Application No.: US15915872
Filing date: 2018-03-08
Applicant: Adobe Inc.
Inventor: Duygu Ceylan Aksit , Yangtuanfeng Wang , Niloy Jyoti Mitra , Mehmet Ersin Yumer , Jovan Popovic
Abstract: Systems and techniques provide a user interface within an application to enable users to designate a folded object image of a folded object, as well as a superimposed image of a superimposed object to be added to the folded object image. Within the user interface, the user may simply place the superimposed image over the folded object image to obtain the desired modified image. If the user places the superimposed image over one or more folds of the folded object image, portions of the superimposed image will be removed to create the illusion in the modified image that the removed portions are obscured by one or more folds.
-
Publication No.: US20190197758A1
Publication date: 2019-06-27
Application No.: US16291585
Filing date: 2019-03-04
Applicant: Adobe Inc.
Inventor: Daniel M. Kaufman , Jovan Popovic , Yunfei Bai
IPC: G06T13/80 , G06F3/0484
CPC classification number: G06T13/80 , A63B21/00061 , A63B22/0694 , G06F3/04815 , G06F3/04845 , G06F3/04847
Abstract: In embodiments of animation using keyframing and projected dynamics simulation, an animation object is displayed with handles associated with object regions for the animation object, each handle being selectable for setting animation constraints on an object region. An animation simulator receives a user input designating a particular handle with an animation constraint, and sets the animation constraint on the particular handle for the associated object region. The animation simulator also receives another user input, designating a timing of the object region associated with the particular handle of the animation object through multiple frames in an animation sequence. The animation simulator projects a simulation of the animation object utilizing a projected dynamics algorithm that applies physics to simulate the set of object regions of the animation object in the animation sequence, the simulation including simulating the object region associated with the particular handle based on the timing and the animation constraint.
-
Publication No.: US10290128B2
Publication date: 2019-05-14
Application No.: US14937683
Filing date: 2015-11-10
Applicant: Adobe Inc.
Inventor: Zhicheng Liu , Wilmot Wei-Mau Li , Alan G. Wilson , Lubomira A. Dontcheva , Jovan Popovic
Abstract: Techniques for interactive scene graph manipulation for visualization authoring are described. In implementations, visual marks are grouped into containers. Each container includes layout settings independent of other containers, and the layout settings are individually adjustable. The visual marks are configured to represent data values. Additionally, the containers are nested in a hierarchy. Then, data visualizations are constructed for display via a user interface of a display device. For example, the data visualizations can be constructed by applying data values to the visual marks and layout settings of the containers to the visual marks grouped within the nested containers to generate the data visualizations.
-
Publication No.: US20220301313A1
Publication date: 2022-09-22
Application No.: US17805076
Filing date: 2022-06-02
Applicant: ADOBE INC.
Inventor: Hijung Shin , Xue Bai , Aseem Agarwala , Joel R. Brandt , Jovan Popovic , Lubomira Dontcheva , Dingzeyu Li , Joy Oakyung Kim , Seth Walker
Abstract: Embodiments are directed to segmentation and hierarchical clustering of video. In an example implementation, a video is ingested to generate a multi-level hierarchical segmentation of the video. In some embodiments, the finest level identifies a smallest interaction unit of the video—semantically defined video segments of unequal duration called clip atoms. Clip atom boundaries are detected in various ways. For example, speech boundaries are detected from audio of the video, and scene boundaries are detected from video frames of the video. The detected boundaries are used to define the clip atoms, which are hierarchically clustered to form a multi-level hierarchical representation of the video. In some cases, the hierarchical segmentation identifies a static, pre-computed, hierarchical set of video segments, where each level of the hierarchical segmentation identifies a complete set (i.e., covering the entire range of the video) of disjoint (i.e., non-overlapping) video segments with a corresponding level of granularity.
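Illustrative sketch (not taken from the patent text): the abstract describes building clip atoms from detected speech and scene boundaries and clustering them into levels that each cover the whole video with disjoint segments. The Python sketch below shows one way this could look. The merge criterion (greedily joining the adjacent pair with the shortest combined duration) is an assumption, since the abstract does not state how clusters are chosen, and the function names and boundary values are hypothetical.

```python
def clip_atoms(duration, *boundary_lists):
    """Cut [0, duration] at every detected boundary; each interval is one clip atom."""
    cuts = {0.0, float(duration)}
    for boundaries in boundary_lists:
        cuts.update(float(b) for b in boundaries if 0 < b < duration)
    ordered = sorted(cuts)
    return list(zip(ordered[:-1], ordered[1:]))


def coarsen(segments):
    """Merge one adjacent pair (assumed criterion: shortest combined duration)."""
    i = min(
        range(len(segments) - 1),
        key=lambda k: segments[k + 1][1] - segments[k][0],
    )
    merged = (segments[i][0], segments[i + 1][1])
    return segments[:i] + [merged] + segments[i + 2:]


def build_hierarchy(atoms):
    """Return levels from finest (the clip atoms) to coarsest (the whole video)."""
    levels = [atoms]
    while len(levels[-1]) > 1:
        levels.append(coarsen(levels[-1]))
    return levels


speech_boundaries = [4.0, 9.5]   # hypothetical detector output, in seconds
scene_boundaries = [9.5, 15.0]   # hypothetical detector output, in seconds
atoms = clip_atoms(20.0, speech_boundaries, scene_boundaries)
levels = build_hierarchy(atoms)
for level in levels:
    print(level)
```

Because adjacent segments are only ever merged, every level remains a complete, non-overlapping cover of the video, which is the property the abstract attributes to each level of the hierarchy.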
-
Publication No.: US20220292831A1
Publication date: 2022-09-15
Application No.: US17805080
Filing date: 2022-06-02
Applicant: ADOBE INC.
Inventor: Hijung Shin , Xue Bai , Aseem Agarwala , Joel R. Brandt , Jovan Popovic , Lubomira Dontcheva , Dingzeyu Li , Joy Oakyung Kim , Seth Walker
Abstract: Embodiments are directed to segmentation and hierarchical clustering of video. In an example implementation, a video is ingested to generate a multi-level hierarchical segmentation of the video. In some embodiments, the finest level identifies a smallest interaction unit of the video—semantically defined video segments of unequal duration called clip atoms. Clip atom boundaries are detected in various ways. For example, speech boundaries are detected from audio of the video, and scene boundaries are detected from video frames of the video. The detected boundaries are used to define the clip atoms, which are hierarchically clustered to form a multi-level hierarchical representation of the video. In some cases, the hierarchical segmentation identifies a static, pre-computed, hierarchical set of video segments, where each level of the hierarchical segmentation identifies a complete set (i.e., covering the entire range of the video) of disjoint (i.e., non-overlapping) video segments with a corresponding level of granularity.
-
Publication No.: US20220292830A1
Publication date: 2022-09-15
Application No.: US17805075
Filing date: 2022-06-02
Applicant: ADOBE INC.
Inventor: Hijung Shin , Xue Bai , Aseem Agarwala , Joel R. Brandt , Jovan Popovic , Lubomira Dontcheva , Dingzeyu Li , Joy Oakyung Kim , Seth Walker
Abstract: Embodiments are directed to segmentation and hierarchical clustering of video. In an example implementation, a video is ingested to generate a multi-level hierarchical segmentation of the video. In some embodiments, the finest level identifies a smallest interaction unit of the video—semantically defined video segments of unequal duration called clip atoms. Clip atom boundaries are detected in various ways. For example, speech boundaries are detected from audio of the video, and scene boundaries are detected from video frames of the video. The detected boundaries are used to define the clip atoms, which are hierarchically clustered to form a multi-level hierarchical representation of the video. In some cases, the hierarchical segmentation identifies a static, pre-computed, hierarchical set of video segments, where each level of the hierarchical segmentation identifies a complete set (i.e., covering the entire range of the video) of disjoint (i.e., non-overlapping) video segments with a corresponding level of granularity.
-
Publication No.: US20220076707A1
Publication date: 2022-03-10
Application No.: US17330702
Filing date: 2021-05-26
Applicant: ADOBE INC.
Inventor: Seth Walker , Hijung Shin , Cristin Ailidh Fraser , Aseem Agarwala , Lubomira Dontcheva , Joel Richard Brandt , Jovan Popovic , Joy Oakyung Kim , Justin Salamon , Jui-hsien Wang , Timothy Jeewun Ganter , Xue Bai , Dingzeyu Li
IPC: G11B27/036 , G06F3/0486 , G06F3/0482
Abstract: Embodiments are directed to a snap point segmentation that defines the locations of selection snap points for a selection of video segments. Candidate snap points are determined from boundaries of feature ranges of the video indicating when instances of detected features are present in the video. In some embodiments, candidate snap point separations are penalized for being separated by less than a minimum duration corresponding to a minimum pixel separation between consecutive snap points on a video timeline. The snap point segmentation is computed by solving a shortest path problem through a graph that models different snap point locations and separations. When a user clicks or taps on the video timeline and drags, a selection snaps to the snap points defined by the snap point segmentation. In some embodiments, the snap points are displayed during a drag operation and disappear when the drag operation is released.
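Illustrative sketch (not taken from the patent text): the abstract describes choosing snap points by solving a shortest path problem over a graph of candidate locations, with a penalty when consecutive snap points are closer than a minimum duration. The Python sketch below models candidates as nodes of a directed acyclic graph and solves it by dynamic programming. The cost terms (`skip_cost` per dropped candidate, `close_penalty` for a too-short gap) and the function name `snap_points` are assumptions made for illustration; the abstract does not give the actual cost function.

```python
def snap_points(candidates, min_gap, skip_cost=1.0, close_penalty=5.0):
    """Pick snap points as the cheapest path from the first to the last candidate.

    An edge i -> j means "j is the next snap point after i". Its assumed cost is
    skip_cost for each candidate dropped in between, plus close_penalty when the
    two points are closer together than min_gap.
    """
    pts = sorted(set(candidates))
    n = len(pts)
    best = [float("inf")] * n   # cheapest known cost to reach each candidate
    prev = [-1] * n             # predecessor on that cheapest path
    if n:
        best[0] = 0.0
    for j in range(1, n):
        for i in range(j):
            cost = skip_cost * (j - i - 1)
            if pts[j] - pts[i] < min_gap:
                cost += close_penalty
            if best[i] + cost < best[j]:
                best[j] = best[i] + cost
                prev[j] = i
    # Walk predecessors back from the last candidate to recover the path.
    path = []
    k = n - 1
    while k != -1:
        path.append(pts[k])
        k = prev[k]
    return path[::-1]


# Hypothetical feature-range boundaries, in seconds.
candidates = [0.0, 0.2, 3.0, 3.1, 6.0, 9.0]
selected = snap_points(candidates, min_gap=1.0)
print(selected)
```

In this sketch `min_gap` stands in for the duration that corresponds to the minimum pixel separation on the timeline, so it would be recomputed whenever the timeline zoom changes.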
-
Publication No.: US20220076706A1
Publication date: 2022-03-10
Application No.: US17330689
Filing date: 2021-05-26
Applicant: ADOBE INC.
Inventor: Seth Walker , Hijung Shin , Cristin Ailidh Fraser , Aseem Agarwala , Lubomira Dontcheva , Joel Richard Brandt , Jovan Popovic , Joy Oakyung Kim , Justin Salamon , Jui-hsien Wang , Timothy Jeewun Ganter , Xue Bai , Dingzeyu Li
IPC: G11B27/036 , G06F3/0482 , G06F3/0486
Abstract: Embodiments are directed to interactive tiles that represent video segments of a segmentation of a video. In some embodiments, each interactive tile represents a different video segment from a particular video segmentation (e.g., a default video segmentation). Each interactive tile includes a thumbnail (e.g., the first frame of the video segment represented by the tile), some transcript from the beginning of the video segment, a visualization of detected faces in the video segment, and one or more faceted timelines that visualize a category of detected features (e.g., a visualization of detected visual scenes, audio classifications, visual artifacts). In some embodiments, interacting with a particular interactive tile navigates to a corresponding portion of the video, adds a corresponding video segment to a selection, and/or scrubs through tile thumbnails.
-
Publication No.: US10699705B2
Publication date: 2020-06-30
Application No.: US16016418
Filing date: 2018-06-22
Applicant: Adobe Inc.
Inventor: Wilmot Li , Jovan Popovic , Deepali Aneja , David Simons
IPC: G10L15/197 , G06N3/04 , G06N3/08 , G10L15/02 , G10L15/06 , G10L21/0316 , G10L25/21 , G10L25/24
Abstract: Disclosed systems and methods predict visemes from an audio sequence. A viseme-generation application accesses a first set of training data that includes a first audio sequence representing a sentence spoken by a first speaker and a sequence of visemes. Each viseme is mapped to a respective audio sample of the first audio sequence. The viseme-generation application creates a second set of training data adjusting a second audio sequence spoken by a second speaker speaking the sentence such that the second and first sequences have the same length and at least one phoneme occurs at the same time stamp in the first sequence and in the second sequence. The viseme-generation application maps the sequence of visemes to the second audio sequence and trains a viseme prediction model to predict a sequence of visemes from an audio sequence.