GENERATING IMPROVED PANOPTIC SEGMENTED DIGITAL IMAGES BASED ON PANOPTIC SEGMENTATION NEURAL NETWORKS THAT UTILIZE EXEMPLAR UNKNOWN OBJECT CLASSES

    公开(公告)号:US20220375090A1

    公开(公告)日:2022-11-24

    申请号:US17319979

    申请日:2021-05-13

    Applicant: Adobe Inc.

    Abstract: This disclosure describes one or more implementations of a panoptic segmentation system that generates panoptic segmented digital images that classify both known and unknown instances of digital images. For example, the panoptic segmentation system builds and utilizes a panoptic segmentation neural network to discover, cluster, and segment new unknown object subclasses for previously unknown object instances. In addition, the panoptic segmentation system can determine additional unknown object instances from additional digital images. Moreover, in some implementations, the panoptic segmentation system utilizes the newly generated unknown object subclasses to refine and tune the panoptic segmentation neural network to improve the detection of unknown object instances in input digital images.

    GENERATING ACTION TAGS FOR DIGITAL VIDEOS

    公开(公告)号:US20210409836A1

    公开(公告)日:2021-12-30

    申请号:US17470441

    申请日:2021-09-09

    Applicant: Adobe Inc.

    Abstract: Systems, methods, and non-transitory computer-readable media are disclosed for automatic tagging of videos. In particular, in one or more embodiments, the disclosed systems generate a set of tagged feature vectors (e.g., tagged feature vectors based on action-rich digital videos) to utilize to generate tags for an input digital video. For instance, the disclosed systems can extract a set of frames for the input digital video and generate feature vectors from the set of frames. In some embodiments, the disclosed systems generate aggregated feature vectors from the feature vectors. Furthermore, the disclosed systems can utilize the feature vectors (or aggregated feature vectors) to identify similar tagged feature vectors from the set of tagged feature vectors. Additionally, the disclosed systems can generate a set of tags for the input digital videos by aggregating one or more tags corresponding to identified similar tagged feature vectors.

    Utilizing an inertial measurement device to adjust orientation of panorama digital images

    公开(公告)号:US10600150B2

    公开(公告)日:2020-03-24

    申请号:US15339017

    申请日:2016-10-31

    Applicant: Adobe Inc.

    Abstract: The present disclosure includes methods and systems for modifying orientation of a spherical panorama digital image based on an inertial measurement device. In particular, one or more embodiments of the disclosed systems and methods correct for tilt and/or roll in a digital camera utilized to capture a spherical panorama digital images by detecting changes in orientation to an inertial measurement device and generating an enhanced spherical panorama digital image based on the detect changes. In particular, in one or more embodiments, the disclosed systems and methods modify orientation of a spherical panorama digital image in three-dimensional space based on changes in orientation to an inertial measurement device and resample pixels based on the modified orientation to generate an enhanced spherical panorama digital image.

    ACTIVE LEARNING METHOD FOR TEMPORAL ACTION LOCALIZATION IN UNTRIMMED VIDEOS

    公开(公告)号:US20190325275A1

    公开(公告)日:2019-10-24

    申请号:US15957419

    申请日:2018-04-19

    Applicant: Adobe Inc.

    Abstract: Various embodiments describe active learning methods for training temporal action localization models used to localize actions in untrimmed videos. A trainable active learning selection function is used to select unlabeled samples that can improve the temporal action localization model the most. The select unlabeled samples are then annotated and used to retrain the temporal action localization model. In some embodiment, the trainable active learning selection function includes a trainable performance prediction model that maps a video sample and a temporal action localization model to a predicted performance improvement for the temporal action localization model.

    VIDEO OBJECT SEGMENTATION BY REFERENCE-GUIDED MASK PROPAGATION

    公开(公告)号:US20190311202A1

    公开(公告)日:2019-10-10

    申请号:US15949935

    申请日:2018-04-10

    Applicant: Adobe Inc.

    Abstract: Various embodiments describe video object segmentation using a neural network and the training of the neural network. The neural network both detects a target object in the current frame based on a reference frame and a reference mask that define the target object and propagates the segmentation mask of the target object for a previous frame to the current frame to generate a segmentation mask for the current frame. In some embodiments, the neural network is pre-trained using synthetically generated static training images and is then fine-tuned using training videos.

Patent Agency Ranking