Image object segmentation based on temporal information

    公开(公告)号:US11379987B2

    公开(公告)日:2022-07-05

    申请号:US17020023

    申请日:2020-09-14

    Applicant: ADOBE INC.

    Abstract: A temporal object segmentation system determines a location of an object depicted in a video. In some cases, the temporal object segmentation system determines the object's location in a particular frame of the video based on information indicating a previous location of the object in a previous video frame. For example, an encoder neural network in the temporal object segmentation system extracts features describing image attributes of a video frame. A convolutional long-short term memory neural network determines the location of the object in the frame, based on the extracted image attributes and information indicating a previous location in a previous frame. A decoder neural network generates an image mask indicating the object's location in the frame. In some cases, a video editing system receives multiple generated masks for a video, and modifies one or more video frames based on the locations indicated by the masks.

    Image white balancing
    33.
    发明授权

    公开(公告)号:US11044450B2

    公开(公告)日:2021-06-22

    申请号:US16435371

    申请日:2019-06-07

    Abstract: Techniques are described for white balancing an input image by determining a color transformation for the input image based on color transformations that have been computed for training images whose color characteristics are similar to those of the input image. Techniques are also described for generating a training dataset comprising color information for a plurality of training images and color transformation information for the plurality of training images. The color information in the training dataset is searched to identify a subset of training images that are most similar in color to the input image. The color transformation for the input image is then computed by combining color transformation information for the identified training images. The contribution of the color transformation information for any given training image to the combination can be weighted based on the degree of color similarity between the input image and the training image.

    Interactive image matting using neural networks

    公开(公告)号:US11004208B2

    公开(公告)日:2021-05-11

    申请号:US16365213

    申请日:2019-03-26

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for deep neural network (DNN) based interactive image matting. A methodology implementing the techniques according to an embodiment includes generating, by the DNN, an alpha matte associated with an image, based on user-specified foreground region locations in the image. The method further includes applying a first DNN subnetwork to the image, the first subnetwork trained to generate a binary mask based on the user input, the binary mask designating pixels of the image as background or foreground. The method further includes applying a second DNN subnetwork to the generated binary mask, the second subnetwork trained to generate a trimap based on the user input, the trimap designating pixels of the image as background, foreground, or uncertain status. The method further includes applying a third DNN subnetwork to the generated trimap, the third subnetwork trained to generate the alpha matte based on the user input.

    MULTI-MODULE AND MULTI-TASK MACHINE LEARNING SYSTEM BASED ON AN ENSEMBLE OF DATASETS

    公开(公告)号:US20200349464A1

    公开(公告)日:2020-11-05

    申请号:US16401548

    申请日:2019-05-02

    Applicant: Adobe Inc.

    Abstract: Techniques and systems are provided for training a machine learning model using different datasets to perform one or more tasks. The machine learning model can include a first sub-module configured to perform a first task and a second sub-module configured to perform a second task. The first sub-module can be selected for training using a first training dataset based on a format of the first training dataset. The first sub-module can then be trained using the first training dataset to perform the first task. The second sub-module can be selected for training using a second training dataset based on a format of the second training dataset. The second sub-module can then be trained using the second training dataset to perform the second task.

    INTERACTIVE IMAGE MATTING USING NEURAL NETWORKS

    公开(公告)号:US20200311946A1

    公开(公告)日:2020-10-01

    申请号:US16365213

    申请日:2019-03-26

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for deep neural network (DNN) based interactive image matting. A methodology implementing the techniques according to an embodiment includes generating, by the DNN, an alpha matte associated with an image, based on user-specified foreground region locations in the image. The method further includes applying a first DNN subnetwork to the image, the first subnetwork trained to generate a binary mask based on the user input, the binary mask designating pixels of the image as background or foreground. The method further includes applying a second DNN subnetwork to the generated binary mask, the second subnetwork trained to generate a trimap based on the user input, the trimap designating pixels of the image as background, foreground, or uncertain status. The method further includes applying a third DNN subnetwork to the generated trimap, the third subnetwork trained to generate the alpha matte based on the user input.

    Page segmentation of vector graphics documents

    公开(公告)号:US10699111B2

    公开(公告)日:2020-06-30

    申请号:US16251568

    申请日:2019-01-18

    Applicant: Adobe Inc.

    Abstract: Disclosed systems and methods generate page segmented documents from unstructured vector graphics documents. The page segmentation application executing on a computing device receives as input an unstructured vector graphics document. The application generates an element proposal for each of many areas on a page of the input document tentatively identified as being page elements. The page segmentation application classifies each of the element proposals into one of a plurality of defined type of categories of page elements. The page segmentation application may further refine at least one of the element proposals and select a final element proposal for each element within the unstructured vector document. One or more of the page segmentation steps may be performed using a neural network.

Patent Agency Ranking