DIGITAL IMAGE REPOSING TECHNIQUES
    31.
    发明申请

    公开(公告)号:US20240428564A1

    公开(公告)日:2024-12-26

    申请号:US18213118

    申请日:2023-06-22

    Applicant: Adobe Inc.

    Abstract: In implementations of systems for generating images for human reposing, a computing device implements a reposing system to receive input data describing an input digital image depicting a person in a first pose, a first plurality of keypoints representing the first pose, and a second plurality of keypoints representing a second pose. The reposing system generates a mapping by processing the input data using a first machine learning model. The mapping indicates a plurality of first portions of the person in the second pose that are visible in the input digital image and a plurality of second portions of the person in the second pose that are invisible in the input digital image. The reposing system generates an output digital image depicting the person in the second pose by processing the mapping, the first plurality of keypoints, and the second plurality of keypoints using a second machine learning model.

    Model training with retrospective loss

    公开(公告)号:US11797823B2

    公开(公告)日:2023-10-24

    申请号:US16793551

    申请日:2020-02-18

    Applicant: Adobe Inc.

    Abstract: Generating a machine learning model that is trained using retrospective loss is described. A retrospective loss system receives an untrained machine learning model and a task for training the model. The retrospective loss system initially trains the model over warm-up iterations using task-specific loss that is determined based on a difference between predictions output by the model during training on input data and a ground truth dataset for the input data. Following the warm-up training iterations, the retrospective loss system continues to train the model using retrospective loss, which is model-agnostic and constrains the model such that a subsequently output prediction is more similar to the ground truth dataset than the previously output prediction. After determining that the model's outputs are within a threshold similarity to the ground truth dataset, the model is output with its current parameters as a trained model.

    FORM STRUCTURE EXTRACTION BY PREDICTING ASSOCIATIONS

    公开(公告)号:US20230267345A1

    公开(公告)日:2023-08-24

    申请号:US18135948

    申请日:2023-04-18

    Applicant: Adobe Inc.

    CPC classification number: G06N5/04 G06N3/08 G06N20/00 G06N20/10 G06V10/82

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Form structure extraction by predicting associations

    公开(公告)号:US11657306B2

    公开(公告)日:2023-05-23

    申请号:US16904263

    申请日:2020-06-17

    Applicant: Adobe Inc.

    CPC classification number: G06N5/04 G06N3/08 G06N20/00 G06N20/10 G06V10/82

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Refining Element Associations for Form Structure Extraction

    公开(公告)号:US20230134460A1

    公开(公告)日:2023-05-04

    申请号:US17517434

    申请日:2021-11-02

    Applicant: Adobe Inc.

    Abstract: In implementations of refining element associations for form structure extraction, a computing device implements a structure system to receive estimate data describing estimated associations of elements included in a form and a digital image depicting the form. An image patch is extracted from the digital image, and the image patch depicts a pair of elements of the elements included in the form. The structure system encodes an indication of whether the pair of elements have an association of the estimated associations. An indication is generated that the pair of elements have a particular association based at least partially on the encoded indication, bounding boxes of the pair of elements, and text depicted in the image patch.

    TEXT-CONDITIONED IMAGE SEARCH BASED ON TRANSFORMATION, AGGREGATION, AND COMPOSITION OF VISIO-LINGUISTIC FEATURES

    公开(公告)号:US20220245391A1

    公开(公告)日:2022-08-04

    申请号:US17160893

    申请日:2021-01-28

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for text-conditioned image searching. A methodology implementing the techniques includes decomposing a source image into visual feature vectors associated with different levels of granularity. The method also includes decomposing a text query (defining a target image attribute) into feature vectors associated with different levels of granularity including a global text feature vector. The method further includes generating image-text embeddings based on the visual feature vectors and the text feature vectors to encode information from visual and textual features. The method further includes composing a visio-linguistic representation based on a hierarchical aggregation of the image-text embeddings to encode visual and textual information at multiple levels of granularity. The method further includes identifying a target image that includes the visio-linguistic representation and the global text feature vector, so that the target image relates to the target image attribute, and providing the target image as an image search result.

Patent Agency Ranking