Generating Images for Virtual Try-On and Pose Transfer

    公开(公告)号:US20230267663A1

    公开(公告)日:2023-08-24

    申请号:US17678237

    申请日:2022-02-23

    Applicant: Adobe Inc.

    CPC classification number: G06T11/60 G06T7/70 G06T7/11 G06N3/0454

    Abstract: In implementations of systems for generating images for virtual try-on and pose transfer, a computing device implements a generator system to receive input data describing a first digital image that depicts a person in a pose and a second digital image that depicts a garment. Candidate appearance flow maps are computed that warp the garment based on the pose at different pixel-block sizes using a first machine learning model. The generator system generates a warped garment image by combining the candidate appearance flow maps as an aggregate per-pixel displacement map using a convolutional gated recurrent network. A conditional segment mask is predicted that segments portions of a geometry of the person using a second machine learning model. The generator system outputs a digital image that depicts the person in the pose wearing the garment based on the warped garment image and the conditional segmentation mask using a third machine learning model.

    Deep learning based visual compatibility prediction for bundle recommendations

    公开(公告)号:US11640634B2

    公开(公告)日:2023-05-02

    申请号:US16865572

    申请日:2020-05-04

    Applicant: ADOBE INC.

    Abstract: Systems, methods, and computer storage media are disclosed for predicting visual compatibility between a bundle of catalog items (e.g., a partial outfit) and a candidate catalog item to add to the bundle. Visual compatibility prediction may be jointly conditioned on item type, context, and style by determining a first compatibility score jointly conditioned on type (e.g., category) and context, determining a second compatibility score conditioned on outfit style, and combining the first and second compatibility scores into a unified visual compatibility score. A unified visual compatibility score may be determined for each of a plurality of candidate items, and the candidate item with the highest unified visual compatibility score may be selected to add to the bundle (e.g., fill the in blank for the partial outfit).

    Generating a data visualization graph utilizing modularity-based manifold tearing

    公开(公告)号:US11631205B2

    公开(公告)日:2023-04-18

    申请号:US17657255

    申请日:2022-03-30

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generate interactive visual shape representation of digital datasets. For example, the disclosed systems can generate an augmented nearest neighbor network graph from a sampled subset of digital data points using a nearest neighbor model and witness complex model. The disclosed system can further generate a landmark network graph based on the augmented nearest neighbor network graph utilizing a plurality of random walks. The disclosed systems can also generate a loop-augmented spanning network graph based on a partition of the landmark network graph by adding community edges between communities of landmark groups based on modularity and to complete community loops. Based on the loop-augmented spanning network graph, the disclosed systems can generate an interactive visual shape representation for display on a client device.

    INTERACTIVE SEARCH EXPERIENCE USING MACHINE LEARNING

    公开(公告)号:US20220245141A1

    公开(公告)日:2022-08-04

    申请号:US17656772

    申请日:2022-03-28

    Applicant: Adobe Inc.

    Abstract: An interactive search session is implemented using an artificial intelligence model. For example, when the artificial intelligence model receives a search query from a user, the model selects an action from a plurality of actions based on the search query. The selected action queries the user for more contextual cues about the search query (e.g., may enquire about use of the search results, may request to refine the search query, or otherwise engage the user in conversation to better understand the intent of the search). The interactive search session may be in the form, for example, of a chat session between the user and the system, and the chat session may be displayed along with the search results (e.g., in a separate section of display). The interactive search session may enable the system to better understand the user's search needs, and accordingly may help provide more focused search results.

    Interactive search experience using machine learning

    公开(公告)号:US11294891B2

    公开(公告)日:2022-04-05

    申请号:US16394853

    申请日:2019-04-25

    Applicant: Adobe Inc.

    Abstract: Techniques are disclosed for providing an interactive search session. The interactive search session is implemented using an artificial intelligence model. For example, when the artificial intelligence model receives a search query from a user, the model selects an action from a plurality of actions based on the search query. The selected action queries the user for more contextual cues about the search query (e.g., may enquire about use of the search results, may request to refine the search query, or otherwise engage the user in conversation to better understand the intent of the search). The interactive search session may be in the form, for example, of a chat session between the user and the system, and the chat session may be displayed along with the search results (e.g., in a separate section of display). The interactive search session may enable the system to better understand the user's search needs, and accordingly may help provide more focused search results.

    ATTRIBUTIONALLY ROBUST TRAINING FOR WEAKLY SUPERVISED LOCALIZATION AND SEGMENTATION

    公开(公告)号:US20220012530A1

    公开(公告)日:2022-01-13

    申请号:US16926511

    申请日:2020-07-10

    Applicant: Adobe Inc.

    Abstract: Embodiments are disclosed for training a neural network classifier to learn to more closely align an input image with its attribution map. In particular, in one or more embodiments, the disclosed systems and methods comprise receiving a training image comprising a representation of one or more objects, the training image associated with at least one label for the representation of the one or more objects, generating a perturbed training image based on the training image using a neural network, and training the neural network using the perturbed training image by minimizing a combination of classification loss and attribution loss to learn to align an image with its corresponding attribution map.

    GENERATING A HIGH-DIMENSIONAL NETWORK GRAPH FOR DATA VISUALIZATION UTILIZING LANDMARK DATA POINTS AND MODULARITY-BASED MANIFOLD TEARING

    公开(公告)号:US20210327108A1

    公开(公告)日:2021-10-21

    申请号:US16850677

    申请日:2020-04-16

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that generate interactive visual shape representation of digital datasets. For example, the disclosed systems can generate an augmented nearest neighbor network graph from a sampled subset of digital data points using a nearest neighbor model and witness complex model. The disclosed system can further generate a landmark network graph based on the augmented nearest neighbor network graph utilizing a plurality of random walks. The disclosed systems can also generate a loop-augmented spanning network graph based on a partition of the landmark network graph by adding community edges between communities of landmark groups based on modularity and to complete community loops. Based on the loop-augmented spanning network graph, the disclosed systems can generate an interactive visual shape representation for display on a client device.

    IDENTIFYING DIGITAL ATTRIBUTES FROM MULTIPLE ATTRIBUTE GROUPS WITHIN TARGET DIGITAL IMAGES UTILIZING A DEEP COGNITIVE ATTRIBUTION NEURAL NETWORK

    公开(公告)号:US20210073267A1

    公开(公告)日:2021-03-11

    申请号:US16564831

    申请日:2019-09-09

    Applicant: Adobe, Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media for generating tags for an object portrayed in a digital image based on predicted attributes of the object. For example, the disclosed systems can utilize interleaved neural network layers of alternating inception layers and dilated convolution layers to generate a localization feature vector. Based on the localization feature vector, the disclosed systems can generate attribute localization feature embeddings, for example, using some pooling layer such as a global average pooling layer. The disclosed systems can then apply the attribute localization feature embeddings to corresponding attribute group classifiers to generate tags based on predicted attributes. In particular, attribute group classifiers can predict attributes as associated with a query image (e.g., based on a scoring comparison with other potential attributes of an attribute group). Based on the generated tags, the disclosed systems can respond to tag queries and search queries.

Patent Agency Ranking