-
公开(公告)号:US11874902B2
公开(公告)日:2024-01-16
申请号:US17160862
申请日:2021-01-28
Applicant: Adobe Inc.
Inventor: Pinkesh Badjatiya , Surgan Jandial , Pranit Chawla , Mausoom Sarkar , Ayush Chopra
IPC: G06F18/25 , G06N3/04 , G06F16/538 , G06F16/532 , G06F16/535 , G06F18/214
CPC classification number: G06F18/253 , G06F16/532 , G06F16/535 , G06F16/538 , G06F18/214 , G06F18/251 , G06N3/04
Abstract: Techniques are disclosed for text conditioned image searching. A methodology implementing the techniques according to an embodiment includes receiving a source image and a text query defining a target image attribute. The method also includes decomposing the source image into image content and style feature vectors and decomposing the text query into text content and style feature vectors, wherein image style is descriptive of image content and text style is descriptive of text content. The method further includes composing a global content feature vector based on the text content feature vector and the image content feature vector and composing a global style feature vector based on the text style feature vector and the image style feature vector. The method further includes identifying a target image that relates to the global content feature vector and the global style feature vector so that the target image relates to the target image attribute.
-
公开(公告)号:US11600091B2
公开(公告)日:2023-03-07
申请号:US17327382
申请日:2021-05-21
Applicant: Adobe Inc.
Inventor: Mausoom Sarkar , Arneh Jain
IPC: G06K9/62 , G06V30/414 , G06T7/12 , G06N3/08 , G06V30/412 , G06V30/416
Abstract: Techniques for document segmentation. In an example, a document processing application segments an electronic document image into strips. A first strip overlaps a second strip. The application generates a first mask indicating one or more elements and element types in the first strip by applying a predictive model network to image content in the first strip and a prior mask generated from image content of the first strip. The application generates a second mask indicating one or more elements and element types in the second strip by applying the predictive model network to image content in the second strip and the first mask. The application computes, from a combined mask derived from the first mask and the second mask, an output electronic document that identifies elements in the electronic document and the respective element types.
-
公开(公告)号:US20220309093A1
公开(公告)日:2022-09-29
申请号:US17806922
申请日:2022-06-14
Applicant: Adobe Inc.
Inventor: Ayush Chopra , Mausoom Sarkar , Jonas Dahl , Hiresh Gupta , Balaji Krishnamurthy , Abhishek Sinha
IPC: G06F16/535 , G06K9/62 , G06F17/15 , G06N3/04 , G06F16/55
Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media for generating tags for an object portrayed in a digital image based on predicted attributes of the object. For example, the disclosed systems can utilize interleaved neural network layers of alternating inception layers and dilated convolution layers to generate a localization feature vector. Based on the localization feature vector, the disclosed systems can generate attribute localization feature embeddings, for example, using some pooling layer such as a global average pooling layer. The disclosed systems can then apply the attribute localization feature embeddings to corresponding attribute group classifiers to generate tags based on predicted attributes. In particular, attribute group classifiers can predict attributes as associated with a query image (e.g., based on a scoring comparison with other potential attributes of an attribute group). Based on the generated tags, the disclosed systems can respond to tag queries and search queries.
-
公开(公告)号:US20210279461A1
公开(公告)日:2021-09-09
申请号:US17327382
申请日:2021-05-21
Applicant: Adobe Inc.
Inventor: Mausoom Sarkar , Arneh Jain
Abstract: Techniques for document segmentation. In an example, a document processing application segments an electronic document image into strips. A first strip overlaps a second strip. The application generates a first mask indicating one or more elements and element types in the first strip by applying a predictive model network to image content in the first strip and a prior mask generated from image content of the first strip. The application generates a second mask indicating one or more elements and element types in the second strip by applying the predictive model network to image content in the second strip and the first mask. The application computes, from a combined mask derived from the first mask and the second mask, an output electronic document that identifies elements in the electronic document and the respective element types.
-
公开(公告)号:US11042734B2
公开(公告)日:2021-06-22
申请号:US16539634
申请日:2019-08-13
Applicant: Adobe Inc.
Inventor: Mausoom Sarkar , Arneh Jain
Abstract: Techniques for document segmentation. In an example, a document processing application segments an electronic document image into strips. A first strip overlaps a second strip. The application generates a first mask indicating one or more elements and element types in the first strip by applying a predictive model network to image content in the first strip and a prior mask generated from image content of the first strip. The application generates a second mask indicating one or more elements and element types in the second strip by applying the predictive model network to image content in the second strip and the first mask. The application computes, from a combined mask derived from the first mask and the second mask, an output electronic document that identifies elements in the electronic document and the respective element types.
-
公开(公告)号:US20200372560A1
公开(公告)日:2020-11-26
申请号:US16417373
申请日:2019-05-20
Applicant: ADOBE INC.
Inventor: Jonas Dahl , Mausoom Sarkar , Hiresh Gupta , Balaji Krishnamurthy , Ayush Chopra , Abhishek Sinha
Abstract: A search system provides search results with images of products based on associations of primary products and secondary products from product image sets. The search system analyzes a product image set containing multiple images to determine a primary product and secondary products. Information associating the primary and secondary products are stored in a search index. When the search system receives a query image containing a search product, the search index is queried using the search product to identify search result images based on associations of products in the search index, and the result images are provided as a response to the query image.
-
公开(公告)号:US10831818B2
公开(公告)日:2020-11-10
申请号:US16177243
申请日:2018-10-31
Applicant: Adobe Inc.
Inventor: Mausoom Sarkar , Hiresh Gupta , Abhishek Sinha
IPC: G06K9/54 , G06F16/58 , G06N3/08 , G06F16/56 , G06F16/583
Abstract: Digital image search training techniques and machine-learning architectures are described. In one example, a query digital image is received by service provider system, which is then used to select at least one positive sample digital image, e.g., having a same product ID. A plurality of negative sample digital images is also selected by the service provider system based on the query digital image, e.g., having different product IDs. The at least one positive sample digital image and the plurality of negative samples are then aggregated by the service provider system into a single aggregated digital image. At least one neural network is then trained by the service provider system using a loss function based on a feature comparison between the query digital image and samples from the aggregated digital image in a single pass.
-
公开(公告)号:US10354290B2
公开(公告)日:2019-07-16
申请号:US14741111
申请日:2015-06-16
Applicant: ADOBE INC.
Inventor: Vikas Yadav , Balaji Krishnamurthy , Mausoom Sarkar , Rajiv Mangla , Gitesh Malik
IPC: H04N7/10 , G06Q30/02 , H04N5/76 , G11B27/034 , G06K9/00 , H04N5/93 , G06T7/11 , G06K9/62 , H04N21/254 , H04N21/442
Abstract: Embodiments of the present invention provide systems and methods for automatically generating a shoppable video. A video is parsed into one or more scenes. Products and their corresponding product information are automatically associated with the one or more scenes. The shoppable video is then generated using the associated products and corresponding product information such that the products are visible in the shoppable video based on a scene in which the products are found.
-
公开(公告)号:US20250005824A1
公开(公告)日:2025-01-02
申请号:US18341982
申请日:2023-06-27
Applicant: ADOBE INC.
Inventor: Rishabh Jain , Mayur Hemani , Duygu Ceylan Aksit , Krishna Kumar Singh , Jingwan Lu , Mausoom Sarkar , Balaji Krishnamurthy
Abstract: Systems and methods for image processing are described. One aspect of the systems and methods includes receiving a plurality of images comprising a first image depicting a first body part and a second image depicting a second body part and encoding, using a texture encoder, the first image and the second image to obtain a first texture embedding and a second texture embedding, respectively. Then, a composite image is generated using a generative decoder, the composite image depicting the first body part and the second body part based on the first texture embedding and the second texture embedding.
-
公开(公告)号:US20250005812A1
公开(公告)日:2025-01-02
申请号:US18215484
申请日:2023-06-28
Applicant: Adobe Inc.
Inventor: Rishabh Jain , Mayur Hemani , Mausoom Sarkar , Krishna Kumar Singh , Jingwan Lu , Duygu Ceylan Aksit , Balaji Krishnamurthy
Abstract: In implementations of systems for human reposing based on multiple input views, a computing device implements a reposing system to receive input data describing: input digital images; pluralities of keypoints corresponding to the input digital images, the pluralities of keypoints representing poses of a person depicted in the input digital images; and a plurality of keypoints representing a target pose. The reposing system generates selection masks corresponding to the input digital images by processing the input data using a machine learning model. The selection masks represent likelihoods of spatial correspondence between pixels of an output digital image and portions of the input digital images. The reposing system generates the output digital image depicting the person in the target pose for display in a user interface based on the selection masks and the input data.
-
-
-
-
-
-
-
-
-