Patent search ap:("ADOBE INC.") AND inv:"Balaji Krishnamurthy" Page 11

101.

发明公开
SYSTEMS AND METHODS FOR DATA AUGMENTATION 审中-公开

公开(公告)号：US20240119122A1

公开(公告)日：2024-04-11

申请号：US18045542

申请日：2022-10-11

Applicant: ADOBE INC.

Inventor： Shripad Vilasrao Deshmukh , Surgan Jandial , Abhinav Java , Milan Aggarwal , Mausoom Sarkar , Arneh Jain , Balaji Krishnamurthy

IPC: G06K9/62

CPC classification number: G06K9/6269 , G06K9/6259 , G06K9/6285

Abstract: Systems and methods for data augmentation are provided. One aspect of the systems and methods include receiving an image that is misclassified by a classification network; computing an augmentation image based on the image using an augmentation network; and generating an augmented image by combining the image and the augmentation image, wherein the augmented image is correctly classified by the classification network.

102.

发明授权
Generating images for virtual try-on and pose transfer 有权

公开(公告)号：US11861772B2

公开(公告)日：2024-01-02

申请号：US17678237

申请日：2022-02-23

Applicant: Adobe Inc.

Inventor： Ayush Chopra , Rishabh Jain , Mayur Hemani , Balaji Krishnamurthy

IPC: G06T11/60 , G06N3/045 , G06T7/11 , G06T7/70 , G06N3/044 , G06N3/08

CPC classification number: G06T11/60 , G06N3/045 , G06T7/11 , G06T7/70

Abstract: In implementations of systems for generating images for virtual try-on and pose transfer, a computing device implements a generator system to receive input data describing a first digital image that depicts a person in a pose and a second digital image that depicts a garment. Candidate appearance flow maps are computed that warp the garment based on the pose at different pixel-block sizes using a first machine learning model. The generator system generates a warped garment image by combining the candidate appearance flow maps as an aggregate per-pixel displacement map using a convolutional gated recurrent network. A conditional segment mask is predicted that segments portions of a geometry of the person using a second machine learning model. The generator system outputs a digital image that depicts the person in the pose wearing the garment based on the warped garment image and the conditional segmentation mask using a third machine learning model.

103.

发明授权
Model training with retrospective loss 有权

公开(公告)号：US11797823B2

公开(公告)日：2023-10-24

申请号：US16793551

申请日：2020-02-18

Applicant: Adobe Inc.

Inventor： Ayush Chopra , Balaji Krishnamurthy , Mausoom Sarkar , Surgan Jandial

IPC: G06N3/04 , G06N3/084 , G06F18/214 , G06N3/047 , G06V10/764 , G06V10/82

CPC classification number: G06N3/04 , G06F18/214 , G06N3/047 , G06N3/084 , G06V10/764 , G06V10/82

Abstract: Generating a machine learning model that is trained using retrospective loss is described. A retrospective loss system receives an untrained machine learning model and a task for training the model. The retrospective loss system initially trains the model over warm-up iterations using task-specific loss that is determined based on a difference between predictions output by the model during training on input data and a ground truth dataset for the input data. Following the warm-up training iterations, the retrospective loss system continues to train the model using retrospective loss, which is model-agnostic and constrains the model such that a subsequently output prediction is more similar to the ground truth dataset than the previously output prediction. After determining that the model's outputs are within a threshold similarity to the ground truth dataset, the model is output with its current parameters as a trained model.

104.

发明公开
FORM STRUCTURE EXTRACTION BY PREDICTING ASSOCIATIONS 审中-公开

公开(公告)号：US20230267345A1

公开(公告)日：2023-08-24

申请号：US18135948

申请日：2023-04-18

Applicant: Adobe Inc.

Inventor： Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy

IPC: G06N5/04 , G06N20/00 , G06N3/08 , G06V10/82 , G06N20/10

CPC classification number: G06N5/04 , G06N3/08 , G06N20/00 , G06N20/10 , G06V10/82

Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

105.

发明授权
Identifying digital attributes from multiple attribute groups utilizing a deep cognitive attribution neural network 有权

公开(公告)号：US11734337B2

公开(公告)日：2023-08-22

申请号：US17806922

申请日：2022-06-14

Applicant: Adobe Inc.

Inventor： Ayush Chopra , Mausoom Sarkar , Jonas Dahl , Hiresh Gupta , Balaji Krishnamurthy , Abhishek Sinha

IPC: G06F16/532 , G06F16/535 , G06F17/15 , G06N3/04 , G06F16/55 , G06F18/22 , G06F18/24 , G06V10/764 , G06V10/82 , G06V10/44

CPC classification number: G06F16/532 , G06F16/535 , G06F16/55 , G06F17/15 , G06F18/22 , G06F18/24 , G06N3/04 , G06V10/454 , G06V10/764 , G06V10/82

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media for generating tags for an object portrayed in a digital image based on predicted attributes of the object. For example, the disclosed systems can utilize interleaved neural network layers of alternating inception layers and dilated convolution layers to generate a localization feature vector. Based on the localization feature vector, the disclosed systems can generate attribute localization feature embeddings, for example, using some pooling layer such as a global average pooling layer. The disclosed systems can then apply the attribute localization feature embeddings to corresponding attribute group classifiers to generate tags based on predicted attributes. In particular, attribute group classifiers can predict attributes as associated with a query image (e.g., based on a scoring comparison with other potential attributes of an attribute group). Based on the generated tags, the disclosed systems can respond to tag queries and search queries.

106.

发明公开
VISUAL SPEECH RECOGNITION FOR DIGITAL VIDEOS UTILIZING GENERATIVE ADVERSARIAL LEARNING 审中-公开

公开(公告)号：US20230252993A1

公开(公告)日：2023-08-10

申请号：US17650020

申请日：2022-02-04

Applicant: Adobe Inc.

Inventor： Yaman Kumar , Balaji Krishnamurthy

IPC: G10L15/25 , G06T9/00 , G06K9/62 , G06V20/40 , G06V10/82 , G10L15/16 , G10L13/02 , G10L15/22 , G10L25/57 , G06N3/02

CPC classification number: G10L15/25 , G06T9/002 , G06K9/6223 , G06V20/49 , G06V10/82 , G10L15/16 , G10L13/02 , G10L15/22 , G10L25/57 , G06N3/02

Abstract: This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that recognize speech from a digital video utilizing an unsupervised machine learning model, such as a generative adversarial neural network (GAN) model. In one or more implementations, the disclosed systems utilize an image encoder to generate self-supervised deep visual speech representations from frames of an unlabeled (or unannotated) digital video. Subsequently, in one or more embodiments, the disclosed systems generate viseme sequences from the deep visual speech representations (e.g., via segmented visemic speech representations from clusters of the deep visual speech representations) utilizing the adversarially trained GAN model. Indeed, in some instances, the disclosed systems decode the viseme sequences belonging to the digital video to generate an electronic transcription and/or digital audio for the digital video.

107.

发明授权
Form structure extraction by predicting associations 有权

公开(公告)号：US11657306B2

公开(公告)日：2023-05-23

申请号：US16904263

申请日：2020-06-17

Applicant: Adobe Inc.

Inventor： Milan Aggarwal , Mausoom Sarkar , Balaji Krishnamurthy

IPC: G06N5/04 , G06N20/00 , G06N3/08 , G06V10/82 , G06N20/10

CPC classification number: G06N5/04 , G06N3/08 , G06N20/00 , G06N20/10 , G06V10/82

Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

108.

发明授权
Generating combined feature embedding for minority class upsampling in training machine learning models with imbalanced samples 有权

公开(公告)号：US11631029B2

公开(公告)日：2023-04-18

申请号：US16564531

申请日：2019-09-09

Applicant: Adobe, Inc.

Inventor： Nikaash Puri , Balaji Krishnamurthy , Ayush Chopra

IPC: G06N20/00 , G06N5/04

Abstract: Systems, methods, and non-transitory computer-readable media are disclosed for generating combined feature embeddings for minority class upsampling in training machine learning models with imbalanced training samples. For example, the disclosed systems can select training sample values from a set of training samples and a combination ratio value from a continuous probability distribution. Additionally, the disclosed systems can generate a combined synthetic training sample value by modifying the selected training sample values using the combination ratio value and combining the modified training sample values. Moreover, the disclosed systems can generate a combined synthetic ground truth label based on the combination ratio value. In addition, the disclosed systems can utilize the combined synthetic training sample value and the combined synthetic ground truth label to generate a combined synthetic training sample and utilize the combined synthetic training sample to train a machine learning model.

109.

发明申请
COMPRESSING DIGITAL IMAGES UTILIZING DEEP PERCEPTUAL SIMILARITY 有权

公开(公告)号：US20220198717A1

公开(公告)日：2022-06-23

申请号：US17654529

申请日：2022-03-11

Applicant: Adobe Inc.

Inventor： Meet Patel , Mayur Hemani , Karanjeet Singh , Amit Gupta , Apoorva Gupta , Balaji Krishnamurthy

IPC: G06T9/00 , G06N3/08 , G06T7/00

Abstract: Methods, systems, and non-transitory computer readable storage media are disclosed for utilizing deep learning to intelligently determine compression settings for compressing a digital image. For instance, the disclosed system utilizes a neural network to generate predicted perceptual quality values for compression settings on a compression quality scale. The disclosed system fits the predicted compression distortions to a perceptual distortion characteristic curve for interpolating predicted perceptual quality values across the compression settings on the compression quality scale. Additionally, the disclosed system then performs a search over the predicted perceptual quality values for the compression settings along the compression quality scale to select a compression setting based on a perceptual quality threshold. The disclosed system generates a compressed digital image according to compression parameters for the selected compression setting.

110.

发明申请
Image Generation and Editing with Latent Transformation Detection 有权

公开(公告)号：US20220138897A1

公开(公告)日：2022-05-05

申请号：US17088120

申请日：2020-11-03

Applicant: ADOBE INC.

Inventor： Mayank Singh , Parth Patel , Nupur Kumari , Balaji Krishnamurthy

IPC: G06T3/00 , G06K9/62 , G06N3/04 , G06N3/08 , G06T7/73 , G06T3/40

Abstract: This disclosure includes technologies for image processing, particularly for image generation and editing in a configurable semantic direction. A generative adversarial network is trained with an auxiliary network with an auxiliary task that is designed to disentangle the latent space of the generative adversarial network. Resultantly, a new type of GAN is created to improve image generation or editing in both conditional and unconditional settings.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification