Patent search ap:("Adobe Inc.") AND inv:"Nham Le" Page 1

1.

发明申请
GENERATING MODIFIED DIGITAL IMAGES UTILIZING A DISPERSED MULTIMODAL SELECTION MODEL 有权

公开(公告)号：US20210004576A1

公开(公告)日：2021-01-07

申请号：US17025477

申请日：2020-09-18

Applicant: Adobe Inc.

Inventor： Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt

IPC: G06K9/00 , G06N3/04 , G10L15/26 , G10L15/25

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

2.

发明申请
GENERATING MODIFIED DIGITAL IMAGES UTILIZING A MULTIMODAL SELECTION MODEL BASED ON VERBAL AND GESTURE INPUT 审中-公开

公开(公告)号：US20200160042A1

公开(公告)日：2020-05-21

申请号：US16192573

申请日：2018-11-15

Applicant: Adobe Inc.

Inventor： Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt

IPC: G06K9/00 , G10L15/25 , G10L15/26 , G06N3/04

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

3.

发明授权
Generating modified digital images utilizing a dispersed multimodal selection model 有权

公开(公告)号：US11594077B2

公开(公告)日：2023-02-28

申请号：US17025477

申请日：2020-09-18

Applicant: Adobe Inc.

Inventor： Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt

IPC: G06V40/20 , G06N3/04 , G10L15/26 , G10L15/25

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

4.

发明授权
Generating modified digital images utilizing a multimodal selection model based on verbal and gesture input 有权

公开(公告)号：US10817713B2

公开(公告)日：2020-10-27

申请号：US16192573

申请日：2018-11-15

Applicant: Adobe Inc.

Inventor： Trung Bui , Zhe Lin , Walter Chang , Nham Le , Franck Dernoncourt

IPC: G06K9/00 , G06N3/04 , G10L15/26 , G10L15/25

Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for generating modified digital images based on verbal and/or gesture input by utilizing a natural language processing neural network and one or more computer vision neural networks. The disclosed systems can receive verbal input together with gesture input. The disclosed systems can further utilize a natural language processing neural network to generate a verbal command based on verbal input. The disclosed systems can select a particular computer vision neural network based on the verbal input and/or the gesture input. The disclosed systems can apply the selected computer vision neural network to identify pixels within a digital image that correspond to an object indicated by the verbal input and/or gesture input. Utilizing the identified pixels, the disclosed systems can generate a modified digital image by performing one or more editing actions indicated by the verbal input and/or gesture input.

Patent Agency Ranking