Patent search ap:("Google LLC") AND inv:"Chun-Ta Lu" Page 1

1.

发明授权
Multimodal image classifier using textual and visual embeddings 有权

公开(公告)号：US11907337B2

公开(公告)日：2024-02-20

申请号：US17046313

申请日：2019-11-18

Applicant: Google LLC

Inventor： Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia

IPC: G06K9/62 , G06K9/46 , G06F18/24 , G06F18/214 , G06F18/2413

CPC classification number: G06F18/24 , G06F18/214 , G06F18/24147

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.

2.

发明公开
Training Image and Text Embedding Models 审中-公开

公开(公告)号：US20240330361A1

公开(公告)日：2024-10-03

申请号：US18741082

申请日：2024-06-12

Applicant: Google LLC

Inventor： Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig

IPC: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/214 , G06F18/22 , G06F18/40 , G06N3/042 , G06N3/044 , G06N3/084

CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/2148 , G06F18/22 , G06F18/41 , G06N3/042 , G06N3/044 , G06N3/084

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.

3.

发明公开
Training Image and Text Embedding Models 审中-公开

公开(公告)号：US20230205813A1

公开(公告)日：2023-06-29

申请号：US18171511

申请日：2023-02-20

Applicant: Google LLC

Inventor： Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig

IPC: G06F16/55 , G06F16/538 , G06F16/9538 , G06N3/084 , G06F18/22 , G06F18/40 , G06F18/214 , G06N3/042 , G06N3/044

CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06N3/084 , G06F18/22 , G06F18/41 , G06F18/2148 , G06N3/042 , G06N3/044

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.

4.

发明授权
Training image and text embedding models 有权

公开(公告)号：US12038970B2

公开(公告)日：2024-07-16

申请号：US18171511

申请日：2023-02-20

Applicant: Google LLC

Inventor： Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig

IPC: G06F16/00 , G06F16/538 , G06F16/55 , G06F16/9538 , G06F18/214 , G06F18/22 , G06F18/40 , G06N3/042 , G06N3/044 , G06N3/084

CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/2148 , G06F18/22 , G06F18/41 , G06N3/042 , G06N3/044 , G06N3/084

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.

5.

发明申请
Multimodal Image Classifier using Textual and Visual Embeddings 有权

公开(公告)号：US20210264203A1

公开(公告)日：2021-08-26

申请号：US17046313

申请日：2019-11-18

Applicant: Google LLC

Inventor： Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia

IPC: G06K9/62 , G06K9/46

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.

6.

发明申请
Preventing The Distribution Of Forbidden Network Content With Robustified Detection 有权

公开(公告)号：US20250053865A1

公开(公告)日：2025-02-13

申请号：US18561104

申请日：2022-12-14

Applicant: Google LLC

Inventor： Wei Qiao , Chun-Ta Lu , Yinatao Liu , Ariel Fuxman , Mehmet Nejat Tek , Dongjin Kwon , Florian Nils Stimberg

IPC: G06N20/00 , G06V20/70

Abstract: The technology is generally directed to the training and execution of a model to identify policy violating content that has been obfuscated. The model may be trained using obfuscated training images. The obfuscated training images may be associated with one or more labels, such as a policy, obfuscation label, etc. The obfuscated training images and associated labels may be input into the model. During training, the output of the model may be a policy prediction as to whether the obfuscated input images violate the content policy of a host or are approved content for publishing. During implementation, the model may receive content as input and provide as output a policy prediction for the content. The host may use the policy prediction provided by the model to determine whether or not to publish the content.

7.

发明申请
Training a Classifier Model Relating to a Concept and Image Ratings Provided by a User 有权

公开(公告)号：US20240428573A1

公开(公告)日：2024-12-26

申请号：US18341218

申请日：2023-06-26

Applicant: Google LLC

Inventor： Ariel Fuxman , Alexander Kenji Hata , Edward Benjamin Vendrow , Otilia Stretcu , Wenlei Zhou , Krishnamurthy Viswanathan , Aditya Avinash , Gabriel Berger , Andrew Ames Bunner , Javier Alejandro Rey , Wei Qiao , Yintao Liu , Guanzhong Wang , Thomas Nathan Denby , Mehmet Nejat Tek , Neil Gordon Alldrin , Enming Luo , Chun-Ta Lu

IPC: G06V10/778 , G06V10/764 , G06V10/774 , G06V10/82 , G06V10/94

Abstract: A computer-implemented method includes receiving an input from a user relating to a concept, automatically obtaining a first set of images from an unlabeled dataset of images based on the input, and obtaining a first rating via the user for each image from the first set of images. The method further includes training a classifier model relating to the concept based on the first set of images rated by the user, automatically obtaining a second set of images from the unlabeled dataset of images based on the classifier model trained based on the first set of images, and obtaining a second rating via the user for each image from the second set of images. The classifier model relating to the concept is retrained based on the first set of images rated by the user and the second set of images rated by the user to obtain an updated classifier model.

8.

发明公开
Multimodal Image Classifier using Textual and Visual Embeddings 审中-公开

公开(公告)号：US20240143700A1

公开(公告)日：2024-05-02

申请号：US18409411

申请日：2024-01-10

Applicant: Google LLC

Inventor： Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia

IPC: G06F18/24 , G06F18/214 , G06F18/2413

CPC classification number: G06F18/24 , G06F18/214 , G06F18/24147

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.

9.

发明授权
Training image and text embedding models 有权

公开(公告)号：US11586927B2

公开(公告)日：2023-02-21

申请号：US16265793

申请日：2019-02-01

Applicant: Google LLC

Inventor： Zhen Li , Yi-ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig

IPC: G06K9/00 , G06N3/084 , G06F16/538 , G06F16/9538 , G06K9/62 , G06N3/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.

10.

发明申请
TRAINING IMAGE AND TEXT EMBEDDING MODELS 审中-公开

公开(公告)号：US20200250537A1

公开(公告)日：2020-08-06

申请号：US16265793

申请日：2019-02-01

Applicant: Google LLC

Inventor： Zhen Li , Yi-ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig

IPC: G06N3/08 , G06K9/62 , G06F16/9538 , G06F16/538 , G06N3/04

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification