-
公开(公告)号:US20240078258A1
公开(公告)日:2024-03-07
申请号:US18505776
申请日:2023-11-09
Applicant: Google LLC
Inventor: Zhen Li , Yi-ting Chen , Ning Ye , Yaxi Gao , Zijian Guo , Aleksei Timofeev , Futang Peng , Thomas J. Duerig
IPC: G06F16/55 , G06F16/242 , G06F16/953 , G06F18/22 , G06N3/044 , G06N3/084 , G06N20/00
CPC classification number: G06F16/55 , G06F16/2425 , G06F16/953 , G06F18/22 , G06N3/044 , G06N3/084 , G06N20/00
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly training an image embedding model and a text embedding model. In one aspect, a method comprises: processing data from a historical query log of a search system to generate a candidate set of training examples, wherein each training example comprises: (i) a search query comprising a sequence of one or more words, (ii) an image, and (iii) selection data characterizing how often users selected the image in response to the image being identified by a search result for the search query; selecting a plurality of training examples from the candidate set of training examples; and using the training data to jointly train the image embedding model and the text embedding model.
-
公开(公告)号:US11907337B2
公开(公告)日:2024-02-20
申请号:US17046313
申请日:2019-11-18
Applicant: Google LLC
Inventor: Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia
IPC: G06K9/62 , G06K9/46 , G06F18/24 , G06F18/214 , G06F18/2413
CPC classification number: G06F18/24 , G06F18/214 , G06F18/24147
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.
-
公开(公告)号:US20240330361A1
公开(公告)日:2024-10-03
申请号:US18741082
申请日:2024-06-12
Applicant: Google LLC
Inventor: Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig
IPC: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/214 , G06F18/22 , G06F18/40 , G06N3/042 , G06N3/044 , G06N3/084
CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/2148 , G06F18/22 , G06F18/41 , G06N3/042 , G06N3/044 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.
-
公开(公告)号:US20230205813A1
公开(公告)日:2023-06-29
申请号:US18171511
申请日:2023-02-20
Applicant: Google LLC
Inventor: Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig
IPC: G06F16/55 , G06F16/538 , G06F16/9538 , G06N3/084 , G06F18/22 , G06F18/40 , G06F18/214 , G06N3/042 , G06N3/044
CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06N3/084 , G06F18/22 , G06F18/41 , G06F18/2148 , G06N3/042 , G06N3/044
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.
-
公开(公告)号:US20240143700A1
公开(公告)日:2024-05-02
申请号:US18409411
申请日:2024-01-10
Applicant: Google LLC
Inventor: Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia
IPC: G06F18/24 , G06F18/214 , G06F18/2413
CPC classification number: G06F18/24 , G06F18/214 , G06F18/24147
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.
-
公开(公告)号:US11586927B2
公开(公告)日:2023-02-21
申请号:US16265793
申请日:2019-02-01
Applicant: Google LLC
Inventor: Zhen Li , Yi-ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig
IPC: G06K9/00 , G06N3/084 , G06F16/538 , G06F16/9538 , G06K9/62 , G06N3/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.
-
公开(公告)号:US20200250537A1
公开(公告)日:2020-08-06
申请号:US16265793
申请日:2019-02-01
Applicant: Google LLC
Inventor: Zhen Li , Yi-ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig
IPC: G06N3/08 , G06K9/62 , G06F16/9538 , G06F16/538 , G06N3/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.
-
公开(公告)号:US12038970B2
公开(公告)日:2024-07-16
申请号:US18171511
申请日:2023-02-20
Applicant: Google LLC
Inventor: Zhen Li , Yi-Ting Chen , Yaxi Gao , Da-Cheng Juan , Aleksei Timofeev , Chun-Ta Lu , Futang Peng , Sujith Ravi , Andrew Tomkins , Thomas J. Duerig
IPC: G06F16/00 , G06F16/538 , G06F16/55 , G06F16/9538 , G06F18/214 , G06F18/22 , G06F18/40 , G06N3/042 , G06N3/044 , G06N3/084
CPC classification number: G06F16/55 , G06F16/538 , G06F16/9538 , G06F18/2148 , G06F18/22 , G06F18/41 , G06N3/042 , G06N3/044 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training an image embedding model. In one aspect, a method comprises: obtaining training data comprising a plurality of training examples, wherein each training example comprises: an image pair comprising a first image and a second image; and selection data indicating one or more of: (i) a co-click rate of the image pair, and (ii) a similar-image click rate of the image pair; and using the training data to train an image embedding model having a plurality of image embedding model parameters.
-
公开(公告)号:US20210264203A1
公开(公告)日:2021-08-26
申请号:US17046313
申请日:2019-11-18
Applicant: Google LLC
Inventor: Ariel Fuxman , Aleksei Timofeev , Zhen Li , Chun-Ta Lu , Manan Shah , Chen Sun , Krishnamurthy Viswanathan , Chao Jia
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for realizing a multimodal image classifier. In an aspect, a method includes, for each image of a plurality of images: processing the image by a textual generator model to obtain a set of phrases that are descriptive of the content of the image, wherein each phrase is one or more terms, processing the set of phrases by a textual embedding model to obtain an embedding of predicted text for the image, and processing the image using an image embedding model to obtain an embedding of image pixels of the image. Then a multimodal image classifier is trained on the embeddings of predicted text for the images and the embeddings of image pixels for the images to produce, as output, labels of an output taxonomy to classify an image based on the image as input.
-
公开(公告)号:US20200250538A1
公开(公告)日:2020-08-06
申请号:US16265811
申请日:2019-02-01
Applicant: Google LLC
Inventor: Zhen Li , Yi-ting Chen , Ning Ye , Yaxi Gao , Zijian Guo , Aleksei Timofeev , Futang Peng , Thomas J. Duerig
IPC: G06N3/08 , G06K9/62 , G06F16/953 , G06F16/242 , G06N20/00 , G06N3/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly training an image embedding model and a text embedding model. In one aspect, a method comprises: processing data from a historical query log of a search system to generate a candidate set of training examples, wherein each training example comprises: (i) a search query comprising a sequence of one or more words, (ii) an image, and (iii) selection data characterizing how often users selected the image in response to the image being identified by a search result for the search query; selecting a plurality of training examples from the candidate set of training examples; and using the training data to jointly train the image embedding model and the text embedding model.
-
-
-
-
-
-
-
-
-