Inference Methods For Word Or Wordpiece Tokenization

    Publication Number: US20240054288A1

    Publication Date: 2024-02-15

    Application Number: US18205609

    Application Date: 2023-06-05

    Applicant: Google LLC

    CPC classification number: G06F40/284 G06F16/322 G06F40/40

    Abstract: Systems and methods for performing inference for word or wordpiece tokenization are disclosed using a left-to-right longest-match-first greedy process. In some examples, the vocabulary may be organized into a trie structure in which each node includes a precomputed token or token_ID and a fail link, so that the tokenizer can parse the trie in a single pass to generate a list of only those tokens or token_IDs that correspond to the longest matching vocabulary entries in the sample string, without the need for backtracking. In some examples, the vocabulary may be organized into a trie in which each node has a fail link, and any node that would share token(s) or token_ID(s) of a preceding node is instead given a prev_match link that points back to a chain of nodes with those token(s) or token_ID(s).
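    The trie-based greedy matching described above can be illustrated compactly. Below is a minimal Python sketch of left-to-right, longest-match-first wordpiece tokenization over a plain trie whose nodes carry precomputed token IDs; the example vocabulary, the "##" continuation prefix, and the handling of unmatchable words are illustrative assumptions, and the patented single-pass fail-link traversal that avoids backtracking is not reproduced here.

```python
class TrieNode:
    def __init__(self):
        self.children = {}
        self.token_id = None  # precomputed ID if a vocabulary entry ends here

def build_trie(vocab):
    root = TrieNode()
    for token_id, token in enumerate(vocab):
        node = root
        for ch in token:
            node = node.children.setdefault(ch, TrieNode())
        node.token_id = token_id
    return root

def tokenize_word(word, root):
    """Greedy longest-match-first: repeatedly take the longest vocabulary
    entry that matches a prefix of the remaining suffix of the word."""
    ids, start = [], 0
    while start < len(word):
        # Continuation pieces are looked up with a "##" prefix.
        piece = word[start:] if start == 0 else "##" + word[start:]
        node, best_end, best_id = root, None, None
        for i, ch in enumerate(piece):
            node = node.children.get(ch)
            if node is None:
                break
            if node.token_id is not None:
                best_end, best_id = i + 1, node.token_id
        if best_id is None:
            return None  # no vocabulary entry matches (would map to <unk>)
        ids.append(best_id)
        start += best_end - (0 if start == 0 else 2)  # discount the "##"
    return ids

vocab = ["un", "##aff", "##able", "##a", "##ble"]
root = build_trie(vocab)
print([vocab[i] for i in tokenize_word("unaffable", root)])  # ['un', '##aff', '##able']
```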

    Extreme language model compression with optimal sub-words and shared projections

    Publication Number: US11797862B2

    Publication Date: 2023-10-24

    Application Number: US16749570

    Application Date: 2020-01-22

    Applicant: Google LLC

    CPC classification number: G06N3/088 G06F40/284 G06N3/045

    Abstract: Provided is a knowledge distillation technique for training a student language model that, relative to a larger teacher language model, has a significantly smaller vocabulary, lower embedding dimensions, and/or hidden state dimensions. Specifically, aspects of the present disclosure are directed to a dual-training mechanism that trains the teacher and student language models simultaneously to obtain optimal word embeddings for the student vocabulary. In some implementations, this approach can be combined with learning shared projection matrices that transfer layer-wise knowledge from the teacher language model to the student language model. Example experimental results have also demonstrated higher compression efficiency and accuracy when compared with other state-of-the-art compression techniques, including the ability to compress the BERT_BASE model by more than 60×, with only a minor drop in downstream task metrics, resulting in a language model with a footprint of under 7 MB.
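    As a rough illustration of the shared-projection idea in the abstract, the following PyTorch sketch uses a single trainable projection, shared across layers, to map student hidden states into the teacher's hidden dimension and penalizes the layer-wise mismatch with an MSE loss. The dimensions, the MSE objective, and the direction of projection are assumptions for illustration; the dual-training mechanism and the optimal sub-word embedding learning are not shown.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

teacher_dim, student_dim, num_layers = 768, 192, 4

# One projection shared by every layer (hence "shared projections").
shared_proj = nn.Linear(student_dim, teacher_dim, bias=False)

def layerwise_distillation_loss(student_states, teacher_states):
    """student_states / teacher_states: lists of [batch, seq, dim] tensors,
    one per aligned layer; teacher states are treated as fixed targets."""
    loss = 0.0
    for s, t in zip(student_states, teacher_states):
        loss = loss + F.mse_loss(shared_proj(s), t.detach())
    return loss / len(student_states)

# Toy activations standing in for real teacher/student hidden states.
batch, seq = 2, 8
student = [torch.randn(batch, seq, student_dim) for _ in range(num_layers)]
teacher = [torch.randn(batch, seq, teacher_dim) for _ in range(num_layers)]
print(layerwise_distillation_loss(student, teacher).item())
```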

    Fine-grained image similarity
    Invention Grant

    Publication Number: US10949708B2

    Publication Date: 2021-03-16

    Application Number: US16420154

    Application Date: 2019-05-22

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus for determining fine-grained image similarity. In one aspect, a method includes training an image embedding function on image triplets by selecting image triplets of first, second, and third images; generating, by the image embedding function, first, second, and third representations of the features of the first, second, and third images; determining, based on the first representation of features and the second representation of features, a first similarity measure for the first image to the second image; determining, based on the first representation of features and the third representation of features, a second similarity measure for the first image to the third image; determining, based on the first and second similarity measures, a performance measure of the image embedding function for the image triplet; and adjusting the parameter weights of the image embedding function based on the performance measures for the image triplets.
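    To make the triplet-based performance measure concrete, here is a minimal NumPy sketch of a hinge-style loss over one image triplet, in which the first (anchor) image is expected to be closer to the second image than to the third. The use of squared Euclidean distance as the (dis)similarity measure and the margin value are assumptions, not details taken from the patent.

```python
import numpy as np

def triplet_loss(anchor, positive, negative, margin=1.0):
    """anchor/positive/negative: embedding vectors for the first, second,
    and third images of a triplet."""
    d_pos = np.sum((anchor - positive) ** 2)  # first vs. second image
    d_neg = np.sum((anchor - negative) ** 2)  # first vs. third image
    # Zero loss once the positive pair is closer by at least the margin.
    return max(0.0, margin + d_pos - d_neg)

rng = np.random.default_rng(0)
a, p, n = (rng.normal(size=8) for _ in range(3))
print(triplet_loss(a, p, n))
```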

    Fine-Grained Image Similarity
    Invention Application

    Publication Number: US20190279030A1

    Publication Date: 2019-09-12

    Application Number: US16420154

    Application Date: 2019-05-22

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus for determining fine-grained image similarity. In one aspect, a method includes training an image embedding function on image triplets by selecting image triplets of first, second, and third images; generating, by the image embedding function, first, second, and third representations of the features of the first, second, and third images; determining, based on the first representation of features and the second representation of features, a first similarity measure for the first image to the second image; determining, based on the first representation of features and the third representation of features, a second similarity measure for the first image to the third image; determining, based on the first and second similarity measures, a performance measure of the image embedding function for the image triplet; and adjusting the parameter weights of the image embedding function based on the performance measures for the image triplets.

    LEARNING UNIFIED EMBEDDING
    Invention Application

    Publication Number: US20200090039A1

    Publication Date: 2020-03-19

    Application Number: US16494842

    Application Date: 2017-11-17

    Applicant: Google LLC

    Abstract: A computer-implemented method for generating a unified machine learning model using a neural network on a data processing apparatus is described. The method includes the data processing apparatus determining respective learning targets for each of a plurality of object verticals. The data processing apparatus determines the respective learning targets based on two or more embedding outputs of the neural network. The method also includes the data processing apparatus training the neural network to identify data associated with each of the plurality of object verticals. The data processing apparatus trains the neural network using the respective learning targets and based on a first loss function. The data processing apparatus uses the trained neural network to generate a unified machine learning model, where the model is configured to identify particular data items associated with each of the plurality of object verticals.
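    The sketch below illustrates one way a single ("unified") model can serve several object verticals: a shared embedding trunk feeds per-vertical heads, and the per-vertical losses are summed. This is a PyTorch sketch under assumptions; the patent's embedding-derived learning targets and its specific loss function are not reproduced, and all dimensions and the classification heads are hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

embed_dim, num_verticals, classes_per_vertical = 64, 3, 10

# Shared embedding trunk plus one illustrative head per object vertical.
trunk = nn.Sequential(nn.Linear(128, embed_dim), nn.ReLU())
heads = nn.ModuleList(
    [nn.Linear(embed_dim, classes_per_vertical) for _ in range(num_verticals)]
)

def unified_loss(batches):
    """batches: list of (features, labels) pairs, one per object vertical."""
    loss = 0.0
    for head, (x, y) in zip(heads, batches):
        loss = loss + F.cross_entropy(head(trunk(x)), y)
    return loss

# Toy batches standing in for per-vertical training data.
batches = [
    (torch.randn(4, 128), torch.randint(0, classes_per_vertical, (4,)))
    for _ in range(num_verticals)
]
print(unified_loss(batches).item())
```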

    Extreme Language Model Compression with Optimal Sub-Words and Shared Projections

    Publication Number: US20240013059A1

    Publication Date: 2024-01-11

    Application Number: US18471866

    Application Date: 2023-09-21

    Applicant: Google LLC

    CPC classification number: G06N3/0455 G06F40/40 G06N3/08 G06F40/284

    Abstract: Provided is a knowledge distillation technique for training a student language model that, relative to a larger teacher language model, has a significantly smaller vocabulary, lower embedding dimensions, and/or hidden state dimensions. Specifically, aspects of the present disclosure are directed to a dual-training mechanism that trains the teacher and student language models simultaneously to obtain optimal word embeddings for the student vocabulary. In some implementations, this approach can be combined with learning shared projection matrices that transfer layer-wise knowledge from the teacher language model to the student language model. Example experimental results have also demonstrated higher compression efficiency and accuracy when compared with other state-of-the-art compression techniques, including the ability to compress the BERT_BASE model by more than 60×, with only a minor drop in downstream task metrics, resulting in a language model with a footprint of under 7 MB.

    Extreme language model compression with optimal sub-words and shared projections

    Publication Number: US12260340B2

    Publication Date: 2025-03-25

    Application Number: US18471866

    Application Date: 2023-09-21

    Applicant: Google LLC

    Abstract: Provided is a knowledge distillation technique for training a student language model that, relative to a larger teacher language model, has a significantly smaller vocabulary, lower embedding dimensions, and/or hidden state dimensions. Specifically, aspects of the present disclosure are directed to a dual-training mechanism that trains the teacher and student language models simultaneously to obtain optimal word embeddings for the student vocabulary. In some implementations, this approach can be combined with learning shared projection matrices that transfer layer-wise knowledge from the teacher language model to the student language model. Example experimental results have also demonstrated higher compression efficiency and accuracy when compared with other state-of-the-art compression techniques, including the ability to compress the BERT_BASE model by more than 60×, with only a minor drop in downstream task metrics, resulting in a language model with a footprint of under 7 MB.

    Inference methods for word or wordpiece tokenization

    Publication Number: US11763083B2

    Publication Date: 2023-09-19

    Application Number: US17798638

    Application Date: 2020-05-18

    Applicant: Google LLC

    CPC classification number: G06F40/284 G06F16/322 G06F40/40

    Abstract: Systems and methods for performing inference for word or wordpiece tokenization are disclosed using a left-to-right longest-match-first greedy process. In some examples, the vocabulary may be organized into a trie structure in which each node includes a precomputed token or token_ID and a fail link, so that the tokenizer can parse the trie in a single pass to generate a list of only those tokens or token_IDs that correspond to the longest matching vocabulary entries in the sample string, without the need for backtracking. In some examples, the vocabulary may be organized into a trie in which each node has a fail link, and any node that would share token(s) or token_ID(s) of a preceding node is instead given a prev_match link that points back to a chain of nodes with those token(s) or token_ID(s).
