-
公开(公告)号:US20240378256A1
公开(公告)日:2024-11-14
申请号:US18636844
申请日:2024-04-16
Applicant: Google LLC
Inventor: Arash Sadr , Yu Tao , Daliang Li , Zachary Kenneth Fisher , Bhargav Kanagal Shamanna , Xinnan Yu , Rajiv Shailendra Menjoge , Marcin Tadeusz Bialek , Grzegorz Glowaty , Sumit K. Sanghai , Sanjiv Kumar
IPC: G06F16/9538 , G06F40/30 , G06Q30/0282 , G06Q30/0601
Abstract: Systems and methods for generating and utilizing artificial intelligence generated badges can include processing web information associated with a subject to determine particular qualities of the subject. The qualities can then be utilized to generate one or more badges. The badges can then be utilized for search result determination and display. The badges may be utilized for search result ranking and may be utilized to annotate search results in a search results interface.
-
公开(公告)号:US12033080B2
公开(公告)日:2024-07-09
申请号:US16442203
申请日:2019-06-14
Applicant: GOOGLE LLC
Inventor: Xinnan Yu , Shanshan Wu , Daniel Holtmann-Rice , Dmitry Storcheus , Sanjiv Kumar , Afshin Rostamizadeh
Abstract: A sparse dataset is encoded using a data-driven learned sensing matrix. For example, an example method includes receiving a dataset of sparse vectors with dimension d from a requesting process, initializing an encoding matrix of dimension k×d, selecting a subset of sparse vectors from the dataset, and updating the encoding matrix via machine learning. Updating the encoding matrix includes using a linear encoder to generate an encoded vector of dimension k for each vector in the subset, the linear encoder using the encoding matrix, using a non-linear decoder to decode each of the encoded vectors, the non-linear decoder using a transpose of the encoding matrix in a projected subgradient, and adjusting the encoding matrix using back propagation. The method also includes returning an embedding of each sparse vector in the dataset of sparse vectors, the embedding being generated with the updated encoding matrix.
-
公开(公告)号:US11763197B2
公开(公告)日:2023-09-19
申请号:US16850053
申请日:2020-04-16
Applicant: Google LLC
Inventor: Hugh Brendan McMahan , Dave Morris Bacon , Jakub Konecny , Xinnan Yu
Abstract: The present disclosure provides efficient communication techniques for transmission of model updates within a machine learning framework, such as, for example, a federated learning framework in which a high-quality centralized model is trained on training data distributed overt a large number of clients each with unreliable network connections and low computational power. In an example federated learning setting, in each of a plurality of rounds, each client independently updates the model based on its local data and communicates the updated model back to the server, where all the client-side updates are used to update a global model. The present disclosure provides systems and methods that reduce communication costs. In particular, the present disclosure provides at least: structured update approaches in which the model update is restricted to be small and sketched update approaches in which the model update is compressed before sending to the server.
-
公开(公告)号:US20230123941A1
公开(公告)日:2023-04-20
申请号:US18081376
申请日:2022-12-14
Applicant: Google LLC
Inventor: Xiang Wu , David Simcha , Daniel Holtmann-Rice , Sanjiv Kumar , Ananda Theertha Suresh , Ruiqi Guo , Xinnan Yu
Abstract: The present disclosure provides systems and methods that include or otherwise leverage use of a multiscale quantization model that is configured to provide a quantized dataset. In particular, the multiscale quantization model can receive and perform vector quantization of a first dataset. The multiscale quantization model can generate a residual dataset based at least in part on a result of the vector quantization. The multiscale quantization model can apply a rotation matrix to the residual dataset to generate a rotated residual dataset that includes a plurality of rotated residuals. The multiscale quantization model can perform reparameterization of each rotated residual in the rotated residual dataset into a direction component and a scale component. The multiscale quantization model can perform product quantization of the direction components of the plurality of rotated residuals, and perform scalar quantization of the scale components of the plurality of rotated residuals.
-
公开(公告)号:US10394777B2
公开(公告)日:2019-08-27
申请号:US14951909
申请日:2015-11-25
Applicant: Google LLC
Inventor: Xinnan Yu , Sanjiv Kumar , Ruiqi Guo
IPC: G06F17/00 , G06F16/22 , G06F16/33 , G06F16/951
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for efficiently performing linear projections. In one aspect, a method includes actions for obtaining a plurality of content items from one or more content sources. Additional actions include, extracting a plurality of features from each of the plurality of content items, generating a feature vector for each of the extracted features in order to create a search space, generating a series of element matrices based upon the generated feature vectors, transforming the series of element matrices into a structured matrix such that the transformation preserves one or more relationships associated with each element matrix of the series of element matrices, receiving a search object, searching the enhanced search space based on the received search object, provided one or more links to a content item that are responsive to the search object.
-
公开(公告)号:US12079700B2
公开(公告)日:2024-09-03
申请号:US15793455
申请日:2017-10-25
Applicant: GOOGLE LLC
Inventor: Daniel Holtmann-Rice , Sanjiv Kumar , Xinnan Yu , Krzysztof Marcin Choromanski , Ananda Theertha Suresh
Abstract: Techniques of generating input for a kernel-based machine learning system that uses a kernel to perform classification operations on data involve generating unbiased estimators for gaussian kernels according to a new framework called Structured Orthogonal Random Features (SORF). The unbiased estimator KSORF to the kernel involves a linear transformation matrix WSORF computed using products of a set of pairs of matrices, each pair including an orthogonal matrix and respective diagonal matrix whose elements are real numbers following a specified probability distribution. Typically, the orthogonal matrix is a Walsh-Hadamard matrix, the specified probability distribution is a Rademacher distribution, and there are at least two, usually three, pairs of matrices multiplied together to form the linear transformation matrix WSORF.
-
公开(公告)号:US11785073B2
公开(公告)日:2023-10-10
申请号:US17502794
申请日:2021-10-15
Applicant: Google LLC
Inventor: Ananda Theertha Suresh , Sanjiv Kumar , Hugh Brendan McMahan , Xinnan Yu
IPC: H04L67/10 , G06F17/16 , H03M7/30 , H03M7/40 , G06F17/12 , G06N20/00 , G06F17/18 , G06N7/01 , H04L67/01
CPC classification number: H04L67/10 , G06F17/12 , G06F17/16 , G06F17/18 , G06N7/01 , G06N20/00 , H03M7/3059 , H03M7/3082 , H03M7/40 , H04L67/01
Abstract: The present disclosure provides systems and methods for communication efficient distributed mean estimation. In particular, aspects of the present disclosure can be implemented by a system in which a number of vectors reside on a number of different clients, and a centralized server device seeks to estimate the mean of such vectors. According to one aspect of the present disclosure, a client computing device can rotate a vector by a random rotation matrix and then subsequently perform probabilistic quantization on the rotated vector. According to another aspect of the present disclosure, subsequent to quantization but prior to transmission, the client computing can encode the quantized vector according to a variable length coding scheme (e.g., by computing variable length codes).
-
公开(公告)号:US20210326757A1
公开(公告)日:2021-10-21
申请号:US17227851
申请日:2021-04-12
Applicant: Google LLC
Inventor: Ankit Singh Rawat , Xinnan Yu , Aditya Krishna Menon , Sanjiv Kumar
Abstract: Generally, the present disclosure is directed to systems and methods that perform spreadout regularization to enable learning of a multi-class classification model in the federated setting, where each user has access to the positive data associated with only a limited number of classes (e.g., a single class). Examples of such settings include decentralized training of face recognition models or speaker identification models, where in addition to the user specific facial images and voice samples, the class embeddings for the users also constitute sensitive information that cannot be shared with other users.
-
公开(公告)号:US20210049298A1
公开(公告)日:2021-02-18
申请号:US16994396
申请日:2020-08-14
Applicant: Google LLC
Inventor: Ananda Theertha Suresh , Xinnan Yu , Sanjiv Kumar , Sashank Jakkam Reddi , Venkatadheeraj Pichapati
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for privacy preserving training of a machine learning model.
-
公开(公告)号:US20190385063A1
公开(公告)日:2019-12-19
申请号:US16442203
申请日:2019-06-14
Applicant: GOOGLE LLC
Inventor: Xinnan Yu , Shanshan Wu , Daniel Holtmann-Rice , Dmitry Storcheus , Sanjiv Kumar , Afshin Rostamizadeh
Abstract: A sparse dataset is encoded using a data-driven learned sensing matrix. For example, an example method includes receiving a dataset of sparse vectors with dimension d from a requesting process, initializing an encoding matrix of dimension k×d, selecting a subset of sparse vectors from the dataset, and updating the encoding matrix via machine learning. Updating the encoding matrix includes using a linear encoder to generate an encoded vector of dimension k for each vector in the subset, the linear encoder using the encoding matrix, using a non-linear decoder to decode each of the encoded vectors, the non-linear decoder using a transpose of the encoding matrix in a projected subgradient, and adjusting the encoding matrix using back propagation. The method also includes returning an embedding of each sparse vector in the dataset of sparse vectors, the embedding being generated with the updated encoding matrix.
-
-
-
-
-
-
-
-
-