-
公开(公告)号:US20210358579A1
公开(公告)日:2021-11-18
申请号:US16618656
申请日:2017-09-29
申请人: Google LLC
发明人: Kai Chen , Eyal Oren , Hector Yee , James Wilson , Alvin Rajkomar , Michaela Hardt
摘要: A method is described for training a predictive model which increases the interpretability and trustworthiness of the model for end-users. The model is trained from data having multitude of features. Each feature is associated with a real value and a time component. Many predicates (atomic elements for training the model) are defined as binary functions operating on the features, and typically time sequences of the features or logical combinations thereof. The predicates can be limited to those functions which have human understandability or encode expert knowledge relative to a predication task of the model. We iteratively train a boosting model with input from an operator or human-in-the-loop. The human-in-the-loop is provided with tools to inspect the model as it is iteratively built and remove one or more of the predicates in the model, e.g. if it does not have indicia of trustworthiness, is not causally related to a prediction of the model, or is not understandable. We repeat the iterative process several times ultimately generate a final boosting model. The final model is then evaluated, e.g., for accuracy, complexity, trustworthiness and post-hoc explainability.
-
公开(公告)号:US20240211759A1
公开(公告)日:2024-06-27
申请号:US18596535
申请日:2024-03-05
申请人: Google LLC
发明人: Gregory S. Corrado , Kai Chen , Jeffrey A. Dean , Gary R. Holt , Julian P. Grady , Sharat Chikkerur , David W. Sculley, II
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using embedded function with a deep network. One of the methods includes receiving an input comprising a plurality of features, wherein each of the features is of a different feature type; processing each of the features using a respective embedding function to generate one or more numeric values, wherein each of the embedding functions operates independently of each other embedding function, and wherein each of the embedding functions is used for features of a respective feature type; processing the numeric values using a deep network to generate a first alternative representation of the input, wherein the deep network is a machine learning model composed of a plurality of levels of non-linear operations; and processing the first alternative representation of the input using a logistic regression classifier to predict a label for the input.
-
公开(公告)号:US20210125222A1
公开(公告)日:2021-04-29
申请号:US17140721
申请日:2021-01-04
申请人: Google LLC
IPC分类号: G06Q30/02 , G06F16/951 , G06F16/738 , G06F16/78
摘要: In general, in one aspect, a method includes compiling user interaction statistics for a set of content items displayed in association with a first target media document having a non-textual portion, at least some of the content items associated with one or more keywords, based on the interaction statistics, associating the first target media document with at least some of the keywords associated with the content items, and based on a common attribute of the first target media document and a second target media document having a non-textual portion, associating the second target media document with at least some of the keywords assigned to the first target media document. Other aspects include corresponding systems, apparatus, and computer programs stored on computer storage devices.
-
公开(公告)号:US10679124B1
公开(公告)日:2020-06-09
申请号:US15368460
申请日:2016-12-02
申请人: Google LLC
发明人: Gregory S. Corrado , Kai Chen , Jeffrey A. Dean , Gary R. Holt , Julian P. Grady , Sharat Chikkerur , David W. Sculley, II
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using embedded function with a deep network. One of the methods includes receiving an input comprising a plurality of features, wherein each of the features is of a different feature type; processing each of the features using a respective embedding function to generate one or more numeric values, wherein each of the embedding functions operates independently of each other embedding function, and wherein each of the embedding functions is used for features of a respective feature type; processing the numeric values using a deep network to generate a first alternative representation of the input, wherein the deep network is a machine learning model composed of a plurality of levels of non-linear operations; and processing the first alternative representation of the input using a logistic regression classifier to predict a label for the input.
-
5.
公开(公告)号:US11935634B2
公开(公告)日:2024-03-19
申请号:US15690721
申请日:2017-08-30
申请人: Google LLC
发明人: Alexander Mossin , Alvin Rajkomar , Eyal Oren , James Wilson , James Wexler , Patrik Sundberg , Andrew Dai , Yingwei Cui , Gregory Corrado , Hector Yee , Jacob Marcus , Jeffrey Dean , Benjamin Irvine , Kai Chen , Kun Zhang , Michaela Hardt , Xiaomi Sun , Nissan Hajaj , Peter Junteng Liu , Quoc Le , Xiaobing Liu , Yi Zhang
摘要: A system for predicting and summarizing medical events from electronic health records includes a computer memory storing aggregated electronic health records from a multitude of patients of diverse age, health conditions, and demographics including medications, laboratory values, diagnoses, vital signs, and medical notes. The aggregated electronic health records are converted into a single standardized data structure format and ordered arrangement per patient, e.g., into a chronological order. A computer (or computer system) executes one or more deep learning models trained on the aggregated health records to predict one or more future clinical events and summarize pertinent past medical events related to the predicted events on an input electronic health record of a patient having the standardized data structure format and ordered into a chronological order. An electronic device configured with a healthcare provider-facing interface displays the predicted one or more future clinical events and the pertinent past medical events of the patient.
-
公开(公告)号:US20230325657A1
公开(公告)日:2023-10-12
申请号:US17972466
申请日:2022-10-24
申请人: Google LLC
发明人: Gregory S. Corrado , Kai Chen , Jeffrey A. Dean , Gary R. Holt , Julian P. Grady , Sharat Chikkerur , David W. Sculley, II
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using embedded function with a deep network. One of the methods includes receiving an input comprising a plurality of features, wherein each of the features is of a different feature type; processing each of the features using a respective embedding function to generate one or more numeric values, wherein each of the embedding functions operates independently of each other embedding function, and wherein each of the embedding functions is used for features of a respective feature type; processing the numeric values using a deep network to generate a first alternative representation of the input, wherein the deep network is a machine learning model composed of a plurality of levels of non-linear operations; and processing the first alternative representation of the input using a logistic regression classifier to predict a label for the input.
-
公开(公告)号:US11687832B1
公开(公告)日:2023-06-27
申请号:US16983979
申请日:2020-08-03
申请人: Google LLC
发明人: Gregory S. Corrado , Kai Chen , Jeffrey A. Dean , Samy Bengio , Rajat Monga , Matthieu Devin
IPC分类号: G06N20/00 , G06N3/063 , G06N3/08 , G06N7/08 , G06N5/025 , G06F18/214 , G06F18/2411 , G06N7/01
CPC分类号: G06N20/00 , G06N3/063 , G06N3/08 , G06N7/08 , G06F18/214 , G06F18/2411 , G06N5/025 , G06N7/01
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a model using parameter server shards. One of the methods includes receiving, at a parameter server shard configured to maintain values of a disjoint partition of the parameters of the model, a succession of respective requests for parameter values from each of a plurality of replicas of the model; in response to each request, downloading a current value of each requested parameter to the replica from which the request was received; receiving a succession of uploads, each upload including respective delta values for each of the parameters in the partition maintained by the shard; and updating values of the parameters in the partition maintained by the parameter server shard repeatedly based on the uploads of delta values to generate current parameter values.
-
公开(公告)号:US10922488B1
公开(公告)日:2021-02-16
申请号:US16363460
申请日:2019-03-25
申请人: Google LLC
发明人: Tomas Mikolov , Kai Chen , Gregory S. Corrado , Jeffrey A. Dean
IPC分类号: G10L15/00 , G06F40/279 , G10L15/06 , G06N20/00 , G06F40/30
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for computing numeric representations of words. One of the methods includes obtaining a set of training data, wherein the set of training data comprises sequences of words; training a classifier and an embedding function on the set of training data, wherein training the embedding function comprises obtained trained values of the embedding function parameters; processing each word in the vocabulary using the embedding function in accordance with the trained values of the embedding function parameters to generate a respective numerical representation of each word in the vocabulary in the high-dimensional space; and associating each word in the vocabulary with the respective numeric representation of the word in the high-dimensional space.
-
公开(公告)号:US20180157869A1
公开(公告)日:2018-06-07
申请号:US15856619
申请日:2017-12-28
申请人: Google LLC
发明人: Robert Kniaz , Abhinay Sharma , Kai Chen , Sam Mardanbeigi
CPC分类号: G06F21/84 , G06F21/10 , G06Q30/02 , G06Q30/0242 , G06Q30/0246 , G06Q30/0273 , G06Q30/0277 , G06Q30/04
摘要: Systems and methods are provided to allow advertisers to make ads available to publishers through an advertising system. The advertising system provides tamper proof tracking of conversion activity between publishers and advertisers. Further, advertisers can define plural different conversions to be associated with a single ad click through.
-
公开(公告)号:US10733535B1
公开(公告)日:2020-08-04
申请号:US15665236
申请日:2017-07-31
申请人: Google LLC
发明人: Gregory S. Corrado , Kai Chen , Jeffrey A. Dean , Samy Bengio , Rajat Monga , Matthieu Devin
摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a model using parameter server shards. One of the methods includes receiving, at a parameter server shard configured to maintain values of a disjoint partition of the parameters of the model, a succession of respective requests for parameter values from each of a plurality of replicas of the model; in response to each request, downloading a current value of each requested parameter to the replica from which the request was received; receiving a succession of uploads, each upload including respective delta values for each of the parameters in the partition maintained by the shard; and updating values of the parameters in the partition maintained by the parameter server shard repeatedly based on the uploads of delta values to generate current parameter values.
-
-
-
-
-
-
-
-
-