-
Publication No.: US12248957B2
Publication Date: 2025-03-11
Application No.: US17899202
Application Date: 2022-08-30
Applicant: Google LLC
Inventor: Aiyou Chen , Timothy Chun-Wai Au
IPC: G06Q30/0204 , G06Q30/0201
Abstract: Techniques for preparing datasets for geo experiments and improving accuracy of geo experiments are presented herein. The system can access a dataset of a plurality of geographic pairs. Additionally, the system can calculate a first outcome estimate based on a difference in response data and a difference in input data for a first geographic pair. Moreover, the system can calculate a plurality of experimental uncertainty estimates associated with the plurality of geographic pairs during an experimental time interval. The system can access historical data associated with the plurality of geographic pairs. Furthermore, the system can determine a beta value and a trim rate that reduces a sum of the plurality estimates. Subsequently, the system can remove, based on the first outcome estimate and the beta value, the first geographic pair from the plurality of geographic pairs to generate the first subset of geographic pairs.
-
Publication No.: US11514274B2
Publication Date: 2022-11-29
Application No.: US16834843
Application Date: 2020-03-30
Applicant: Google LLC
Inventor: Aiyou Chen , Timothy C. Au , Nicolas Remy , Kevin Benac
Abstract: Systems, methods and computer-readable storage media utilized to prepare datasets for geo experiments. One method includes receiving one or more input parameters. The method further includes extracting, from the data, training data. The method further includes calculating a difference in input data and a difference in response data of the training data. The method further includes determining a first plurality of geographic pairs. The method further includes extracting, from the data, evaluation data. The method further includes separating each geographic pair of the first plurality of geographic pairs into a treatment region or a control region for a plurality of simulations of a plurality of different simulation subsets for each of a plurality of different subsets of geographic pairs. The method further includes calculating a plurality of uncertainty estimates. The method further includes selecting a first subset of geographic pairs and providing the selected subset of geographic pairs.
-
Publication No.: US10740602B2
Publication Date: 2020-08-11
Application No.: US15956542
Application Date: 2018-04-18
Applicant: GOOGLE LLC
Inventor: Ivan Ordonez , Swaminathan Krishnamurthy , David Paul , Tushar Udeshi , Aiyou Chen
IPC: G06K9/00 , G06T7/70 , G06F16/28 , G06F16/583
Abstract: Systems and methods for assigning word fragments to lines of text in optical character recognition (OCR) extracted data can include at least one processor obtaining a plurality of word fragments from OCR generated data associated with an image. The at least one processor can determine vertical coordinates of each of the word fragments in the image. The at least one processor can cluster the plurality of word fragments into one or more clusters of word fragments based on the vertical coordinates of the plurality of word fragments. The at least one processor can assign each word fragment of a respective cluster to a corresponding text line based on the clustering.
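The clustering step described in this abstract can be illustrated with a minimal sketch. The abstract does not say which clustering method is used, so the gap-based grouping below, the `Fragment` tuple layout, and the `max_gap` threshold are all assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical fragment layout: (text, x coordinate, vertical centre).
Fragment = Tuple[str, float, float]


def assign_lines(fragments: List[Fragment], max_gap: float = 5.0) -> List[List[str]]:
    """Group OCR word fragments into text lines by their vertical coordinates."""
    # Sort by vertical position so fragments on the same line become neighbours.
    ordered = sorted(fragments, key=lambda f: f[2])
    clusters: List[List[Fragment]] = []
    for frag in ordered:
        # Extend the current cluster while the vertical gap stays small
        # (assumed rule); otherwise start a new cluster, i.e. a new line.
        if clusters and frag[2] - clusters[-1][-1][2] <= max_gap:
            clusters[-1].append(frag)
        else:
            clusters.append([frag])
    # Each cluster is one text line; read its fragments left to right.
    return [[f[0] for f in sorted(c, key=lambda f: f[1])] for c in clusters]


fragments = [
    ("world", 60.0, 10.5),
    ("Hello", 10.0, 10.0),
    ("line", 50.0, 30.2),
    ("Second", 10.0, 30.0),
]
lines = assign_lines(fragments)
```

In this toy input the two fragments near y = 10 form one line and the two near y = 30 form another. A production system would likely need a threshold that adapts to font size and page skew.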
-
Publication No.: US10223728B2
Publication Date: 2019-03-05
Application No.: US14564123
Application Date: 2014-12-09
Applicant: Google LLC
Inventor: Aiyou Chen , Jeffrey David Oldham
Abstract: Systems and methods of directed item consumption recommendations are disclosed which include generating, with a server, empirical transition matrix data that includes row data for a first item and column data for a second item, and an entry in the empirical transition matrix data for a number of users that acquire the second item after the first item, generating, with the server, metadata transition matrix data by partitioning items for each item metadata type, setting a uniform transition probability for all items in the partition, and summing the uniform transition probabilities across all metadata types, generating, with the server, transition probability matrix data by multiplying the metadata transition matrix data with a weight parameter, adding the result to the empirical transition matrix data, and normalizing each row, and providing item recommendations to a user computing device communicatively coupled to the server according to the generated transition probability matrix data.
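The matrix construction in this abstract can be sketched roughly as follows. This is one reading of the steps, not the patented implementation: the function name, the input layout, the choice of `1 / partition size` as the uniform probability, and the inclusion of self-transitions are assumptions.

```python
from collections import defaultdict
from typing import Dict, List


def transition_probabilities(
    empirical: List[List[float]],
    metadata: Dict[str, List[str]],
    weight: float,
) -> List[List[float]]:
    """Blend empirical transition counts with metadata-based transitions.

    empirical[i][j]: number of users who acquired item j after item i.
    metadata[type][i]: value of that metadata type for item i (hypothetical layout).
    """
    n = len(empirical)
    meta = [[0.0] * n for _ in range(n)]
    for values in metadata.values():
        # Partition the items by their value for this metadata type.
        groups = defaultdict(list)
        for item, value in enumerate(values):
            groups[value].append(item)
        # Uniform transition probability within each partition (assumed to be
        # 1 / partition size), summed across metadata types.
        for members in groups.values():
            p = 1.0 / len(members)
            for i in members:
                for j in members:
                    meta[i][j] += p
    result = []
    for i in range(n):
        # Weighted metadata matrix added to the empirical matrix, then row-normalized.
        row = [empirical[i][j] + weight * meta[i][j] for j in range(n)]
        total = sum(row)
        result.append([v / total for v in row] if total > 0 else row)
    return result


empirical = [[0, 3, 0], [1, 0, 0], [0, 0, 0]]
metadata = {"genre": ["rock", "rock", "jazz"]}
probs = transition_probabilities(empirical, metadata, weight=1.0)
```

The metadata term gives items with no observed transitions, such as the third item here, a usable row, which is presumably the motivation for blending the two matrices.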
-
Publication No.: US20230036170A1
Publication Date: 2023-02-02
Application No.: US17899202
Application Date: 2022-08-30
Applicant: Google LLC
Inventor: Aiyou Chen , Timothy Chun-Wai Au
IPC: G06Q30/02
Abstract: Techniques for preparing datasets for geo experiments and improving accuracy of geo experiments are presented herein. The system can access a dataset of a plurality of geographic pairs. Additionally, the system can calculate a first outcome estimate based on a difference in response data and a difference in input data for a first geographic pair. Moreover, the system can calculate a plurality of experimental uncertainty estimates associated with the plurality of geographic pairs during an experimental time interval. The system can access historical data associated with the plurality of geographic pairs. Furthermore, the system can determine a beta value and a trim rate that reduces a sum of the plurality estimates. Subsequently, the system can remove, based on the first outcome estimate and the beta value, the first geographic pair from the plurality of geographic pairs to generate the first subset of geographic pairs.
-
Publication No.: US20210312497A1
Publication Date: 2021-10-07
Application No.: US16834915
Application Date: 2020-03-30
Applicant: Google LLC
Inventor: Aiyou Chen , Timothy C. Au
Abstract: Systems, methods and computer-readable storage media utilized to prepare experimental datasets for experimental analysis systems. One method includes identifying, by one or more processing circuits, a dataset of a plurality of geographic pairs associated with a geo experiment. The method further includes calculating, by the one or more processing circuits, a difference in input data and a difference in response data between the first geographic region and the second geographic region of each geographic pair. The method further includes calculating, by the one or more processing circuits, a plurality of outcome estimates. The method further includes selecting, by the one or more processing circuits, a first subset of geographic pairs of the plurality of different subsets of geographic pairs based on a first outcome estimate of the plurality of outcome estimates that is about a prespecified value on the outcome estimates and providing the selected subset of geographic pairs.
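The per-pair differences named in this abstract can be sketched as below. The abstract does not state how an outcome estimate is computed from those differences, so the simple ratio used here, along with the `GeoPair` type and both function names, is an assumption for illustration only.

```python
from typing import List, NamedTuple, Tuple


class GeoPair(NamedTuple):
    """Hypothetical record for one pair: first region vs. second region."""
    first_input: float      # e.g. ad spend in the first region
    second_input: float     # e.g. ad spend in the second region
    first_response: float   # e.g. sales in the first region
    second_response: float  # e.g. sales in the second region


def pair_differences(pairs: List[GeoPair]) -> List[Tuple[float, float]]:
    """Return (difference in input, difference in response) for each pair."""
    return [
        (p.first_input - p.second_input, p.first_response - p.second_response)
        for p in pairs
    ]


def ratio_estimate(diffs: List[Tuple[float, float]]) -> float:
    """Assumed outcome estimate: total response difference per unit of input difference."""
    total_input = sum(dx for dx, _ in diffs)
    if total_input == 0:
        raise ValueError("input differences sum to zero; ratio is undefined")
    return sum(dy for _, dy in diffs) / total_input


pairs = [GeoPair(10.0, 0.0, 30.0, 10.0), GeoPair(20.0, 0.0, 50.0, 10.0)]
diffs = pair_differences(pairs)
estimate = ratio_estimate(diffs)
```

Computing such an estimate over different candidate subsets of pairs would give the "plurality of outcome estimates" the abstract mentions, though the subset-selection rule itself is not specified there.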
-
Publication No.: US20210312221A1
Publication Date: 2021-10-07
Application No.: US16834843
Application Date: 2020-03-30
Applicant: Google LLC
Inventor: Aiyou Chen , Timothy C. Au , Nicolas Remy , Kevin Benac
Abstract: Systems, methods and computer-readable storage media utilized to prepare datasets for geo experiments. One method includes receiving one or more input parameters. The method further includes extracting, from the data, training data. The method further includes calculating a difference in input data and a difference in response data of the training data. The method further includes determining a first plurality of geographic pairs. The method further includes extracting, from the data, evaluation data. The method further includes separating each geographic pair of the first plurality of geographic pairs into a treatment region or a control region for a plurality of simulations of a plurality of different simulation subsets for each of a plurality of different subsets of geographic pairs. The method further includes calculating a plurality of uncertainty estimates. The method further includes selecting a first subset of geographic pairs and providing the selected subset of geographic pairs.
-
Publication No.: US20190325211A1
Publication Date: 2019-10-24
Application No.: US15956542
Application Date: 2018-04-18
Applicant: GOOGLE LLC
Inventor: Ivan Ordonez , Swaminathan Krishnamurthy , David Paul , Tushar Udeshi , Aiyou Chen
Abstract: Systems and methods for assigning word fragments to lines of text in optical character recognition (OCR) extracted data can include at least one processor obtaining a plurality of word fragments from OCR generated data associated with an image. The at least one processor can determine vertical coordinates of each of the word fragments in the image. The at least one processor can cluster the plurality of word fragments into one or more clusters of word fragments based on the vertical coordinates of the plurality of word fragments. The at least one processor can assign each word fragment of a respective cluster to a corresponding text line based on the clustering.