-
Publication No.: US11232105B2
Publication Date: 2022-01-25
Application No.: US16288985
Filing Date: 2019-02-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Khai Tran, Harsh Shah, Maneesh Varshney
IPC: G06F7/00, G06F16/2455, G06F16/242, G06F16/2453, G06F16/2452
Abstract: A unified metrics computation platform decouples user-facing query languages from backend execution engines with the help of an intermediate platform-agnostic language based on a relational algebra procedural query language. The user needs to specify the computation logic in only one place. By leveraging the intermediate language representation, the unified metrics computation platform can execute the same computation logic on multiple execution engines. The unified metrics computation platform unifies batch, nearline, and interactive computations by automatically converting existing user-defined batch logic into nearline logic. The user is presented with a unified view of the batch and nearline computation results.
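Illustrative sketch (not taken from the patent): one way to picture the decoupling described in this abstract is a small relational-algebra plan that is written once and then handed to two different backends. The node types `Scan`, `Filter`, and `SumBy`, and the functions `to_sql` and `run_in_memory`, are hypothetical names chosen for this example; SQLite merely stands in for "an execution engine", and the batch-to-nearline conversion is not modeled.

```python
import sqlite3
from dataclasses import dataclass

# Hypothetical intermediate representation: a tiny relational-algebra tree.
@dataclass
class Scan:
    table: str

@dataclass
class Filter:
    child: object
    column: str
    value: object

@dataclass
class SumBy:
    child: object
    group: str
    measure: str

def to_sql(node):
    """Backend 1: render the plan as SQL text for a SQL engine."""
    if isinstance(node, Scan):
        return f"SELECT * FROM {node.table}"
    if isinstance(node, Filter):
        # repr() is only adequate for simple literals in this toy example.
        return (f"SELECT * FROM ({to_sql(node.child)}) "
                f"WHERE {node.column} = {node.value!r}")
    if isinstance(node, SumBy):
        return (f"SELECT {node.group}, SUM({node.measure}) "
                f"FROM ({to_sql(node.child)}) "
                f"GROUP BY {node.group} ORDER BY {node.group}")
    raise TypeError(f"unknown plan node: {node!r}")

def run_in_memory(node, tables):
    """Backend 2: interpret the same plan over lists of dict rows."""
    if isinstance(node, Scan):
        return list(tables[node.table])
    if isinstance(node, Filter):
        rows = run_in_memory(node.child, tables)
        return [r for r in rows if r[node.column] == node.value]
    if isinstance(node, SumBy):
        totals = {}
        for r in run_in_memory(node.child, tables):
            totals[r[node.group]] = totals.get(r[node.group], 0) + r[node.measure]
        return sorted(totals.items())
    raise TypeError(f"unknown plan node: {node!r}")

# The computation logic is specified once ...
plan = SumBy(Filter(Scan("events"), "platform", "web"), "country", "views")

rows = [
    {"country": "US", "platform": "web", "views": 3},
    {"country": "US", "platform": "app", "views": 10},
    {"country": "DE", "platform": "web", "views": 5},
    {"country": "US", "platform": "web", "views": 4},
]

# ... and executed on two engines.
memory_result = run_in_memory(plan, {"events": rows})

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE events (country TEXT, platform TEXT, views INTEGER)")
conn.executemany("INSERT INTO events VALUES (:country, :platform, :views)", rows)
sql_result = conn.execute(to_sql(plan)).fetchall()
```

Both `memory_result` and `sql_result` hold the per-country totals for web traffic, which is the point of routing every engine through the same intermediate plan.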
-
Publication No.: US11194812B2
Publication Date: 2021-12-07
Application No.: US16234331
Filing Date: 2018-12-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Min Shen, Maneesh Varshney, David J. Stein, Jian Qiao
IPC: G06F16/00, G06F16/2455, G06F16/2458, G06F16/22, G06F16/23
Abstract: The disclosed embodiments provide a system for processing data. During operation, the system organizes fact data to be aggregated into sliding time window features and observation data associated with the fact data into a set of partitions based on a join key. Next, the system sorts the fact data and the observation data within the set of partitions by the join key and timestamps associated with the fact data and the observation data. For each observation record in the observation data, the system aggregates fact records in the sorted fact data that share a value of the join key with the observation record and that fall within a first time window associated with the observation record to produce a sliding time window feature. The system then stores the sliding time window feature in association with the observation record.
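Illustrative sketch (not taken from the patent): the partition, sort, and aggregate steps in this abstract can be approximated in a few lines. The function name `sliding_window_features`, the tuple layouts, the use of a sum as the aggregate, and the half-open window `(ts - window, ts]` are all assumptions made for the example.

```python
import bisect
from collections import defaultdict

def sliding_window_features(facts, observations, window, num_partitions=4):
    """facts: iterable of (key, ts, value); observations: iterable of (key, ts).

    Returns {(key, ts): sum of fact values with the same key and a
    timestamp in (ts - window, ts]}.
    """
    # Step 1: organize both datasets into partitions based on the join key.
    fact_parts = defaultdict(list)
    obs_parts = defaultdict(list)
    for key, ts, value in facts:
        fact_parts[hash(key) % num_partitions].append((key, ts, value))
    for key, ts in observations:
        obs_parts[hash(key) % num_partitions].append((key, ts))

    features = {}
    for part, part_obs in obs_parts.items():
        # Step 2: sort within the partition by (join key, timestamp).
        part_facts = sorted(fact_parts.get(part, []))
        sort_keys = [(k, t) for k, t, _ in part_facts]
        for key, ts in sorted(part_obs):
            # Step 3: locate facts sharing the key inside the time window.
            lo = bisect.bisect_right(sort_keys, (key, ts - window))
            hi = bisect.bisect_right(sort_keys, (key, ts))
            # Step 4: store the aggregate alongside the observation record.
            features[(key, ts)] = sum(v for _, _, v in part_facts[lo:hi])
    return features

facts = [("a", 1, 1.0), ("a", 5, 2.0), ("a", 9, 4.0), ("b", 5, 10.0)]
observations = [("a", 6), ("a", 10), ("b", 4)]
features = sliding_window_features(facts, observations, window=5)
```

Because facts and observations for a given key land in the same partition and are sorted the same way, each observation only needs a range lookup rather than a scan of all fact data.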
-
Publication No.: US20190339969A1
Publication Date: 2019-11-07
Application No.: US15973367
Filing Date: 2018-05-07
Applicant: Microsoft Technology Licensing, LLC
Inventor: Walaa Eldin M. Moustafa, Ratandeep S. Ratti, Maneesh Varshney, Carl W. Steinbach, Adwait N. Tumbde, Khai Quang Tran, Vasanth Rajamani, Suja Viswesan, Shrikanth Shankar
Abstract: The disclosed embodiments provide a system for processing data. During operation, the system obtains a user-defined function (UDF) that is defined using a standard UDF interface for accessing multiple platforms. Next, the system converts data in a platform into a standard form that is supported by the standard UDF interface using a platform-specific wrapper for the UDF and a platform-specific implementation of the standard UDF interface. The system then generates output data in the standard form by applying one or more operations to the data in the standard form. Finally, the system converts the output data into a platform-specific form that is supported by the platform and returns the output data in the platform-specific form.
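Illustrative sketch (not taken from the patent): the wrapper pattern in this abstract can be pictured as a UDF written once against a standard interface, with each platform supplying its own conversion to and from the standard form. The class names `StdUDF`, `UpperCase`, `BytesPlatformWrapper`, and `CodePointPlatformWrapper`, and the two toy "platforms", are hypothetical.

```python
from abc import ABC, abstractmethod

class StdUDF(ABC):
    """Hypothetical standard UDF interface: operates on plain Python str."""
    @abstractmethod
    def eval(self, *args):
        ...

class UpperCase(StdUDF):
    """The UDF logic, written once against the standard form."""
    def eval(self, text):
        return text.upper()

class BytesPlatformWrapper:
    """Toy platform whose native string form is UTF-8 bytes."""
    def __init__(self, udf):
        self.udf = udf

    def call(self, *raw_args):
        std_args = [a.decode("utf-8") for a in raw_args]  # platform -> standard
        std_out = self.udf.eval(*std_args)                # run in standard form
        return std_out.encode("utf-8")                    # standard -> platform

class CodePointPlatformWrapper:
    """Toy platform whose native string form is a list of code points."""
    def __init__(self, udf):
        self.udf = udf

    def call(self, *raw_args):
        std_args = ["".join(chr(c) for c in a) for a in raw_args]
        std_out = self.udf.eval(*std_args)
        return [ord(c) for c in std_out]

bytes_result = BytesPlatformWrapper(UpperCase()).call(b"abc")
points_result = CodePointPlatformWrapper(UpperCase()).call([104, 105])
```

The UDF body never sees a platform-specific type; only the wrappers know how each platform represents its data.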
-
Publication No.: US11068509B2
Publication Date: 2021-07-20
Application No.: US16146750
Filing Date: 2018-09-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Guillaume B. Saint-Jacques, Jeremy John Simpson, Maneesh Varshney, Ya Xu
IPC: G06F16/28, G06Q30/02, G06F16/951, G06F16/2453, G06F16/2455
Abstract: During operation, a system obtains a graph that includes a set of nodes representing users in an online network and a set of edges representing relationships or interactions between pairs of the nodes. Next, the system selects egos containing a subset of the users that are representative of a distribution of an attribute across the users. The system then generates clusters of users, such that each cluster contains an ego and alters selected from the ego's connections in the graph. The system further divides the clusters between a treatment group and a control group, exposes alters in the treatment group to a treatment variant of an A/B test, and exposes alters in the control group to a control variant of the A/B test. Finally, the system outputs a result of the A/B test.
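Illustrative sketch (not taken from the patent): a simplified reading of the ego-cluster procedure. Egos are drawn by stratified sampling over the attribute (an assumption about what "representative" means here), each ego receives up to `max_alters` unclaimed neighbors as alters, and whole clusters are randomized between variants. The function names and parameters are hypothetical.

```python
import random
from collections import defaultdict

def build_ego_clusters(graph, attribute, fraction, max_alters, seed=0):
    """graph: {user: set of neighbors}; attribute: {user: attribute value}."""
    rng = random.Random(seed)
    # Stratify users by attribute value and sample the same fraction from
    # each stratum so the egos roughly mirror the attribute distribution.
    strata = defaultdict(list)
    for user in sorted(graph):
        strata[attribute[user]].append(user)
    egos = []
    for value in sorted(strata):
        members = strata[value]
        k = max(1, round(fraction * len(members)))
        egos.extend(rng.sample(members, k))

    ego_set = set(egos)
    claimed = set()
    clusters = {}
    for ego in egos:
        # Alters come from the ego's own connections; a user is an alter
        # in at most one cluster and egos are never alters.
        candidates = [n for n in sorted(graph[ego])
                      if n not in ego_set and n not in claimed]
        clusters[ego] = candidates[:max_alters]
        claimed.update(clusters[ego])
    return clusters

def assign_variants(clusters, seed=0):
    """Randomize whole clusters, then expose each alter to its cluster's variant."""
    rng = random.Random(seed)
    egos = sorted(clusters)
    rng.shuffle(egos)
    half = len(egos) // 2
    assignment = {}
    for i, ego in enumerate(egos):
        variant = "treatment" if i < half else "control"
        for alter in clusters[ego]:
            assignment[alter] = variant
    return assignment

graph = {1: {2, 3, 4}, 2: {1, 3}, 3: {1, 2}, 4: {1, 5},
         5: {4, 6, 7}, 6: {5}, 7: {5, 8}, 8: {7}}
attribute = {1: "x", 2: "x", 3: "x", 4: "x", 5: "y", 6: "y", 7: "y", 8: "y"}
clusters = build_ego_clusters(graph, attribute, fraction=0.25, max_alters=2, seed=1)
assignment = assign_variants(clusters, seed=1)
```

Randomizing at the cluster level means all alters around a given ego see the same variant, which is what lets the test measure effects that travel across connections.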
-
Publication No.: US20200210430A1
Publication Date: 2020-07-02
Application No.: US16234331
Filing Date: 2018-12-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Min Shen, Maneesh Varshney, David J. Stein, Jian Qiao
IPC: G06F16/2455, G06F16/2458, G06F16/23, G06F16/22
Abstract: The disclosed embodiments provide a system for processing data. During operation, the system organizes fact data to be aggregated into sliding time window features and observation data associated with the fact data into a set of partitions based on a join key. Next, the system sorts the fact data and the observation data within the set of partitions by the join key and timestamps associated with the fact data and the observation data. For each observation record in the observation data, the system aggregates fact records in the sorted fact data that share a value of the join key with the observation record and that fall within a first time window associated with the observation record to produce a sliding time window feature. The system then stores the sliding time window feature in association with the observation record.
-
Publication No.: US20200210525A1
Publication Date: 2020-07-02
Application No.: US16235921
Filing Date: 2018-12-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jaewon Yang, Maneesh Varshney, Mikhail Obukhov, Sung Yoon
IPC: G06F17/27, G06F16/28, G06F16/332, G06F16/33, G06F16/338, G06N7/00, G06N20/00
Abstract: Techniques for predicting query language statements from natural language analytic questions. In one embodiment, for example, a computer-implemented method includes: receiving an input natural language analytic question; for each slot of a plurality of slots of a query language statement, using a multi-class classifier, trained on a set of possible values for the slot and a set of natural language analytic seed questions, to predict a probability, for each possible value in the set of possible values for the slot, that the input natural language analytic question is directed to the possible value; for each slot of the plurality of slots, selecting a particular possible value, of the set of possible values for the slot, to fill the slot based on the probability predicted for the slot; and generating the query language statement with the particular possible value selected for each slot of the plurality of slots.
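Illustrative sketch (not taken from the patent): the per-slot classification described in this abstract, with a tiny word-level naive Bayes model standing in for whatever multi-class classifier is actually used. The class `SlotClassifier`, the function `predict_statement`, the seed questions, and the SQL-like template are all hypothetical.

```python
import math
from collections import Counter, defaultdict

class SlotClassifier:
    """Multinomial naive Bayes over words, trained on (seed question, slot value) pairs."""
    def __init__(self, seed_questions):
        self.word_counts = defaultdict(Counter)
        self.class_counts = Counter()
        self.vocab = set()
        for text, value in seed_questions:
            words = text.lower().split()
            self.word_counts[value].update(words)
            self.class_counts[value] += 1
            self.vocab.update(words)

    def predict_proba(self, question):
        """Return {slot value: probability that the question is directed to it}."""
        words = [w for w in question.lower().split() if w in self.vocab]
        total = sum(self.class_counts.values())
        log_scores = {}
        for value, n in self.class_counts.items():
            counts = self.word_counts[value]
            denom = sum(counts.values()) + len(self.vocab)  # Laplace smoothing
            score = math.log(n / total)
            for w in words:
                score += math.log((counts[w] + 1) / denom)
            log_scores[value] = score
        top = max(log_scores.values())
        unnorm = {v: math.exp(s - top) for v, s in log_scores.items()}
        z = sum(unnorm.values())
        return {v: u / z for v, u in unnorm.items()}

def predict_statement(question, classifiers, template):
    """Fill every slot with its most probable value and render the statement."""
    filled = {}
    for slot, clf in classifiers.items():
        probs = clf.predict_proba(question)
        filled[slot] = max(probs, key=probs.get)
    return template.format(**filled)

classifiers = {
    "metric": SlotClassifier([
        ("how many page views last week", "page_views"),
        ("page views per day", "page_views"),
        ("total sign ups by country", "signups"),
        ("number of sign ups yesterday", "signups"),
    ]),
    "dimension": SlotClassifier([
        ("page views by country", "country"),
        ("sign ups by country", "country"),
        ("page views by device", "device"),
        ("sign ups per device type", "device"),
    ]),
}
template = "SELECT SUM({metric}) FROM metrics GROUP BY {dimension}"
statement = predict_statement("page views by country", classifiers, template)
```

Treating each slot as its own classification problem keeps the output space small and guarantees that the generated statement is always well formed.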
-
Publication No.: US10191948B2
Publication Date: 2019-01-29
Application No.: US15056996
Filing Date: 2016-02-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Srinivas S. Vemuri, Wenlei Xie, Suvodeep Pyne, Vinitha Reddy Gankidi, Maneesh Varshney, Mitul Tiwari
IPC: G06F17/30
Abstract: This disclosure is directed to large-scale graph processing to determine second-degree connections for members of a social network. A social graph is duplicated into two graphs, each of which is partitioned into various partitions. The partitions are each sorted according to a predetermined key selected from each of the graphs. The partitions are then assigned logical Work Units, where a first set of Work Units is determined from a first graph and a second set of Work Units is determined from a second graph. The Work Units are determined to be asymmetrical such that the partitions of the first set of Work Units are assigned differently than the partitions of the second set of Work Units. One set of Work Units is loaded in memory and another set of Work Units is streamed to a mapping module process, which determines the second-degree connections from the sets of Work Units.
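Illustrative sketch (not taken from the patent): one possible reading of the two-copy join. The first copy of the graph is partitioned by member ID and held in memory; the second copy is partitioned by the neighbor's ID and streamed past it, so every streamed edge finds its neighbor's adjacency list in the in-memory partition. The function name, the integer member IDs, the modulo partitioning, and the final global filtering step are simplifying assumptions, and the abstract's Work Unit assignment is not modeled in detail.

```python
from collections import defaultdict

def second_degree_connections(adjacency, num_partitions=2):
    """adjacency: {member_id: set of first-degree connections}, symmetric, int IDs."""
    # Copy A: adjacency lists partitioned by the member's own ID (kept in memory).
    memory_parts = defaultdict(dict)
    for member, friends in adjacency.items():
        memory_parts[member % num_partitions][member] = friends

    # Copy B: edges partitioned by the *friend's* ID (streamed). Keying the two
    # copies differently is the asymmetry this sketch relies on.
    stream_parts = defaultdict(list)
    for member, friends in adjacency.items():
        for friend in friends:
            stream_parts[friend % num_partitions].append((friend, member))

    reachable = defaultdict(set)
    for part in range(num_partitions):
        in_memory = memory_parts[part]
        # Stream edges in sorted order; each lookup hits the in-memory partition.
        for friend, member in sorted(stream_parts[part]):
            reachable[member].update(in_memory[friend])

    # Drop the member itself and existing first-degree connections.
    return {m: reachable[m] - adjacency[m] - {m} for m in adjacency}

# A simple path 1 - 2 - 3 - 4.
path = {1: {2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
second = second_degree_connections(path)
```

Only one copy ever needs to fit in memory per partition, which is the property that makes this style of join attractive for very large graphs.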
-
Publication No.: US10990612B2
Publication Date: 2021-04-27
Application No.: US16235910
Filing Date: 2018-12-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Maneesh Varshney, Sunil Nagaraj
IPC: G06F16/28, G06F16/25, G06F16/22, G06F16/2455
Abstract: A computer system and method provide efficient, intuitive, and human-friendly ways of expressing metric-centric queries about multidimensional database data and, at the same time, provide a level of flexibility for expressing various metric-centric queries in an interactive manner. The expression techniques help to more easily formulate metric-centric queries that require aggregation operations and joins. The flexible techniques facilitate interactive analysis of multidimensional data from different perspectives including consolidating, drilling down, and slicing and dicing the data. As a result, the analysis task for the data analyst and the programming task for the application developer may be easier, more efficient, and more effective.
-
Publication No.: US20200278969A1
Publication Date: 2020-09-03
Application No.: US16288985
Filing Date: 2019-02-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Khai Tran, Harsh Shah, Maneesh Varshney
IPC: G06F16/2455, G06F16/242, G06F16/2452, G06F16/2453
Abstract: A unified metrics computation platform decouples user-facing query languages from backend execution engines with the help of an intermediate platform-agnostic language based on a relational algebra procedural query language. The user needs to specify the computation logic in only one place. By leveraging the intermediate language representation, the unified metrics computation platform can execute the same computation logic on multiple execution engines. The unified metrics computation platform unifies batch, nearline, and interactive computations by automatically converting existing user-defined batch logic into nearline logic. The user is presented with a unified view of the batch and nearline computation results.
-
Publication No.: US20200210524A1
Publication Date: 2020-07-02
Application No.: US16235916
Filing Date: 2018-12-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jaewon Yang, Maneesh Varshney, Mikhail Obukhov, Sung Yoon
IPC: G06F17/27, G06F16/28, G06F16/33, G06F16/332, G06N20/20
Abstract: Online analytical processing system supporting natural language analytic questions. In one embodiment, for example, a computer-implemented method includes: receiving a natural language question; determining an intent of the natural language question; based on the intent of the natural language question, predicting a metric query language statement based on the natural language question; translating the metric query language statement to a structured query language statement; causing an execution of the structured query language statement against multidimensional database data; and providing an answer to the natural language question based on a result of the execution of the structured query language statement against the multidimensional database data.
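Illustrative sketch (not taken from the patent): the last two steps of the method, translating a metric query statement to SQL and executing it, shown against an in-memory SQLite table. The `METRIC <measure> BY <dimension>` syntax, the `facts` table, and the function names are invented for this example; intent detection and statement prediction are out of scope here.

```python
import sqlite3

def translate_metric_query(statement, table="facts"):
    """Translate 'METRIC <measure> BY <dimension>' (a made-up syntax) into SQL."""
    tokens = statement.split()
    if (len(tokens) != 4 or tokens[0].upper() != "METRIC"
            or tokens[2].upper() != "BY"):
        raise ValueError(f"unsupported metric query: {statement!r}")
    measure, dimension = tokens[1], tokens[3]
    # Only allow plain identifiers so the generated SQL stays well formed.
    for name in (measure, dimension, table):
        if not name.isidentifier():
            raise ValueError(f"invalid identifier: {name!r}")
    return (f"SELECT {dimension}, SUM({measure}) FROM {table} "
            f"GROUP BY {dimension} ORDER BY {dimension}")

def answer(conn, statement):
    """Execute the translated statement and return the result rows."""
    return conn.execute(translate_metric_query(statement)).fetchall()

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE facts (country TEXT, page_views INTEGER)")
conn.executemany("INSERT INTO facts VALUES (?, ?)",
                 [("US", 3), ("US", 4), ("DE", 5)])

sql = translate_metric_query("METRIC page_views BY country")
result = answer(conn, "METRIC page_views BY country")
```

Keeping the metric statement separate from the SQL means the question-understanding stage only has to predict a short, constrained statement, while aggregation details stay in the translator.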