Unified metrics computation platform

    公开(公告)号:US11232105B2

    公开(公告)日:2022-01-25

    申请号:US16288985

    申请日:2019-02-28

    Abstract: A unified metrics computation platform decouples user-facing query languages from backend execution engines with the help of an intermediate platform-agnostic language, based on relational algebra procedural query language. The user needs to only specify the computation logic at a single place. By leveraging the intermediate language representation, the unified metrics computation platform can execute the same computation logic on multiple execution engines. The unified metrics computation platform unifies batch, nearline and interactive computations by automatically converting existing user defined batch logic into nearline logic. The user is presented with a unified view of the batch and nearline computation results.

    Efficient aggregation of sliding time window features

    公开(公告)号:US11194812B2

    公开(公告)日:2021-12-07

    申请号:US16234331

    申请日:2018-12-27

    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system organizes fact data to be aggregated into sliding time window features and observation data associated with the fact data into a set of partitions based on a join key. Next, the system sorts the fact data and the observation data within the set of partitions by the join key and timestamps associated with the fact data and the observation data. For each observation record in the observation data, the system aggregates fact records in the sorted fact data that share a value of the join key with the observation record and that fall within a first time window associated with the observation record to produce a sliding time window feature. The system then stores the sliding time window feature in association with the observation record.

    A/B testing using ego network clusters

    公开(公告)号:US11068509B2

    公开(公告)日:2021-07-20

    申请号:US16146750

    申请日:2018-09-28

    Abstract: During operation, a system obtains a graph that includes a set of nodes representing users in an online network and a set of edges representing relationships or interactions between pairs of the nodes. Next, the system selects egos containing a subset of the users that are representative of a distribution of an attribute across the users. The system then generates clusters of users, such that each cluster contains an ego and alters selected from the ego's connections in the graph. The system further divides the clusters between a treatment group and a control group, exposes alters in the treatment group to a treatment variant of an A/B test, and exposes alters in the control group to a control variant of the A/B test. Finally, the system outputs a result of the A/B test.

    EFFICIENT AGGREGATION OF SLIDING TIME WINDOW FEATURES

    公开(公告)号:US20200210430A1

    公开(公告)日:2020-07-02

    申请号:US16234331

    申请日:2018-12-27

    Abstract: The disclosed embodiments provide a system for processing data. During operation, the system organizes fact data to be aggregated into sliding time window features and observation data associated with the fact data into a set of partitions based on a join key. Next, the system sorts the fact data and the observation data within the set of partitions by the join key and timestamps associated with the fact data and the observation data. For each observation record in the observation data, the system aggregates fact records in the sorted fact data that share a value of the join key with the observation record and that fall within a first time window associated with the observation record to produce a sliding time window feature. The system then stores the sliding time window feature in association with the observation record.

    PREDICTING QUERY LANGUAGE STATEMENTS FROM NATURAL LANGUAGE ANALYTIC QUESTIONS

    公开(公告)号:US20200210525A1

    公开(公告)日:2020-07-02

    申请号:US16235921

    申请日:2018-12-28

    Abstract: Techniques for predicting query language statements from natural language analytic questions. In one embodiment, for example, a computer-implemented method includes: receiving an input natural language analytic question; for each slot of a plurality of slots of a query language statement, using a multi-class classifier, trained on a set of possible values for the slot and a set of natural language analytic seed questions, to predict a probability, for each possible value in the set of possible values for the slot, that the input natural language analytic question is directed to the possible value; for each slot of the plurality of slots, selecting a particular possible value, of the set of possible values for the slot, to fill the slot based on the probability predicted for the slot; and generating the query language statement with the particular possible value selected for each slot of the plurality of slot.

    Joins and aggregations on massive graphs using large-scale graph processing

    公开(公告)号:US10191948B2

    公开(公告)日:2019-01-29

    申请号:US15056996

    申请日:2016-02-29

    Abstract: This disclosure is directed to large-scale graph processing to determine second-degree connections for members of a social network. A social graph is duplicated into two graphs, where each of the two graphs are partitioned into various partitions. The partitions are each sorted according to a predetermined key selected from each of the graphs. The partitions are then assigned logical Work Units, where a first set of Work Units are determined from a first graph and second set of Work Units are determined from a second graph. The Work Units are determined to be asymmetrical such that the partitions of the first set of Work Units are assigned differently than the partitions of the second set of Work Units. One set of Work Units are loaded in-memory and another set of Work Units are streamed to a mapping module process, which determines the second-degree connections from the sets of Work Units.

    Metric-centric transformations of multidimensional database data

    公开(公告)号:US10990612B2

    公开(公告)日:2021-04-27

    申请号:US16235910

    申请日:2018-12-28

    Abstract: A computer system and method provide efficient, intuitive, and human-friendly ways of expressing metric-centric queries about multidimensional database data and, at the same, provide a level of flexibility for expressing various metric-centric queries in an interactive manner. The expression techniques help to more easily formulate metric-centric queries that require aggregation operations and joins. The flexible techniques facilitate interactive analysis of multidimensional data from different perspectives including consolidating, drilling down, and slicing and dicing the data. As a result, the analysis task for the data analyst and the programming task for the application developer may be easier, more efficient, and more effective.

    UNIFIED METRICS COMPUTATION PLATFORM
    9.
    发明申请

    公开(公告)号:US20200278969A1

    公开(公告)日:2020-09-03

    申请号:US16288985

    申请日:2019-02-28

    Abstract: A unified metrics computation platform decouples user-facing query languages from backend execution engines with the help of an intermediate platform-agnostic language, based on relational algebra procedural query language. The user needs to only specify the computation logic at a single place. By leveraging the intermediate language representation, the unified metrics computation platform can execute the same computation logic on multiple execution engines. The unified metrics computation platform unifies batch, nearline and interactive computations by automatically converting existing user defined batch logic into nearline logic. The user is presented with a unified view of the batch and nearline computation results.

    ANALYTICAL PROCESSING SYSTEM SUPPORTING NATURAL LANGUAGE ANALYTIC QUESTIONS

    公开(公告)号:US20200210524A1

    公开(公告)日:2020-07-02

    申请号:US16235916

    申请日:2018-12-28

    Abstract: Online analytical processing system supporting natural language analytic questions. In one embodiments, for example, a computer-implemented method includes: receiving a natural language question; determining an intent of the natural language question; based on the intent of the natural language question, predicting a metric query language statement based on the natural language question; translating the metric query language statement to a structured query language statement; causing an execution of the structured query language statement against multidimensional database data; and providing an answer to the natural language question based on a result of the execution of the structured query language statement against the multidimensional database data.

Patent Agency Ranking