Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for dynamically generating and configuring pre-aggregated datasets optimized for responding to particular types of data requests made against a large sub-optimal multidimensional dataset are disclosed. A dynamic aggregator monitors the query types and response latencies associated with queries made against the large multidimensional dataset. The dynamic aggregator defines pre-aggregated datasets based on the types of queries received from users and calculates a respective benefit score for each pre-aggregated dataset. The benefit score of each pre-aggregated dataset can be based on the recorded latencies and query count for the pre-aggregated dataset. The dynamic aggregator can decide whether to generate and/or maintain particular pre-aggregated datasets based on the current values of the benefit scores associated with the particular pre-aggregated datasets.
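The scoring step might be sketched as follows. The abstract says only that the benefit score can be based on recorded latencies and query count; the formula used here (query count multiplied by mean latency), the `CandidateDataset` class, and the threshold rule are all assumptions made for illustration, not the disclosed method.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CandidateDataset:
    """A pre-aggregated dataset defined for one query type (hypothetical structure)."""
    query_type: str
    # Latencies observed when this query type ran against the large dataset.
    latencies_ms: List[float] = field(default_factory=list)

    def record_query(self, latency_ms: float) -> None:
        self.latencies_ms.append(latency_ms)

    def benefit_score(self) -> float:
        # Assumed formula: query count x mean latency, which is the total
        # time spent answering this query type from the unoptimized dataset.
        if not self.latencies_ms:
            return 0.0
        count = len(self.latencies_ms)
        mean_latency = sum(self.latencies_ms) / count
        return count * mean_latency


def datasets_to_maintain(candidates, threshold):
    """Return the query types whose current benefit score meets the threshold."""
    return [c.query_type for c in candidates if c.benefit_score() >= threshold]
```

Because the score is recomputed from the current log, a dataset that stops receiving slow or frequent queries would drop below the threshold and could be discarded on the next evaluation.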
Abstract:
Systems and methods for retroactively performing dimension widening on a data element are provided. A described method includes receiving a set of data elements from a database and a set of dimension widening rules. The method further includes identifying one or more of the data elements to which any of the dimension widening rules are applicable. The method further includes processing the identified data elements by applying the applicable dimension widening rules to the identified data elements. A dimension widening delta is generated for each data element, and the dimension widening deltas may be stored. The method further includes merging one or more of the data elements with one or more of the dimension widening deltas in response to a query.
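A minimal sketch of the delta-and-merge flow described above. It assumes data elements are dictionaries keyed by an element ID and that each rule is a pair of hypothetical callables, `(applies, widen)`; neither representation comes from the abstract.

```python
def compute_deltas(elements, rules):
    """Return {element_id: delta} for elements that at least one rule applies to.

    A delta holds only the dimensions added by widening, so the stored
    elements themselves are never rewritten.
    """
    deltas = {}
    for element_id, element in elements.items():
        delta = {}
        for applies, widen in rules:
            if applies(element):
                delta.update(widen(element))
        if delta:
            deltas[element_id] = delta
    return deltas


def query(element_id, elements, deltas):
    """Merge a stored element with its delta, if any, at query time."""
    return {**elements[element_id], **deltas.get(element_id, {})}
```

Keeping the deltas separate from the original elements is what makes the widening retroactive in this sketch: rules can be added later and the deltas recomputed without touching the stored data.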
Abstract:
The subject technology provides a session associated with web site or application traffic data. An analytics tracking data communication is received. The subject technology determines analytics tracking data from the received analytics tracking data communication in which the analytics tracking data includes data for tracking an activity associated with a web site or application and includes visitor information. A new session associated with the analytics tracking data is created if it is determined that the activity is associated with an initial visit of a unique visitor corresponding to the visitor information. The subject technology determines campaign information for populating the new session associated with the analytics tracking data. The analytics tracking data including the determined campaign information is then stored.
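The session-creation logic could look roughly like this. The hit fields (`visitor_id`, `activity`, `campaign`) and the in-memory `sessions` dictionary are hypothetical; the abstract does not specify a data format or how campaign information is determined.

```python
def process_hit(hit, sessions):
    """Attach a tracking hit to its visitor's session, creating one on first visit.

    `sessions` maps visitor ID to a session dict (assumed storage model).
    """
    visitor_id = hit["visitor_id"]
    if visitor_id not in sessions:
        # Initial visit of a unique visitor: create a new session and
        # populate it with whatever campaign information the hit carries.
        sessions[visitor_id] = {"campaign": hit.get("campaign"), "hits": []}
    session = sessions[visitor_id]
    session["hits"].append(hit["activity"])
    return session
```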
Abstract:
The present disclosure is directed generally to systems and methods for the server-side matching of web analytics and content viewing. According to the methods and systems disclosed herein, a first identifier is delivered to a client device when the client device accesses a first website. If the client device later accesses a second website, the system can process the first identifier to determine whether the client device previously accessed the first website.
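As an illustration only, the matching step might be sketched like this. The `MatchServer` class, its method names, and the use of a random UUID as the identifier are assumptions; the abstract does not say how the identifier is generated, delivered, or stored.

```python
import uuid


class MatchServer:
    """Server-side record of which identifier was issued on which site (hypothetical)."""

    def __init__(self):
        self._issued = {}  # identifier -> site on which it was delivered

    def on_first_site_access(self, site):
        """Deliver a fresh identifier to a client accessing the first website."""
        identifier = uuid.uuid4().hex
        self._issued[identifier] = site
        return identifier

    def previously_accessed(self, identifier):
        """Return the site where this identifier was issued, or None if unknown."""
        return self._issued.get(identifier)
```

When the client presents its identifier on the second website, a non-`None` result from `previously_accessed` indicates a prior visit to the first website.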
Abstract:
A method and system dynamically create aggregate tables. An aggregate table manager is instantiated that receives a plurality of aggregate table definitions, and generates aggregate tables based on the received aggregate table definitions. The aggregate table manager creates a view corresponding to each of the aggregate tables. The aggregate table manager creates and updates data entries in the aggregate tables based on their corresponding aggregate table definitions and data extracted from web site log files.
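A small sketch of such a manager using Python's built-in `sqlite3` module. The definition format (aggregate table name mapped to the log field it counts by) and the `key`/`hits` schema are assumptions for illustration, and the table names are interpolated into SQL, so the definitions are assumed to come from a trusted source.

```python
import sqlite3


def build_aggregates(conn, definitions, log_rows):
    """Create an aggregate table and a matching view per definition, then fill it.

    `definitions` maps a table name to the log field to count by (assumed format).
    `log_rows` is an iterable of dicts parsed from web site log files.
    """
    log_rows = list(log_rows)
    for name, column in definitions.items():
        conn.execute(
            f'CREATE TABLE IF NOT EXISTS "{name}" '
            "(key TEXT PRIMARY KEY, hits INTEGER NOT NULL)"
        )
        conn.execute(
            f'CREATE VIEW IF NOT EXISTS "{name}_view" AS '
            f'SELECT key, hits FROM "{name}"'
        )
        for row in log_rows:
            value = row[column]
            # Create the entry if it is new, then update its count.
            conn.execute(f'INSERT OR IGNORE INTO "{name}" (key, hits) VALUES (?, 0)', (value,))
            conn.execute(f'UPDATE "{name}" SET hits = hits + 1 WHERE key = ?', (value,))
    conn.commit()
```

Reading through the view rather than the table lets the underlying aggregate table be rebuilt or redefined without changing the queries that consume it.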
Abstract:
The subject technology provides configurations for processing late and/or out-of-order analytics tracking data communications. An analytics tracking data communication is received. The subject technology determines a timestamp included in the analytics tracking data from the received analytics tracking data communication. The subject technology determines a session based on the determined new state changing information. The subject technology determines other analytics tracking data included in the analytics tracking data communication to associate with the determined new state changing information in the determined session if new state changing information is included in the analytics tracking data from the received analytics tracking data communication. The analytics tracking data and the other analytics tracking data associated with the new campaign information are then stored.
Abstract:
Systems and methods for performing dimension widening on a data element are provided. A described method includes receiving a set of dimension widening rules, and receiving a data element having one or more dimensions. The method further includes determining if any of the dimension widening rules are applicable to the data element. The data element may be stored in either a first database or second database based on a result of the determination. Data elements stored in the first database may be processed by applying the dimension widening rules to the data elements.
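The routing decision can be sketched as below. The two databases are modeled as plain lists and each rule as a hypothetical `(applies, widen)` pair of callables. Sending rule-matching elements to the first database is an assumption, since the abstract says only that placement depends on the result of the determination.

```python
def route(element, rules, first_db, second_db):
    """Store the element in first_db if any rule applies to it, else in second_db."""
    if any(applies(element) for applies, _ in rules):
        first_db.append(element)
    else:
        second_db.append(element)


def widen_first_db(first_db, rules):
    """Apply every applicable rule to the elements held in the first database."""
    for element in first_db:
        for applies, widen in rules:
            if applies(element):
                element.update(widen(element))
```

Separating the elements at ingestion means the widening pass only has to scan the first database instead of every stored element.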