摘要:
Systems and methods are presented that may involve receiving a causal fact dataset including facts relating to items perceived to cause actions, wherein the causal fact dataset includes a data attribute that is associated with a causal fact datum. It may also involve pre-aggregating a plurality of the combinations of a plurality of causal fact data and associated data attributes in a causal bitmap. It may also involve selecting a subset of the pre-aggregated combinations based on suitability of a combination for the analytic purpose. It may also involve storing the subset of pre-aggregated combinations to facilitate querying of the subset.
摘要:
In embodiments of the present invention improved capabilities are described for identifying a first classification scheme associated with product attributes of a first grouping of products, identifying a second classification scheme associated with product attributes of a second grouping of products, and receiving a record of data relating to an item, the classification of which is uncertain. It may also involve receiving a dictionary of attributes associated with products and assigning the item to at least one of the classification schemes based on probabilistic matching among the attributes in the classification schemes, the attributes in the dictionary of attributes and the known attributes of the item.
摘要:
In embodiments of the present invention improved capabilities are described for identifying a classification scheme associated with product attributes of a grouping of products of an entity, receiving a record of data relating to an item of a competitor to the entity, the classification of which is uncertain, receiving a dictionary of attributes associated with products, and assigning a product code to the item, based on probabilistic matching among the attributes in the classification scheme, the attributes in the dictionary of attributes and at least one known attribute of the item.
摘要:
Using a computer, a database comprising a field is identified. A query relating to the field is identified. Prior to processing the query, the field is dynamically altered to conform to a desired bit size. The query is processed. The results of the query are returned.
摘要:
In embodiments of the present invention, a method is described for reducing bias by data fusion of a household panel data and a loyalty card data. In embodiments, a method is provided for receiving a consumer panel dataset in a data fusion facility, receiving a consumer point-of-sale dataset in a data fusion facility, receiving a dimension dataset in a data fusion facility, fusing the datasets received in the data fusion facility into a new panel dataset based at least in part on an encryption key, estimating a consumer behavior using a first model based on the consumer panel dataset, estimating a consumer behavior using a second model based only on those consumers present in both the consumer panel dataset and the consumer point-of-sale dataset, and refining the first model based at least on the results of the second model.
摘要:
In embodiments of the present invention, a method is described for reducing bias by data fusion of a household panel data and a loyalty card data. In embodiments, a method is provided for receiving a consumer panel dataset in a data fusion facility, receiving a consumer point-of-sale dataset in a data fusion facility, receiving a dimension dataset in a data fusion facility, fusing the datasets received in the data fusion facility into a new panel dataset based at least in part on an encryption key, estimating a consumer behavior using a first model based on the consumer panel dataset, estimating a consumer behavior using a second model based only on those consumers present in both the consumer panel dataset and the consumer point-of-sale dataset, and refining the first model based at least on the results of the second model.
摘要:
Systems and methods are presented that may involve storing a core information matrix in a partition within a partitioned database, wherein the partition is associated with a data characteristic. It may also involve associating a master processing node with a plurality of slave nodes, wherein each of the plurality of slave nodes is associated with a partition of the partitioned database. It may also involve submitting a query to the master processing node, wherein the query relates to a projection. It may also involve assigning analytic processing to at least one of the plurality of slave nodes by the master processing node, wherein the assignment is based at least in part on the partition association. It may also involve processing the projection-related query by the assigned slave node, wherein the analysis produces a partial projection result at the assigned slave node.
摘要:
Systems and methods are presented that may involve specifying an availability condition associated with a data hierarchy in a database. It may also involve storing the availability condition in a matrix and using the matrix to determine access to data in the data hierarchy. In embodiments, the data hierarchy may be a flexible data hierarchy wherein a selected dimension of data within the hierarchy may be held temporarily fixed while flexibly accessing other dimensions of the data. In embodiments, the process may further involve specifying an availability condition, wherein the specification of the availability condition does not require modification of the datum or restatement of the database.
摘要:
The present invention provides a method for updating data sources. The method may include identifying a plurality of data sources, identifying a plurality of overlapping attribute segments to use for comparing the data sources, calculating a factor as a function of each of the plurality of overlapping attribute segments, and using the factors to update a first group of values in the second data source to reduce bias. Further, at least a first data source is more accurate than a second data source.
摘要:
The methods and systems disclosed herein include an analytic platform, with a customized retailer portal application, that may be used to perform data projection and release methodologies in order to create an integrated, flexible, actionable view of data such as data relating to consumers, consumer behavior, commodity sales, and other commercial activities like those pertaining to relationships between consumers and stores.