摘要:
The present invention discloses a data storage method and apparatus. The method includes: obtaining a primary index of a data object, where a row identifier of the primary index includes at least one field; determining a row identifier of a secondary index of the data object according to the row identifier of the primary index, where the row identifier of the secondary index includes an index field and a field other than the index field in the at least one field of the row identifier of the primary index, and the index field is different from the first field in the at least one field of the row identifier of the primary index; generating the secondary index of the data object according to the row identifier of the secondary index; and storing the data object according to the primary index and the secondary index. With the data storage method and apparatus in embodiments of the present invention, storage space can be saved.
摘要:
An information processing device (100) processes a data structure that represents a set of points included in a multidimensional space, and includes: a search unit (10) that, when a particular multidimensional region is specified as a query region, specifies regions that are expressed as portions shared between regions starting with feature value prefixes and regions starting with coordinate prefixes with respect to respective dimensions, and that are included in the query region, and outputs selected sections that are sections included in a data structure corresponding to the specified regions, the feature value prefixes being prefixes obtained from bit representations of feature values of the points, and the coordinate prefixes being prefixes obtained from bit representations of coordinates of the points with respect to a particular dimension; and an aggregation unit (20) that calculates information regarding a set of points included in the query region, by using the selected sections.
摘要:
An index generating device (100) comprises: an index generating unit (101) which uses an interdata similarity degree which is computed by a similarity computation unit (105) to generate an index which includes a lower order node, and a higher order node which includes an entry having a link relation with the lower order node and indicating representative data having a similarity degree with data indicated by an entry in the lower order node greater than or equal to a similarity threshold value appended to the link relation; and a threshold setting unit (102) which sets the similarity threshold to a value which increases the further down the hierarchy.
摘要:
Embodiments of the present invention disclose a data indexing method and apparatus. According to the data indexing method and apparatus provided by the embodiments of the present invention, N one-dimensional indexes that correspond to N dimensions and are independent of each other are obtained according to the N dimensions, and it is determined whether address records included in the N one-dimensional indexes that correspond to the dimensions and are independent of each other have an intersection set, so as to obtain data pointed to by an address record corresponding to the intersection set, where the data is used as target indexing data, thereby solving the problem that a one-dimensional indexing technology cannot meet requirements for multi-dimensional indexing combined query and multi-dimensional analysis; in addition, by determining count value of tag number flag bits corresponding to the address records included in the N one-dimensional indexes, a speed requirement for the multi-dimensional analysis is easily and conveniently met, indexing complexity is reduced, and performance of data accurate data indexing is improved.
摘要:
One embodiment of the present invention provides a recommendation system. During operation, the system receives context information associated with the user, updates a plurality of user models based on the received context information, and identifies at least one spatial data structure that stores a plurality of items. A respective item is stored within the spatial data structure based on a vector value associated with the item. The system then queries the spatial data structure to obtain a first set of recommendable items that have vector values within a predetermined range, calculates a score for each item within the set of recommendable items based on the plurality of the user models and a characterization vector associated with each item, ranks the items within the set of recommendable items based on calculated scores, and recommends one or more top-ranked items to the user.
摘要:
An approach is provided to determine one or more dynamic ordered tree structures and transition tree structures (e.g., based on one or more transitions of a device) to facilitate querying and/or accessing data stores. An apparatus and method determines to generate at least one index structure, determines to associate index objects of the generated index structure with one or more data objects of at least one data store, determines to generate at least one transition index structure based on the at least one generated index structure, and determines to associate the transition index structure with index objects corresponding to one or more data objects of at least one data store based on a transition of a device. Also, the method and apparatus determines to generate at least one query, and determines to generate at least one transition index structure where a current index structure to resolve the query is absent.
摘要:
A session table includes one or more records, where each record represents a session. Session record information is stored in various fields, such as key fields, value fields, and timestamp fields. Session information is described as keys and values in order to support query/lookup operations. A session table is associated with a filter, which describes a set of keys that can be used for records in that table. A session table is populated using data contained in security information/events. Rules are created to identify events related to session information, extract the session information, and use the session information to modify a session table. A session table is partitioned so that the number of records in each session table partition is decreased. A session table is processed periodically so that active sessions are moved to the current partition.
摘要:
Described is a technology by which uploaded GPS data is indexed according to spatio-temporal relationships to facilitate efficient insertion and retrieval. The indexes may be converted to significantly smaller-sized data structures when new updates to that structure are not likely. GPS data is processed into a track of spatially-partitioned segments such that each segment has a cell. Each cell has an associated temporal index (a compressed start-end tree), into which data for that cell's segments are inserted. The temporal index may include an end time index that relates each segment's end time to a matching start time index. Given query input comprising a spatial predicate and a temporal predicate, tracks may be searched for by determining which spatial candidate cells may contain matching results. For each candidate cell, the search accesses the cell's associated temporal index to find any track or tracks that correspond to the temporal predicate.
摘要:
The search method includes: receiving a search request including plural search conditions; separating the plural search conditions into search conditions for individual databases; storing, for each database, the search condition and the database identifier in a condition record; searching a search result management table by the database identifier and the search condition; determining anterior search databases; searching each of the anterior search databases by the search condition to acquire a first search result; determining common entity identifiers included in common in the first search results; and searching each of posterior search databases by the common entity identifiers and the search condition.