Abstract:
A system and method for enabling a user to use a user computer to search the user's personal content items hosted by an online content management service, the online content management service having an indexer computer configured to generate a base index of the user's content items hosted by the service, a base index server computer configured to transmit the generated base index over a network to the user computer, an index mutation generator computer configured to generate index mutations for subsequent changes to user's hosted content items, and an index mutation sever computer configured to transmit the generated index mutations over the network to the user computer. In one aspect, the system and method enable searches of the user's hosted content items and changes thereto to be conducted at the user computer using an index local to the user computer constructed based on the base index and the index mutations received by the user computer.
Abstract:
An offline semantic processor of a resource-constrained voice-enabled device such as a mobile device utilizes an offline grammar model with reduced resource requirements to parse voice-based queries received by the device. The offline grammar model may be generated from a larger and more comprehensive grammar model used by an online voice-based query processor, and the generation of the offline grammar model may be based upon query usage data collected from one or more users to enable a subset of more popular voice-based queries from the online grammar model to be incorporated into the offline grammar model. In addition, such a device may collect query usage data and upload such data to an online service to enable an updated offline grammar model to be generated and downloaded back to the device and thereby enable a dynamic update of the offline grammar model to be performed.
Abstract:
A method and apparatus for acquiring people flow information, and a method and apparatus for processing people flow information are provided. The method for acquiring people flow information comprises: detecting entering and leaving of a target scene by people; determining specific attribute data of people flow entering and leaving the target scene, wherein, the specific attribute data includes at least a target number of people, among the detected people, which enters and leaves the target scene at the same time; and performing an operation of acquiring people flow information about the target scene, wherein, the operation of acquiring people flow information about the target scene includes at least: selecting, from a plurality of first-type counting modules, a first-type counting module corresponding to a predetermined number range of people within which the target number falls, and increasing a count value of the first-type counting module by 1, wherein, each first-type counting module corresponds to one of predetermined number ranges of people. The purpose of acquiring people flow information having a higher value in terms of business analysis, deep data mining and so on can be achieved by the solutions above.
Abstract:
A method includes receiving a first set of de-identified records for individuals from a first type of database for a first set of entities. The first type of database does not include longitudinal information that links the first set of de-identified records across the first set of entities. The method includes receiving a second set of de-identified records for a single individual from a second type of database for a second set of entities. The second type of database includes longitudinal information that links the second set of de-identified records across the second set of entities including over time. The method includes integrating the first type of databases and the second type of databases, which matches the individuals and the single individual. The method includes adding longitudinal information to the first type of database for the individuals based on the longitudinal information of the second type of database.
Abstract:
The invention relates to a method for finding a position of a data pattern in reference data. For this a reference index data structure comprising a reference data structure, a sorted index and a jump table is generated. The sorted index comprises for each position in the reference data structure an entry. Each entry comprises a position field which value refers to an associated position in the reference data structure. By means of the position a reference data pattern corresponding to said position could be reconstructed from the reference data structure. The entries of the sorted index are sorted according to the reference data pattern associated with the value in the position field. A search is performed through the sorted index by reconstructing a reference data pattern from the reference data structure and comparing it with the data pattern to be matched.
Abstract:
A retrieval system retrieves content data stored in a database using an index. The retrieval system includes a first registration unit configured to register a set definition including an index object item indicating a data item serving as the index and an acquisition condition for acquiring the content data from the database; an acquisition unit configured to acquire, when the set definition is registered by the first registration unit, content data included in the set definition, which satisfy the acquisition condition, from the database; and a generation unit configured to extract the index object item included in the set definition, to generate the index, in the content data acquired by the acquisition unit.
Abstract:
Embodiments of the present invention provide a metadata querying method and apparatus. The method includes: sampling at least one piece of first metadata from to-be-searched-for metadata; using at least a part of feature values in each piece of the sampled first metadata as an index, and searching a sparse index table preset in a memory for a corresponding container identifier, where the sparse index table records container identifiers, and a location index of each container identifier in the index table is at least a part of feature values of metadata stored in a container corresponding to the container identifier; selecting, according to the number of times that a same container identifier is found, a container corresponding to a container identifier that meets a set condition; loading metadata in the selected container into a metadata cache; and searching the metadata cache for a data block that is the same as the to-be-searched-for metadata. In the embodiments of the present invention, querying performance can be improved and occupied memory space can be reduced.
Abstract:
Examples of the present disclosure describe systems and methods for enhancing the privacy of a personal search index. In some aspects, a personal cleartext document may be used to generate an encrypted document digest and an encrypted document on a first device. A second device may decrypt the document digest, build a personal search index based on the decrypted document digest, and store the encrypted document in a data store. The first device may subsequently receive a cleartext search query that is used to query the personal search index on the second device for encrypted documents.
Abstract:
A method for updating a database for anonymous visitor tracking includes receiving (402) at least one cookie from a visitor to a first domain name. Based on the at least one cookie, the method determines (404) whether the visitor has an associated unique identity for the first domain name. When the visitor does not have an associated unique identity, the method includes determining (406), for each affiliated domain name in a plurality of affiliated domain names, whether the visitor has an associated unique identity for the affiliated domain name. When the visitor does not have an associated unique identity for the affiliated domain name, the method includes generating (408) a unique identity associated with the visitor and storing the unique identity with the first domain name in an associated identity database. A record associated with the unique identity is updated (410) with a time at which the visitor navigated to the first domain name. An updated first-party cookie, including the unique identity, is generated and transmitted for storage on a device associated with the visitor.
Abstract:
Embodiments of the present application provide a position information providing method and device. The method includes: receiving, by a server, a keyword input by a first user; determining a target user characteristic according to the keyword; determining a first region, the first region including multiple second regions; determining, according to position information of users in the first region, the quantities of second users having the target user characteristic in respective second regions; determining candidate regions from the multiple second regions according to the quantities of the second users; and providing the candidate regions for the first user. The method provided in the embodiments of the present application can improve the accuracy of position information provided for a user.