Abstract:
Systems and methods to enhance enhancing a service for a user. The system collecting documents viewed or words posted by a user. Determining a list of topic words for the user based on words in the documents viewed or words posted. Identifying a list of topic words associated with the user, based on words in the one or more documents and the words posted by the user. Assigning each of the topic words to at least one of a plurality of topics based on correlations between the topic words of the user and topic words from other users. Estimating a set of interest topics for the user based on the topics assigned to the topic words of the user.
Abstract:
The present invention relates to systems, methods, and user interfaces for browsing a collection of content items saved by a user or by one or more buddies associated with a given user. The method of the present invention comprises saving one or more content items and one or more associated keywords as specified by a user. An interface is generated that displays the one or more saved content items and the one or more associated keywords, as well as the one or more buddies associated with a given user. A user indication of the selection of a given keyword or the selection of a given buddy by the user is received. The one or more displayed content items are filtered according to the selected keyword, buddy, or combination of selected keyword and buddy.
Abstract:
A method for restoring compressed data. A difference file and a reference file corresponding to a compressed file are accessed. The difference file is partially decompressed into a partially decompressed difference file in an intermediate representation. The reference file is partially decompressed into a partially decompressed reference file in the intermediate representation. A partially decompressed compressed file is generated based on said partially decompressed difference file and said partially decompressed reference file.
Abstract:
Techniques are described for identifying one or more “interest twins” of a user. An interest twin of a user in another user that has demonstrated interests in items that are the same as or similar to the items in which the user has demonstrated an interest. Various techniques are described for reducing the overhead in interest twin determination operations. Once the interest twins for a user have been identified, the knowledge of the interest twins may be used in a variety of ways to enhance to experience of the user. For example, a mechanism may be provided which allows the user to see a list of items in which the user's interest twins have indicated an interest.
Abstract:
Embodiments are directed towards a Modified Sequitur algorithm (MSA) using pipelining and indexed arrays to identify trending topics within a plurality of documents having user generated content (UGC). The documents are parallelized and distributed across a plurality of network devices, which place at least some of the received documents into a buffer for which the MSA may then be applied to the documents within the buffer to identify n-grams or phrases within the documents' contents. The identified phrases are further analyzed to remove extraneous co-occurrences of phrases, and/or words based on a part of speech analysis. A weighting of the remaining phrases is used to identify trending topic phrases. Links to content in the plurality of UGC documents that is associated with the trending topic phrases may then be displayed to a client device.
Abstract:
A query-centric system and process for distributing reverse indices for a distributed content system. Relevance ranking techniques in organizing distributed system indices. Query-centric configuration subprocesses (1) analyze query data, partitioning terms for reverse index server(s) (RIS), (2) distribute each partitioned data set by generally localizing search terms for the RIS that have some query-centric correlation, and (3) generate and maintain a map for the partitioned reverse index system terms by mapping the terms for the reverse index to a plurality of different index server nodes. Indexing subprocess element builds distributed reverse indices from content host indices. Routines of the query execution use the map derived in the configuration to more efficiently return more relevant search results to the searcher.
Abstract:
A system comprises a plurality of computing resource. Attribute values for the computing resources are stored in a peer-to-peer network in the system. A method for storing an attribute value comprises receiving an attribute value for a resource in the system, identifying a node in the peer-to-peer network for storing the attribute value, wherein the node stores a range of attribute values encompassing the received attribute value, and storing the attribute value at the identified node.
Abstract:
A method for compressing compressed data. Compressed files are accessed. At least a portion of the compressed files are partially decompressed into partially decompressed files in an intermediate representation. The partially decompressed files are compared. The partially decompressed files are compressed.
Abstract:
An annotation method for annotating content includes displaying a set of suggested keywords on an editing interface page configured to receive one or more annotations for the content. A request is received via the editing interface page to annotate the content with at least one keyword from the set of suggested keywords. Association information is generated that associates the at least one keyword with the content.
Abstract:
A peer-to-peer system includes a plurality of nodes. The plurality of nodes includes at least an initial node and a neighboring node. The initial node is operable to determine whether to place an object at either the initial node or another node in the system based on a comparison of storage utilizations for the initial node and the neighboring node.