Abstract:
Systems and methods for eliminating duplicate events are described. In one embodiment, an event is captured, wherein the event comprises a user interaction with an article on a client device and it is determined whether the event is a duplicate of a stored event.
Abstract:
Systems and methods for processing media files are described. In one embodiment, one or more events are captured having associated event data and associated with a client device, wherein each event is associated with an article and at least one of the articles is a media file, wherein at least one of the events is captured in real time upon the occurrence of the event, at least some of the event data and articles associated with the events are indexed and stored, a search query is received, and the at least one media file is determined as relevant to the search query.
Abstract:
Methods and systems for correlating connections between users and links between articles to identify search and/or ad spamming are disclosed. Social networks can be used to identify connections between users for correlation with links between articles, which can be identified through searches of article contents and/or back tracing accesses to articles. One disclosed method comprises identifying first associations between a plurality of users in a network of associated users; identifying second associations between one or more users and one or more articles; identifying third associations between at least some of the articles or between some of the users and access to some of the articles; and determining at least one of the third associations is correlated with one or more of the first associations.
Abstract:
Systems and methods for information capture from articles are described. In one embodiment, a plurality of keystrokes associated with an application is received, each keystroke is processed to determine an associated action forming a plurality of associated actions, and an event is determined based at least in part on the plurality of associated actions.
Abstract:
Systems and methods for the unification of search results are described. In one described system, a program, such as a search engine, executing on a client device receives a search query. The search engine executes the search on a local index and receives a first result set, which is relevant to the query entered by the user. The search query is also executed against a global index. The search engine receives a second result set from the global index. Once the search engine has received both result sets, the search engine combines the result sets to create a combined result set. The search engine may cause the combined result set to be displayed or otherwise output to a user.
Abstract:
The method of the present invention utilizes machine-learning techniques, particularly Support Vector Machines in combination with a neural network, to process a unique machine-learning enabled representation of the audio bitstream. Using this method, a classifying machine is able to autonomously detect characteristics of a piece of music, such as the artist or genre, and classify it accordingly. The method includes transforming digital time-domain representation of music into a frequency-domain representation, then dividing that frequency data into time slices, and compressing it into frequency bands to form multiple learning representations of each song. The learning representations that result are processed by a group of Support Vector Machines, then by a neural network, both previously trained to distinguish among a given set of characteristics, to determine the classification.
Abstract:
An autonomous citation indexing system which can be used as an assistant agent automates and enhances the task of finding publications in electronic form, including publications located on the world wide web. The system parses citations from papers and identifies citations to the same paper that may differ in syntax. The system also extracts and provides the context of citations to a given paper, allowing a researcher to determine what is published in other papers about a given paper. Common citations and word or string vector distance similarity are used to find related articles in a search.
Abstract:
Methods and systems are provided for selectively storing event data. In one described method, an event associated with an article is captured. The captured event can comprise event data. The method further comprises evaluating the event for satisfaction of a criterion. If the event satisfies the criterion, it is indexed.
Abstract:
Systems and methods that improve client-side searching are described. In one aspect, a system and method for identifying an article, and determining a ranking score for the article based at least in part on duration data, access data, URL data, or trajectory data associated with the article is described.
Abstract:
Methods and systems for correlating connections between users and links between articles to identify search and/or ad spamming are disclosed. Social networks can be used to identify connections between users for correlation with links between articles, which can be identified through searches of article contents and/or back tracing accesses to articles. One disclosed method comprises identifying first associations between a plurality of users in a network of associated users; identifying second associations between one or more users and one or more articles; identifying third associations between at least some of the articles or between some of the users and access to some of the articles; and determining at least one of the third associations is correlated with one or more of the first associations.