Abstract:
The invention provides a method of interactively crawling data records on a web page. Users may select various data records of interest on a web page to generate templates to search for similar data items on the same web page or on different web pages. A tree matching algorithm may be used to compare and extract data matching the generated template.
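The abstract does not name the tree matching algorithm. As an illustration only, the following sketch uses Simple Tree Matching, a common dynamic-programming technique for comparing ordered trees such as DOM fragments. The `Node` class, the `similarity` score, and the example tags are assumptions made for this sketch, not details from the invention.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical stand-in for a DOM element: a tag plus ordered children."""
    tag: str
    children: list["Node"] = field(default_factory=list)


def simple_tree_matching(a: Node, b: Node) -> int:
    """Return the number of node pairs in a maximum order-preserving matching."""
    if a.tag != b.tag:
        return 0  # roots differ, so nothing below them can be matched
    m, n = len(a.children), len(b.children)
    # dp[i][j]: best matching between the first i children of a
    # and the first j children of b.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            dp[i][j] = max(
                dp[i - 1][j],
                dp[i][j - 1],
                dp[i - 1][j - 1]
                + simple_tree_matching(a.children[i - 1], b.children[j - 1]),
            )
    return dp[m][n] + 1  # +1 counts the matched roots


def size(node: Node) -> int:
    """Count the nodes in a tree."""
    return 1 + sum(size(child) for child in node.children)


def similarity(template: Node, candidate: Node) -> float:
    """Normalized score in [0, 1]; 1.0 means the trees are identical in shape."""
    matched = simple_tree_matching(template, candidate)
    return 2 * matched / (size(template) + size(candidate))


# A template built from a user-selected record, and a candidate record.
template = Node("li", [Node("a"), Node("span")])
candidate = Node("li", [Node("a"), Node("img")])
score = similarity(template, candidate)
```

A crawler built along these lines might accept a candidate subtree as a matching data record when `score` exceeds a chosen threshold; the threshold value would be a tuning decision.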
Abstract:
A method is provided to ubiquitously promote products or services provided by a plurality of advertisers across a plurality of content-based applications. The method may include receiving data from the plurality of advertisers describing the products or services provided by the plurality of advertisers. The received data is stored in an advertisement database where a user may subsequently query the advertisement database to locate data promoting a product or service. Additionally, terms and payment data are extracted from the received advertisement data to efficiently promote the products or services provided by the advertiser across the plurality of content-based applications.
Abstract:
A peer-to-peer advertisement platform is provided to ubiquitously promote products or services supplied by advertisers across content-based applications executing on nodes in a peer-to-peer network. The platform may include a registration component to register nodes in the platform, an advertisement submission component to receive advertisement data from the advertisers, and a distribution component to distribute the advertisement data to the registered nodes. The platform also includes a money sharing component to reward each node based on a contribution level assigned to that node. Accordingly, the platform stores the advertisement data locally at the registered nodes and shares a portion of the revenue generated from the advertisement data with those nodes.
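The abstract does not say how the money sharing component turns contribution levels into rewards. A minimal sketch, assuming a fixed fraction of revenue is split among nodes in proportion to their contribution levels; the function name, the `node_share` parameter, and the proportional rule are all assumptions for illustration.

```python
def share_revenue(revenue, contribution_levels, node_share=0.3):
    """Split a fraction of ad revenue among nodes by contribution level.

    revenue: total revenue generated from the advertisement data.
    contribution_levels: mapping of node id -> non-negative contribution level.
    node_share: assumed fraction of revenue set aside for the nodes.
    """
    pool = revenue * node_share
    total = sum(contribution_levels.values())
    if total == 0:
        # No node has contributed, so nothing is paid out.
        return {node: 0.0 for node in contribution_levels}
    return {
        node: pool * level / total
        for node, level in contribution_levels.items()
    }


payouts = share_revenue(100.0, {"node-a": 1, "node-b": 3}, node_share=0.5)
```

Here `node-b` has three times the contribution level of `node-a`, so it receives three times the payout from the shared pool.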
Abstract:
A system and method are disclosed for providing documents related to a search request. The search request may include a search query of one or more keywords, or it may be a demographic search query including one or more demographic attributes. An index can be created that contains data crawled from publishers' websites, demographic information of registered users, and the search history of those users. Once a search request is received, it can be compared to the information stored in the index, and one or more documents related to the request can be provided.
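One way to picture such an index is as two inverted maps: one from keywords to documents, and one from demographic attribute values to documents that users with those attributes reached through past searches. The sketch below is an assumption about the data layout, not the disclosed design; the class name `SearchIndex`, its methods, and the whitespace tokenization are hypothetical.

```python
from collections import defaultdict


class SearchIndex:
    """Toy index covering keyword queries and demographic queries."""

    def __init__(self):
        self.keyword_to_docs = defaultdict(set)
        self.demographic_to_docs = defaultdict(set)

    def add_document(self, doc_id, text):
        """Index crawled text by lowercase whitespace-separated words."""
        for word in text.lower().split():
            self.keyword_to_docs[word].add(doc_id)

    def record_history(self, demographics, doc_id):
        """Link a registered user's demographic attributes to a document
        taken from that user's search history."""
        for attribute, value in demographics.items():
            self.demographic_to_docs[(attribute, value)].add(doc_id)

    def search_keywords(self, query):
        """Return documents containing every keyword in the query."""
        words = query.lower().split()
        if not words:
            return set()
        return set.intersection(
            *(self.keyword_to_docs.get(word, set()) for word in words)
        )

    def search_demographics(self, attributes):
        """Return documents associated with every requested attribute value."""
        if not attributes:
            return set()
        return set.intersection(
            *(self.demographic_to_docs.get(item, set())
              for item in attributes.items())
        )


index = SearchIndex()
index.add_document("d1", "cheap flights to Paris")
index.add_document("d2", "Paris hotels")
index.record_history({"age": "18-24", "country": "FR"}, "d2")
by_keyword = index.search_keywords("paris hotels")
by_demographic = index.search_demographics({"country": "FR"})
```

A production index would also rank the results; this sketch only returns the matching set.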
Abstract:
An influential persons identification system and method are provided for identifying a set of influential persons (or influencers) in a social network (such as an online social network). The set is generated such that a message sent to the set will be propagated through the network with the greatest speed and coverage. A ranking of users is generated, and a pruning process is performed starting with the top-ranked user and working down the list. The top-ranked user is identified as an influencer, and then that user and each of the user's friends are deleted from the list of social network users. Next, the same process is performed for the second-ranked user, the third-ranked user, and so forth. The process terminates when the list of users of the social network is exhausted or the desired number of influencers in the set is reached.
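The pruning process described above can be sketched as a single pass over the ranked list. The abstract does not say how the ranking is produced, so the sketch takes it as an input; the function name, the `friends` mapping, and the `max_influencers` parameter are assumptions for illustration.

```python
def select_influencers(ranked_users, friends, max_influencers=None):
    """Pick influencers by walking the ranking and pruning friends.

    ranked_users: user ids ordered from highest to lowest rank.
    friends: mapping of user id -> set of friend ids.
    max_influencers: optional cap on the size of the returned set.
    """
    remaining = set(ranked_users)  # users not yet selected or pruned
    influencers = []
    for user in ranked_users:
        if not remaining:
            break  # the user list is exhausted
        if max_influencers is not None and len(influencers) >= max_influencers:
            break  # the desired number of influencers is reached
        if user not in remaining:
            continue  # already pruned as a friend of an earlier influencer
        influencers.append(user)
        remaining.discard(user)
        remaining -= friends.get(user, set())
    return influencers


ranking = ["a", "b", "c", "d", "e"]
friendships = {
    "a": {"b", "c"},
    "b": {"a"},
    "c": {"a", "d"},
    "d": {"c", "e"},
    "e": {"d"},
}
chosen = select_influencers(ranking, friendships)
```

In this example `a` is selected first, which prunes `b` and `c`; `d` is the next user still on the list, and selecting it prunes `e`, so `chosen` is `["a", "d"]`.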
Abstract:
Computer-readable media having computer-executable instructions and apparatuses provide a keyphrase navigation map (KNM) for a document page. Keyphrases are extracted from the document page. Keyphrase clusters are subsequently formed by a measure of relevancy, and a salient keyphrase is determined for each cluster. A thumbnail is formed with tags corresponding to the salient keyphrases. A selected tag is expanded with associated keyphrases. An associated keyphrase may be further selected in order to facilitate the navigation of the document page. The displayed tags on the thumbnail are positioned in accordance with locations of associated keyphrases in the document page.
Abstract:
Embodiments of the invention relate to improvements to the support vector machine (SVM) classification model. When text data is significantly unbalanced (i.e., the positive and negative labeled data are disproportionate), the classification quality of standard SVM deteriorates. Embodiments of the invention are directed to a weighted proximal SVM (WPSVM) model that achieves substantially the same accuracy as the traditional SVM model while requiring significantly less computational time. A WPSVM model in accordance with embodiments of the invention may include a weight for each training error and a method for estimating the weights, which automatically solves the unbalanced data problem. Instead of solving the optimization problem via the KKT (Karush-Kuhn-Tucker) conditions and the Sherman-Morrison-Woodbury formula, embodiments of the invention use an iterative algorithm to solve an unconstrained optimization problem, which makes WPSVM suitable for classifying relatively high-dimensional data.
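The abstract gives neither the objective nor the iterative algorithm. As a rough sketch under stated assumptions, the code below minimizes one plausible weighted proximal objective, ½(‖w‖² + b²) + (ν/2) Σ δᵢ (yᵢ − (w·xᵢ + b))², using conjugate gradient as the iterative solver. The class-balanced weights δᵢ (one over the size of the example's class), the parameter `nu`, and all function names are assumptions, not the method claimed by the invention.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def train_wpsvm(X, y, nu=10.0, max_iter=100, tol=1e-12):
    """Fit (w, b) for labels in {+1, -1}; both classes must be present."""
    n_pos = sum(1 for label in y if label > 0)
    n_neg = len(y) - n_pos
    # Assumed weighting: each class carries the same total weight.
    delta = [1.0 / n_pos if label > 0 else 1.0 / n_neg for label in y]
    A = [list(row) + [1.0] for row in X]  # extra column of ones for the bias
    dim = len(A[0])

    def apply_h(v):
        # Computes (I + nu * A^T D A) v without forming the matrix.
        out = list(v)
        for row, d in zip(A, delta):
            scale = nu * d * dot(row, v)
            for j in range(dim):
                out[j] += scale * row[j]
        return out

    # Right-hand side nu * A^T D y of the optimality condition.
    rhs = [0.0] * dim
    for row, d, label in zip(A, delta, y):
        for j in range(dim):
            rhs[j] += nu * d * label * row[j]

    # Conjugate gradient on the symmetric positive definite system.
    beta = [0.0] * dim
    residual = rhs[:]
    direction = residual[:]
    rs_old = dot(residual, residual)
    for _ in range(max_iter):
        if rs_old < tol:
            break
        h_dir = apply_h(direction)
        step = rs_old / dot(direction, h_dir)
        beta = [b + step * p for b, p in zip(beta, direction)]
        residual = [r - step * hp for r, hp in zip(residual, h_dir)]
        rs_new = dot(residual, residual)
        direction = [r + (rs_new / rs_old) * p
                     for r, p in zip(residual, direction)]
        rs_old = rs_new
    return beta[:-1], beta[-1]


def predict(w, b, x):
    return 1 if dot(w, x) + b >= 0 else -1


# Unbalanced toy data: one positive example, four negative examples.
X = [[2.0, 2.0], [-1.0, -1.0], [-2.0, -1.0], [-1.0, -2.0], [-2.0, -2.0]]
y = [1, -1, -1, -1, -1]
w, b = train_wpsvm(X, y)
```

Because the solver only needs products with the data matrix, it never builds or inverts a dense matrix whose size grows with the feature dimension, which is the property the abstract associates with high-dimensional text data.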