摘要:
Methods and systems are provided that may be used to determine a probability of whether a visitor to a web document is likely to click on a web advertisement. An exemplary method may include detecting one or more features in a web document. One or more expert statistical models to which the web document belongs may be determined and associated weightings may be determined based, at least in part, on the one or more features detected. A click-through-rate probability for a web advertisement to be placed on the web document may be estimated based on the one or more expert statistical models.
摘要:
Methods for selecting one or more advertisements based on previously captured relevance data to serve to a client system requesting a primary webpage is provided. The client displays a referring webpage having a hyperlink to the primary webpage. Upon selection of the hyperlink, the client sends a request to a content server storing the primary webpage. The content server classifies the primary webpage for content and retrieves persistent relevance information, possibly including a referrer of the primary webpage comprising a URL address of the referring webpage, a listing of other recently visited webpages, a listing of any bid phrases from previously displayed advertisements, and a listing of recent click data. The content server sends the primary webpage to the client, which includes an advertisement server request. The transaction between the content server and the advertisement server includes persistence relevance information to select advertisements to serve to the client.
摘要:
The present invention is directed towards a method and system for characterizing web content based on capturing semantics of folksonomies relating to content entities of user generated content. The method and system includes determining a plurality of tags that describe a plurality of content entities and determining a co-occurrence of the tags. The method and system further includes generating weighted vectors based on the determined co-occurrence of tags and characterizing the content entity based on the weight vectors. Thereby, the characterization of the content entity may be used for any number of suitable purposes, including, by way of example, improving search results and associated advertising relevancy.
摘要:
The present invention is directed towards systems, methods and computer program products for providing query-based advertising content. According to one embodiment, a method for providing query-based advertising content comprises receiving a web query and generating an ad query associated with the web query, wherein the ad query is generated on the basis of one of a query to advertisement relationship, a query to feature vector relationship, or a rewritten query vector. An advertisement is selected on the basis of the generated ad query.
摘要:
A predictor for determining a degree of relevance between a query rewrite and a search query is provided. The predictor may receive a search query from a user via a terminal and identify a set of candidate query rewrites associated with the search query. The predictor may then extract a set of features from advertisements associated with the query rewrites and the search query and determine a degree of relevance between the advertisements and the search query based on a prediction model. The predictor may then determine the degree of relevance between the rewrites and the search query based on the determined degree of relevance between the advertisements and the search query.
摘要:
An improved system and method is provided for feature selection for text classification using subspace sampling. A text classifier generator may be provided for selecting a small set of features using subspace sampling from the corpus of training data to train a text classifier for using the small set of features for classification of texts. To select the small set of features, a subspace of features from the corpus of training data may be randomly sampled according to a probability distribution over the set of features where a probability may be assigned to each of the features that is proportional to the square of the Euclidean norms of the rows of left singular vectors of a matrix of the features representing the corpus of training texts. The small set of features may classify texts using only the relevant features among a very large number of training features.
摘要:
A system and method to facilitate importation of data taxonomies within a network are described. Advertiser entities access a data storage module within a network-based entity to retrieve content information from one or more content taxonomies stored within the data storage module. Subsequently, the advertiser entities select advertisements targeted to specific users based on the retrieved content information and further transmit the advertisements to the network-based entity. Furthermore, publisher entities and/or advertiser entities transmit data, such as, for example, associated taxonomy information, to the network-based entity. The entity receives the respective taxonomy information and parses the taxonomy information to extract node information and associated categories related to the received information. Finally, the entity integrates the node information and associated categories into one or more taxonomies stored within the data storage module. Alternatively, the entity maps the node information and associated categories into corresponding nodes within one or more taxonomies stored within the data storage module, and further stores the mapping information into a mapping database within the data storage module.
摘要:
Methods for selecting advertisements to serve to a client requesting a primary webpage is provided. The client displays a referring webpage having a hyperlink to the primary webpage. Upon selection of the hyperlink, the client sends a request to a content server storing the primary webpage, the request including a referrer of the primary webpage comprising a URL address of the referring webpage. The content server sends the primary webpage to the client which includes the referrer and an advertisement request mechanism configured to make an advertisement request to an advertisement server and attach the referrer to the advertisement request. The advertisement server uses the referrer to select one or more advertisements to serve to the client. The referrer may comprise one or more search query terms submitted by the client. The advertisement server may also use the content of the primary webpage to select the one or more advertisements.
摘要:
Described are a system and method for determined an event occurrence rate. A sample set of content items may be obtained. Each of the content items may be associated with at least one region in a hierarchical data structure. A first impression volume may be determined for the at least one region as a function of a number of impressions registered for the content items associated with the at least one region. A scale factor may be applied to the first impression volume to generate a second impression volume. The scale factor may be selected so that the second impression volume is within a predefined range of a third impression volume. A click-through-rate (CTR) may be estimated as a function of the second impression volume and a number of clicks on the content item.
摘要:
An improved system and method for matching objects belonging to hierarchies is provided and an optimal matching between two feature spaces organized as taxonomies may be learned. The matching may be performed through a multi-level exploration of the hierarchical feature spaces by using multi-armed bandits where the arms of the bandit may be dependent due to the structure induced by the taxonomies. Upon the arrival of an object assigned to the first taxonomy, multi-armed bandits may be run at multiple levels of the taxonomies to select an object assigned to the second taxonomy. Then shrinkage estimation may be performed in a Bayesian framework to exploit dependencies among the arms by estimating payoff probabilities from a beta-binomial model to update payoff probabilities for matching objects from the taxonomies.