Abstract:
A logical directory ranking system ranks documents or web pages utilizing logical directories. The present system groups together compound documents as a single information node with one or more leaves, constructing a logical directory graph. URLs can be grouped at a level of granularity below an individual directory. For example, the URLs may be grouped together on the basis of hostname, domain, or any level of the hierarchy of the URLs. Edges in the logical directory graph are formed by links between the logical directories. Edges have weights corresponding to the number of links between logical directories. Nodes have weights corresponding to the number of web pages or leaves represented by a node. A ranking level is determined for each node as a function of the node weight and the edge weight. The ranking level is then applied to each URL that the node represents.
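The graph construction described above can be sketched as follows. This is a minimal illustration, not the patented method: the function names `logical_directory` and `build_directory_graph`, the `depth` parameter, and the choice to ignore links inside a single logical directory are all assumptions. The abstract does not specify the ranking function, so the sketch stops at the node and edge weights.

```python
from collections import Counter
from urllib.parse import urlsplit


def logical_directory(url, depth=1):
    """Map a URL to a logical directory key (hypothetical helper).

    The key is the hostname plus at most `depth` leading directory segments.
    depth=0 groups by hostname only.
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    # If the path does not end in "/", treat the last segment as the page itself.
    if not parts.path.endswith("/"):
        segments = segments[:-1]
    return "/".join([parts.netloc] + segments[:depth])


def build_directory_graph(pages, links, depth=1):
    """Return (node_weight, edge_weight) for the logical directory graph.

    node_weight[d]     = number of pages (leaves) grouped under directory d
    edge_weight[(a,b)] = number of page-level links from directory a to b
    """
    node_weight = Counter(logical_directory(u, depth) for u in pages)
    edge_weight = Counter()
    for src, dst in links:
        a = logical_directory(src, depth)
        b = logical_directory(dst, depth)
        if a != b:  # assumption: links within one directory form no edge
            edge_weight[(a, b)] += 1
    return node_weight, edge_weight


pages = [
    "http://a.com/docs/p1.html",
    "http://a.com/docs/p2.html",
    "http://b.org/index.html",
]
links = [
    ("http://a.com/docs/p1.html", "http://b.org/index.html"),
    ("http://a.com/docs/p2.html", "http://b.org/index.html"),
    ("http://a.com/docs/p1.html", "http://a.com/docs/p2.html"),
]
node_weight, edge_weight = build_directory_graph(pages, links)
```

A rank computed per node from these weights would then be copied to every URL that maps to that node's key.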
Abstract:
A dangling web page processing system ranks dangling web pages on the web. The system ranks dangling web pages of high quality that cannot be crawled by a crawler. In addition, the system adjusts ranks to penalize dangling web pages that return errors when links on the dangling web pages are crawled. By providing a rank for dangling web pages, the present system allows the concentration of crawling resources on those dangling web pages that have the highest rank in the uncrawled region. The system operates locally to the dangling web pages, providing efficient determination of ranks for the dangling web pages. The system explicitly discriminates against web pages on the basis of whether they point to penalty pages, i.e., pages that return an error when a link is followed. By incorporating more fine-grained information such as this into ranking, the system can improve the quality of individual search results and better manage resources for crawling.
Abstract:
A modular scoring system using rank aggregation merges search results into an ordered list of results using many different features of documents. The ranking functions of the present system can easily be customized to the needs of a particular corpus or collection of users such as an intranet. Rank aggregation is independent of the underlying score distributions of the different factors, and can be applied to merge any set of ranking functions. Rank aggregation has the advantage of combining the influence of many different heuristic factors in a robust way to produce high-quality results for queries. The modular scoring system combines factors such as indegree, page ranking, URL length, proximity to the root server of an intranet, etc., to form a single ordering on web pages that closely obeys the individual orderings, but also mediates between the collective wisdom of individual heuristics.
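To illustrate how rank aggregation can merge orderings without looking at the underlying scores, here is a minimal sketch using a Borda count. The abstract does not name a specific aggregation rule, so Borda is a stand-in chosen for brevity, and the function name `borda_aggregate` and the sample orderings are hypothetical.

```python
def borda_aggregate(rankings):
    """Merge several best-first orderings into a single ordering.

    Each ranking contributes points by position only (first place gets the
    most), so the result does not depend on how each factor scores documents.
    """
    scores = {}
    for ranking in rankings:
        n = len(ranking)
        for position, doc in enumerate(ranking):
            scores[doc] = scores.get(doc, 0) + (n - position)
    # Highest total first; ties are broken by document id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


by_indegree = ["a", "b", "c"]
by_page_rank = ["b", "a", "c"]
by_url_length = ["b", "c", "a"]
merged = borda_aggregate([by_indegree, by_page_rank, by_url_length])
print(merged)  # ['b', 'a', 'c']
```

Adding or removing a heuristic only means adding or removing one list, which is what makes this style of scoring modular.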
Abstract:
An improved system and method for pricing of overlapping impression pools of online advertisement impressions for advertising demand is provided. An inventory of online advertisement impressions may be grouped in impression pools according to attributes of the advertisement impressions, and advertisers' requests for impressions targeting specific attributes may be received. An optimal price may be computed for each of the impression pools of the inventory of online advertisement impressions using dual values of an optimization program. The values of a dual variable for prices of impression pools on the supply constraints of an objective function for allocating the impression pools may be extracted and iteratively increased on those impression pools which have a dual value greater than the book rate value.
Abstract:
An improved system and method for web destination profiling for online population-targeted advertising is provided. A web destination profiler may be provided for generating web destination profiles. Traffic may be analyzed at a particular web destination in order to understand the population visiting the web destination. The analysis of user traffic, including differentiated clickstream data, may be applied for determining known characteristics of a web destination profile. Moreover, unknown characteristics of a web destination profile may be determined using a variety of techniques including inferring characteristics by modeling traffic flow through other web destinations, estimating characteristics from other web destination profiles by predicting traffic flow through other web destinations, propagating characteristics to a web destination profile by smoothing a joint distribution of characteristics of other web destination profiles, and so forth. Web destination profiles may be used by applications such as an online application for population-targeted advertising.
Abstract:
A system (and method) for recovering data from a lost sector in a storage system, whose plurality of disks contains a set of readable and lost sectors, includes identifying a lost sector of at least one disk of the storage system, determining whether the data from the lost sector can be recovered, and, if so, generating a recovery formula for the lost sector and recovering the data from the lost sector based on the recovery formula.
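The identify, check, generate-formula, recover sequence can be sketched for the simplest case. The abstract does not name an erasure code, so this example assumes a single XOR parity sector per stripe; the names `recovery_formula` and `recover` and the representation of a lost sector as `None` are hypothetical.

```python
from functools import reduce


def recovery_formula(stripe, lost_index):
    """Return the indices whose XOR rebuilds the lost sector, or None.

    `stripe` is a list of equal-length bytes objects (data sectors plus one
    XOR parity sector), with None marking a lost sector. Under single parity
    the sector is recoverable only if it is the sole loss in the stripe.
    """
    lost = [i for i, sector in enumerate(stripe) if sector is None]
    if lost != [lost_index]:
        return None
    return [i for i in range(len(stripe)) if i != lost_index]


def recover(stripe, lost_index):
    """Rebuild the lost sector from its recovery formula, or return None."""
    formula = recovery_formula(stripe, lost_index)
    if formula is None:
        return None
    # XOR the surviving sectors together, one byte column at a time.
    columns = zip(*(stripe[i] for i in formula))
    return bytes(reduce(lambda a, b: a ^ b, column) for column in columns)


d0 = b"\x01\x02"
d1 = b"\x10\x20"
parity = bytes(a ^ b for a, b in zip(d0, d1))
rebuilt = recover([d0, None, parity], lost_index=1)
```

With stronger codes the formula would be a different combination of surviving sectors, but the control flow (test recoverability first, then apply the formula) stays the same.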
Abstract:
An improved system and method for allocating and pricing impression pools of advertisement impressions with frequency capping is provided. An upper bound on the number of impressions which an impression pool can supply to satisfy advertiser demand may be generated and used as a constraint to allocate impressions to satisfy advertiser requests. Either a deterministic upper bound may be generated or a stochastic upper bound may be generated on the number of impressions which an impression pool can supply to satisfy advertiser demand, and this upper bound may be used as a constraint to allocate impressions to satisfy advertiser requests for advertisement placements on the display advertising properties. In an embodiment, frequency caps, display frequencies, arrival rates of unique users, and departure rates of unique users may be used to compute the upper bounds on the number of impressions which an impression pool can supply to satisfy advertiser demand.
Abstract:
An improved system and method for generating a maximum utility slate of advertisements for online advertisement auctions is provided. Various utility factors for each advertisement that may be a candidate in a slate of advertisements may be applied within a framework in order to generate a maximum utility slate of advertisements. Either backward or forward dynamic programming may be applied to recursively evaluate the utility of subslates of advertisements in order to generate a maximum utility slate of advertisements. In an embodiment, a network with directed edges and associated costs may be defined, and the longest path may be found in the directed network for constructing a maximum utility slate of advertisements. Various utility factors may be applied for different objectives of an auctioneer and the framework presented may be extended for revenue ordering, exclusion of bidders, ordering slates according to first and second price utilities, and so forth.
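As a rough illustration of the dynamic programming idea, the sketch below fills slots in order and evaluates subslates recursively, which is equivalent to finding a longest path in a layered directed network. Several things here are assumptions and not taken from the abstract: candidates arrive in a fixed order that the slate must preserve, each slot has a position weight, the utility of a slate is the sum of slot weight times advertisement utility, and the name `max_utility_slate` is hypothetical.

```python
def max_utility_slate(utilities, slot_weights):
    """Forward dynamic program over (slots filled, candidates considered).

    best[j][i] is the best total utility when the first j slots are filled
    using only the first i candidates, keeping candidate order.
    Returns (total_utility, list of chosen candidate indices).
    """
    n, k = len(utilities), len(slot_weights)
    neg_inf = float("-inf")
    best = [[neg_inf] * (n + 1) for _ in range(k + 1)]
    for i in range(n + 1):
        best[0][i] = 0.0  # an empty slate has zero utility
    for j in range(1, k + 1):
        for i in range(1, n + 1):
            skip = best[j][i - 1]
            take = best[j - 1][i - 1] + slot_weights[j - 1] * utilities[i - 1]
            best[j][i] = max(skip, take)
    # Pick the slate length with the highest total (shorter slates allowed).
    j = max(range(k + 1), key=lambda s: best[s][n])
    total = best[j][n]
    # Walk back through the table to recover the chosen candidates.
    slate, i = [], n
    while j > 0:
        if best[j][i] == best[j][i - 1]:
            i -= 1  # candidate i-1 was skipped
        else:
            slate.append(i - 1)  # candidate i-1 fills slot j-1
            i -= 1
            j -= 1
    slate.reverse()
    return total, slate


total, slate = max_utility_slate([1.0, 5.0, 3.0], [1.0, 0.5])
print(total, slate)  # 6.5 [1, 2]
```

Other objectives, such as excluding a bidder, could be expressed by assigning that candidate a utility of negative infinity so that no best path passes through it.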
Abstract:
The present invention introduces methods for allocating overlapping inventory. In the system of the present invention, overlapping inventory problems are reformulated as a network transport problem. Specifically, different inventory types are represented as inventory network nodes. Similarly, corresponding inventory requests are represented as request network nodes. Each inventory network node whose inventory can satisfy an inventory request is coupled to the request network node associated with that request. A source node is then coupled to the inventory network nodes and a destination node is coupled to the request network nodes. A flow limit equal to the available inventory is assigned to each connection between the source node and an inventory network node. Finally, inventory requests that must be satisfied are represented as flow minimums between the request network nodes and the destination node. Solving the network transport problem then yields a solution to the corresponding overlapping inventory allocation problem.
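The network construction above can be sketched with a small maximum-flow solver. This is an illustration under stated assumptions: each flow minimum is modeled as the capacity of the request-to-destination edge, and the allocation is accepted only if every such edge is saturated, which shows that the minimums can be met. The solver is a plain Edmonds-Karp implementation, and the names `max_flow` and `allocate` and the sample inventory are hypothetical.

```python
from collections import defaultdict, deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp maximum flow. Returns (flow value, residual graph)."""
    residual = defaultdict(lambda: defaultdict(int))
    for (u, v), c in capacity.items():
        residual[u][v] += c
        residual[v][u] += 0  # make sure the reverse edge exists
    flow = 0
    while True:
        # Breadth-first search for a shortest augmenting path.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, c in residual[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return flow, residual
        # Find the bottleneck along the path, then push flow through it.
        bottleneck, v = float("inf"), sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck


def allocate(supply, demand, eligible):
    """Return {(inventory, request): amount}, or None if demand cannot be met."""
    capacity = {}
    for inv, available in supply.items():
        capacity[("source", ("inv", inv))] = available  # flow limit
    for req, needed in demand.items():
        capacity[(("req", req), "sink")] = needed  # required amount
        for inv in eligible[req]:
            capacity[(("inv", inv), ("req", req))] = supply[inv]
    flow, residual = max_flow(capacity, "source", "sink")
    if flow < sum(demand.values()):
        return None
    # Flow on an inventory->request edge equals the reverse residual.
    result = {}
    for req in demand:
        for inv in eligible[req]:
            amount = residual[("req", req)][("inv", inv)]
            if amount > 0:
                result[(inv, req)] = amount
    return result


supply = {"sports": 100, "news": 50}
demand = {"A": 80, "B": 60}
eligible = {"A": ["sports"], "B": ["sports", "news"]}
plan = allocate(supply, demand, eligible)
```

Request B overlaps with request A on the "sports" inventory, and the flow solver resolves that overlap by routing part of B's demand through "news".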