摘要:
Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.
摘要:
Techniques are provided for the efficient location, processing, and retrieval of local product information derived from web pages generally locatable through form queries submitted to web pages often referred to as the “deep” or “hidden” web. In an embodiment, information such as product information and dealer-location information is located on a web page form such as a dealer-locator form. After location of a suitable web page form, editorial wrapping is performed to create an automated information extraction process. Using the automated information extractor, deep-web crawling is performed. A grid-based extraction of individual business records is performed, and matching and ingestion are performed in conjunction with a business listing database. Finally, metadata tags are added to entries in the business listing database. Metadata tags also may be added to entries in other databases.
摘要:
Various users' navigational behaviors relative to search results presented by a search engine are monitored. URLs that are visited and revised queries that are submitted after the submission of an original query are placed within a trail that begins with the original query. These trails are grouped based on the original queries with which they begin. For each trail group, a set of URLs that frequently occur in that group's trails, and a set of revised queries that frequently occur in that group's trails, are determined. These frequently occurring elements are mapped to the original queries with which all the trails in the corresponding trail group begin. In response to subsequent submissions of the same original query, the search engine ensures that URLs and revised queries that are mapped to the original query are prominently displayed on the search results pages that are initially returned in response to those submissions.
摘要:
The present invention is directed towards systems and methods for generating and displaying the difference between a primary result set and a secondary result set. According to the present invention, a method for displaying the difference between a primary result set and a secondary result set for a query comprises generating a primary result set and a secondary result set, the primary result set and secondary result set generated according to one or more respective disparate search algorithms. A difference result set is determined according to a difference between items in the primary result set and second result set, which is displayed to a user.
摘要:
Various users' navigational behaviors relative to search results presented by a search engine are monitored. URLs that are visited and revised queries that are submitted after the submission of an original query are placed within a trail that begins with the original query. These trails are grouped based on the original queries with which they begin. For each trail group, a set of URLs that frequently occur in that group's trails, and a set of revised queries that frequently occur in that group's trails, are determined. These frequently occurring elements are mapped to the original queries with which all the trails in the corresponding trail group begin. In response to subsequent submissions of the same original query, the search engine ensures that URLs and revised queries that are mapped to the original query are prominently displayed on the search results pages that are initially returned in response to those submissions.
摘要:
The present invention is directed towards systems and methods for generating and displaying the difference between a primary result set and a secondary result set. According to the present invention, a method for displaying the difference between a primary result set and a secondary result set for a query comprises generating a primary result set and a secondary result set, the primary result set and secondary result set generated according to one or more respective disparate search algorithms. A difference result set is determined according to a difference between items in the primary result set and second result set, which is displayed to a user.
摘要:
Determine a plurality of first dwell durations for a plurality of first web pages, each first dwell duration indicating a time period a user has spent with a first web page. Access a plurality of first quality ratings for the first web pages, each first quality rating indicating a quality of a first web page as a part of a search result generated for a first search query. Access a predefined quality rating threshold. Correlate the first dwell durations and the first quality ratings. And, determine a dwell duration threshold, such that a second user spending a second dwell duration greater than or equal to the dwell duration threshold with a second web page indicates that the second user is satisfied with the second web page identified in a second search result generated by a search engine in response to a second search query requested by the second user.
摘要:
The present invention is directed towards systems and methods for generating and displaying the difference between a primary result set and a secondary result set. According to the present invention, a method for displaying the difference between a primary result set and a secondary result set for a query comprises generating a primary result set and a secondary result set, the primary result set and secondary result set generated according to one or more respective disparate search algorithms. A difference result set is determined according to a difference between items in the primary result set and second result set, which is displayed to a user.