Abstract:
Methods and systems are described for simulating a search in order to evaluate one or more scoring functions used to order item listings for presentation in a search results page. Consistent with some embodiments, a simulation platform includes a real-time simulation module that receives search result sets for search queries that result in the conclusion of a transaction. Each result set is then processed by the simulation platform with one or more test scoring functions, such that the resulting position of the item listing that resulted in the transaction can be compared with the actual position at which the item listing was displayed in the actual search results. For each test scoring function, an average rank shift metric is determined and displayed, thereby providing a metric on which to base decisions about which scoring functions to use in the production system.
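The comparison described above can be illustrated with a minimal sketch. The abstract does not define the metric precisely, so the sign convention (actual position minus simulated position, positive meaning the test function moved the purchased listing upward), the dictionary keys, and the function names are all assumptions made for illustration.

```python
from statistics import mean

def rank_of(item_id, result_set, score):
    """Return the 1-based position of item_id when result_set is
    sorted by the test scoring function, highest score first."""
    ordered = sorted(result_set, key=score, reverse=True)
    return [item["id"] for item in ordered].index(item_id) + 1

def average_rank_shift(transactions, score):
    """Average of (actual position - simulated position) over all
    logged transactions. Assumed convention: a positive value means
    the test function ranks purchased listings higher than production did."""
    shifts = [
        t["actual_rank"] - rank_of(t["purchased_id"], t["results"], score)
        for t in transactions
    ]
    return mean(shifts)
```

Computing this value once per candidate scoring function over the same logged transactions gives one number per function that can be compared side by side.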
Abstract:
In various example embodiments, systems and methods for providing category aspect information by mining historical data are provided. In example embodiments, a table comprising joined data is accessed. The table includes historical data that comprises user behavior data based on actions performed with past queries by users, data describing publications, and a determined category for each publication. Demand scores based on the joined data are determined. The determined demand scores are used to determine the most relevant aspect name and aspect value pairs for a category. Publications having at least one of the most relevant aspect name and aspect value pairs are displayed visually distinguished from less relevant publications that exclude the most relevant aspect name and aspect value pairs.
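As a rough sketch of how demand scores might be derived from a joined table, the example below sums a weight per user action for each (category, aspect name, aspect value) triple. The abstract does not say how the scores are computed; the action weights, the row layout, and the function names here are hypothetical.

```python
from collections import defaultdict

# Hypothetical weights: the source does not specify how actions are scored.
ACTION_WEIGHTS = {"view": 1.0, "bid": 3.0, "purchase": 5.0}

def demand_scores(joined_rows):
    """Accumulate a demand score for every (category, name, value) triple."""
    scores = defaultdict(float)
    for row in joined_rows:
        weight = ACTION_WEIGHTS.get(row["action"], 0.0)
        for name, value in row["aspects"].items():
            scores[(row["category"], name, value)] += weight
    return scores

def top_aspect_pairs(scores, category, k=2):
    """Return the k highest-scoring (name, value) pairs for a category."""
    pairs = [(key[1], key[2], s) for key, s in scores.items() if key[0] == category]
    pairs.sort(key=lambda p: p[2], reverse=True)
    return [(name, value) for name, value, _ in pairs[:k]]
```

A publication would then count as "most relevant" if its aspects contain at least one of the returned pairs.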
Abstract:
Systems and methods for generating data dictionaries are disclosed. In one example, a system comprises a candidate list generator module to generate a list of keywords from search query information; a validation module to validate candidate token pairs based on a divergence value, wherein candidate token pairs having divergence values exceeding a divergence threshold are valid token pairs; and a data dictionary module to receive the validated token pairs as entries in a vocabulary and to provide the validated token pairs in response to a search query containing a keyword corresponding to at least one entry in the vocabulary.
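The threshold-based validation step can be sketched as follows. The abstract does not name the divergence measure or say which distributions are compared; this example assumes Kullback-Leibler divergence between two category distributions attached to each candidate pair, and all names and data layouts are illustrative only.

```python
import math

def kl_divergence(p, q, eps=1e-9):
    """KL divergence D(p || q) for two discrete distributions given as
    dicts; eps smooths zero probabilities. The choice of KL is an assumption."""
    total = 0.0
    for key in set(p) | set(q):
        pk = p.get(key, 0.0)
        if pk > 0:
            total += pk * math.log((pk + eps) / (q.get(key, 0.0) + eps))
    return total

def validate_pairs(candidates, threshold):
    """Keep the token pairs whose divergence exceeds the threshold.
    candidates maps (token_a, token_b) -> (distribution_p, distribution_q)."""
    return {
        pair for pair, (p, q) in candidates.items()
        if kl_divergence(p, q) > threshold
    }

def lookup(vocabulary, query):
    """Return validated pairs that share a token with the query
    (an assumed reading of 'keyword corresponding to an entry')."""
    tokens = set(query.lower().split())
    return sorted(pair for pair in vocabulary if tokens & set(pair))
```

With this layout, a pair whose two distributions are identical has zero divergence and is rejected for any positive threshold.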
Abstract:
Systems and methods for providing category aspect information by mining historical data are provided. A table comprising joined data is accessed. The table includes historical data that comprises user behavior data based on actions performed with past queries by users, listing data that includes aspect data for each listing, and a determined category for each listing. Demand scores based on the joined data are determined. The determined demand scores are used to determine at least one most relevant aspect name for a category. A message is transmitted for display on a device of the user that indicates the at least one most relevant aspect name for the category and suggests the user provide an aspect value that corresponds to the at least one most relevant aspect name to be included in a new listing.
Abstract:
Systems and methods to process a query with a unified storage interface are described. The system receives a query from a client machine and generates a query expression tree based on the query. The system generates a cursor expression tree based on the query expression tree. The system executes a plurality of software components in the cursor expression tree to retrieve data from a first storage device. The plurality of software components comprises a first software component that is utilized to retrieve data irrespective of a plurality of storage devices and a second software component that is utilized to retrieve data from the first storage device. Finally, the system communicates search results to the client machine, the search results including at least a portion of the data.
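The split between storage-independent and storage-specific components can be illustrated with a small sketch. The class names, the dict-backed "storage device", and the restriction to term and AND nodes are assumptions for illustration and are not taken from the source.

```python
from dataclasses import dataclass

# Query expression tree nodes (hypothetical shapes).
@dataclass
class Term:
    field: str
    value: str

@dataclass
class And:
    left: object
    right: object

class InMemoryTermCursor:
    """Storage-specific component: reads row ids from a dict-backed store."""
    def __init__(self, store, term):
        self.store, self.term = store, term

    def rows(self):
        return {rid for rid, row in self.store.items()
                if row.get(self.term.field) == self.term.value}

class AndCursor:
    """Storage-independent component: intersects its children's results
    without knowing which storage device produced them."""
    def __init__(self, left, right):
        self.left, self.right = left, right

    def rows(self):
        return self.left.rows() & self.right.rows()

def build_cursor_tree(node, store):
    """Translate a query expression tree into a cursor expression tree."""
    if isinstance(node, Term):
        return InMemoryTermCursor(store, node)
    if isinstance(node, And):
        return AndCursor(build_cursor_tree(node.left, store),
                         build_cursor_tree(node.right, store))
    raise TypeError(f"unsupported node: {node!r}")
```

Supporting a second storage device in this sketch would mean adding another leaf cursor class with the same `rows()` method, while `AndCursor` stays unchanged.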
Abstract:
In various example embodiments, systems and methods for providing category aspect information by mining historical data are provided. In example embodiments, a table comprising joined data is accessed. The table includes historical data that comprises user behavior data based on actions performed with past queries by users, listing data that includes aspect data for each listing, and a determined category for each listing. Demand scores based on the joined data are determined. The determined demand scores are used to determine at least one most relevant aspect name for a category. A message is transmitted for display on a device of the user that indicates the at least one most relevant aspect name for the category and suggests the user provide an aspect value that corresponds to the at least one most relevant aspect name to be included in a new listing.