摘要:
A neural network is used to process a set of ranking features in order to determine the relevancy ranking for a set of documents or other items. The neural network calculates a predicted relevancy score for each document and the documents can then be ordered by that score. Alternate embodiments apply a set of data transformations to the ranking features before they are input to the neural network. Training can be used to adapt both the neural network and certain of the data transformations to target environments.
摘要:
A neural network is used to process a set of ranking features in order to determine the relevancy ranking for a set of documents or other items. The neural network calculates a predicted relevancy score for each document and the documents can then be ordered by that score. Alternate embodiments apply a set of data transformations to the ranking features before they are input to the neural network. Training can be used to adapt both the neural network and certain of the data transformations to target environments.
摘要:
A query pipeline for an enterprise search system is configurable by a user of the system. A user may create rules for custom query transformation and parallel query generation, federation of queries, mixing of results and application of display layouts to the received search results. A user interface (UI) assists a user in configuring the search pipeline. For example, a user may enter condition action rules for queries that affect how a query is transformed, how parallel queries are generated, how queries are federated, how search results are ranked and displayed, how rules are ordered and the like.
摘要:
A customizable ranking model of a search engine using custom ranking model configuration and parameters of a pre-defined human-readable format. The architecture can employ a markup language schema to represent the custom ranking model. In one implementation, the schema developed utilizes XML (extensible markup language) for representing the custom ranking model. Weights for dynamic and static relevance ingredients can be altered per ranking model and new relevance ingredients can be added. Additionally, features are provided for improving relevance such as adding terms to a thesaurus for synonym expansion, for example, the ability to deal with single terms either as compounds, and/or using custom word breaking rules.
摘要:
A query received from a user is directed to a particular search application (e.g. an Enterprise search portal) that is associated with a result source from which to retrieve results. The received query may be federated to additional result sources when the received query is determined to be a popular query in a result source. Query logs associated with the additional result sources are analyzed to determine when a query is popular as compared to the original result source. The query may be altered before being executed that uses one or more of the additional result sources. When the query (altered/unaltered) is determined to be popular for any of the additional result sources as compared to the original result source, the query is executed using that additional result source.
摘要:
Technologies are described herein for ranking documents with social tags. A number ranking feature containing a number of times a document was tagged is received. A textual property ranking feature containing a union of each social tag associated with the document is also received. The number ranking feature is transformed into a static input value. Further, the textual property ranking feature is transformed into a dynamic input value. A document rank for the document is determined by inputting the static input value and/or the dynamic input value into a ranking function.
摘要:
Various technologies and techniques are disclosed for calculating authorship dates for a document. A portion of a document to select to look for possible authorship dates is determined. The possible authorship dates are extracted from the portion of the document. A revised authorship date of the document is generated using a neural network. The revised authorship date is returned to an application or process that requested the date.
摘要:
A markup language schema utilized to represent internal ranking models. In one implementation, the schema developed utilizes XML (extensible markup language) for internal ranking model representation. Other markups languages can be employed.
摘要:
Tools and techniques for expertise ranking using social distance are provided. These tools may receive search queries from users, and extract from these search queries record identifiers associated with the users. In addition, the tools may extract query strings from the search queries. In connection with processing these queries, the tools may identify other users associated with a given user, with some of these other users being first-level colleagues of a given user, and some of these other users being second-level colleagues. The tools may identify documents within a search store that are associated with the other users, and may search these documents for any occurrences of the query string. In turn, results of the search may be ranked based on a social distance between the user and the other users, with the social distance indicating whether the other users are first-level or second-level colleagues of the user.
摘要:
Embodiments are configured to provide information based on a user query. In an embodiment, a system includes a search component having a ranking component that can be used to rank search results as part of a query response. In one embodiment, the ranking component includes a ranking algorithm that can use one or more click-through features to rank search results which may be returned in response to a query. Other embodiments are available.