Abstract:
The present invention relates to a system and methodology to facilitate extraction of information from large unstructured corpora, such as the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions, provided by Bayesian or other statistical models, of the ultimate quality of answers generated by the system. Such predictions, when coupled with a utility model, can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, the information extraction actions with the highest expected utility can be taken. In this manner, the accuracy of answers to questions can be balanced against the cost of the information extraction and analysis needed to compose the answers.
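The cost-benefit decision described in this abstract can be illustrated with a minimal sketch. It assumes a hypothetical list `predicted_quality`, where entry `n` is a model's estimate of the probability that the answer is correct after `n` queries, plus a hypothetical linear utility (value of a correct answer minus a fixed cost per query). None of these names or the utility form come from the patent; they only illustrate choosing the action with the highest expected utility.

```python
from typing import Sequence


def choose_query_count(predicted_quality: Sequence[float],
                       value_of_correct_answer: float,
                       cost_per_query: float) -> int:
    """Return the number of queries that maximizes expected utility.

    predicted_quality[n] is an assumed model estimate of the probability
    that the final answer is correct after issuing n queries (index 0
    means no queries are issued).
    """
    best_n, best_utility = 0, float("-inf")
    for n, p_correct in enumerate(predicted_quality):
        # Expected benefit of the answer minus the total cost of n queries.
        utility = p_correct * value_of_correct_answer - n * cost_per_query
        if utility > best_utility:
            best_n, best_utility = n, utility
    return best_n


# Illustrative numbers only: quality gains flatten as more queries are issued.
quality = [0.0, 0.4, 0.6, 0.7, 0.75, 0.77]
print(choose_query_count(quality, value_of_correct_answer=10.0, cost_per_query=0.6))
```

With cheap queries the sketch issues more of them; as the per-query cost rises, it stops earlier because the marginal gain in predicted quality no longer pays for the next query.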
Abstract:
Architecture for improving text searches using information redundancy. A search component is coupled with an analysis component to rerank documents returned in a search according to redundancy values. Each returned document is used to develop a corresponding word probability distribution, which is then used to rerank the returned documents according to the associated redundancy values. In another aspect thereof, the query component is coupled with a projection component to project answer redundancy from one document search to another. This includes obtaining the benefit of considerable answer redundancy from a second data source by projecting the success of the search of the second data source against a first data source.
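One way to picture redundancy-based reranking is the sketch below. It builds a unigram word probability distribution per returned document and scores each document by its average overlap (a dot product of distributions) with the other returned documents. The scoring formula and the function names are assumptions made for illustration; the abstract does not specify how the redundancy value is computed.

```python
from collections import Counter
from typing import Dict, List


def word_distribution(text: str) -> Dict[str, float]:
    """Unigram probability distribution over whitespace-separated words."""
    counts = Counter(text.lower().split())
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}


def rerank_by_redundancy(documents: List[str]) -> List[int]:
    """Return document indices ordered from most to least redundant.

    Assumed redundancy value: the mean dot product between a document's
    word distribution and the distribution of every other document.
    """
    dists = [word_distribution(d) for d in documents]
    scores = []
    for i, dist in enumerate(dists):
        others = [d for j, d in enumerate(dists) if j != i]
        overlap = sum(
            sum(p * other.get(w, 0.0) for w, p in dist.items())
            for other in others
        )
        scores.append(overlap / len(others) if others else 0.0)
    return sorted(range(len(documents)), key=lambda i: scores[i], reverse=True)


docs = [
    "paris is the capital of france",
    "the capital of france is paris",
    "bananas are yellow fruit",
]
print(rerank_by_redundancy(docs))
```

Documents that repeat what other results say rise to the top, while an outlier that shares no vocabulary with the rest is ranked last.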
Abstract:
Systems and methods that estimate user preference via automatic interpretation of user behavior. A user behavior component associated with a search engine can automatically interpret collective behavior of users (e.g., web search users). Such a feedback component can include user behavior features and predictive models (e.g., from a user behavior component) that are robust to noise, which can be present in observed user interactions with the search results (e.g., malicious and/or irrational user activity).
Abstract:
The present invention relates to systems and methods that employ user models to personalize generalized queries and/or search results according to information that is relevant to respective user characteristics. A system is provided that facilitates generating personalized searches of information. The system includes a user model to determine characteristics of a user. The user model may be assembled automatically via an analysis of a user's content, activities, and overall context. A personalization component automatically modifies queries and/or search results in view of the user model in order to personalize information searches for the user. A user interface receives the queries and displays the search results from one or more local and/or remote search engines, wherein the interface can be adjusted in a range from more personalized searches to more generalized searches.
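The adjustable range from generalized to personalized search can be sketched as a single blending weight. The example below assumes a hypothetical user model reduced to a set of interest terms, a generic relevance score per result, and a slider value `alpha` between 0 (fully generalized) and 1 (fully personalized). These simplifications are illustrative assumptions, not the patent's method.

```python
from typing import List, Set, Tuple


def personalize(results: List[Tuple[str, float]],
                user_terms: Set[str],
                alpha: float) -> List[str]:
    """Rerank (title, generic_score) pairs by a blended score.

    alpha = 0.0 keeps the generic ranking; alpha = 1.0 ranks purely by
    overlap with the (assumed) user-model terms.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")

    def blended(item: Tuple[str, float]) -> float:
        title, generic = item
        words = set(title.lower().split())
        # Fraction of title words that match the user's interest terms.
        personal = len(words & user_terms) / len(words) if words else 0.0
        return (1.0 - alpha) * generic + alpha * personal

    return [title for title, _ in sorted(results, key=blended, reverse=True)]


results = [("jaguar car dealership", 0.9), ("jaguar habitat rainforest", 0.6)]
interests = {"rainforest", "habitat", "wildlife"}
print(personalize(results, interests, alpha=0.0))
print(personalize(results, interests, alpha=1.0))
```

Moving `alpha` toward 1 lets the user model dominate the ordering, which mirrors the interface adjustment described in the abstract.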
Abstract:
Context-based user behavior data is collected from a search mechanism. This data includes, for a given query, user feedback (implicit and explicit) on the query and context information on the query. A predictive pattern is applied to the context-based user behavior data in order to produce predicted user satisfaction data. Data mining techniques may be used to create and improve one or more predictive patterns. Predicted user satisfaction data can be used to monitor or improve search mechanism performance, via a display reporting the performance or identification of any queries with a shared characteristic and sub-par user satisfaction. A dynamically-improving search mechanism uses the predicted user satisfaction data to improve the performance of the search mechanism.
Abstract:
Techniques and systems are disclosed for providing changed content identification for an online document that is accessed by a user or user agent. A reference point, comprising a stored prior version of the document, is identified for an online document that a user or user agent is interested in accessing. The prior version of the document is retrieved when the user or user agent accesses the online document, such as by using the reference point. Elements of the prior version are compared with elements of a current version of the document to determine whether there are differences between the versions. If changes are identified between the prior version and the current version, the current version is automatically updated with visual or auditory representations that identify those changes of content.
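The element-by-element comparison can be sketched with Python's standard `difflib` module. The example assumes each document version has already been split into a list of element strings (for instance, paragraphs), and it labels every element of the current version as changed or unchanged. The function name and the labels are hypothetical; a real system would turn the labels into visual or auditory markers.

```python
import difflib
from typing import List, Tuple


def mark_changes(prior: List[str], current: List[str]) -> List[Tuple[str, str]]:
    """Label each element of the current version relative to the prior one."""
    matcher = difflib.SequenceMatcher(a=prior, b=current, autojunk=False)
    marked = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        # "equal" spans are untouched; "replace" and "insert" spans are new
        # or modified content. "delete" spans contribute no current elements.
        label = "unchanged" if tag == "equal" else "changed"
        for element in current[j1:j2]:
            marked.append((label, element))
    return marked


prior_version = ["Title", "Price: $10", "Footer"]
current_version = ["Title", "Price: $12", "New banner", "Footer"]
for label, element in mark_changes(prior_version, current_version):
    print(label, "|", element)
```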
Abstract:
The present invention relates to a system and methodology for dynamic presentation of search result information within a selected area of a display. In one aspect, a computerized interface for data presentation is provided. The system includes a lens component associated with a portion of a user interface display, wherein the lens component defines an area to display information from at least one search result. A layout component displays a detailed subset of information within the lens component based upon the search result.
Abstract:
A search engine user interface is provided that reduces the need for explicit search rules; dynamically responds as user input is entered to give immediate feedback to a user; is not limited to searching data residing in a single store; and may be used with a plurality of search engines. The search engine user interface provides search functions for a plurality of types of file metadata and types of file content. The search engine user interface provides an active query box, query editing, word-wheeling, and query narrowing and broadening. The user interface provides accordion behavior for visual elements of the user interface, integrated custom tagging, multiple independent search parameters, and filtering and integrated custom tagging in a common file dialog box.