Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora
    1.
    发明申请
    Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora 有权
    通过从大型非结构化语料库中提取信息来自动构成问题答案的成本效益方法

    公开(公告)号:US20050033711A1

    公开(公告)日:2005-02-10

    申请号:US10635274

    申请日:2003-08-06

    摘要: The present invention relates to a system and methodology to facilitate extraction of information from a large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions of the ultimate quality of answers generated by the system provided by Bayesian or other statistical models. Such predictions, when coupled with a utility model can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, information extraction actions can be taken with the highest expected utility. In this manner, the accuracy of answers to questions can be balanced with the cost of information extraction and analysis to compose the answers.

    摘要翻译: 本发明涉及一种便利从诸如万维网和/或其他非结构化来源的大型非结构化语料库提取信息的系统和方法。 通过概率模型和成本效益分析,可以通过这些来源自动构成问题答案形式的信息,以指导基于知识的问答系统采用的资源密集型信息提取程序。 分析可以利用由贝叶斯或其他统计模型提供的系统生成的答案的最终质量的预测。 当与实用新型相结合时,这种预测可以为系统提供对发出给搜索引擎(或引擎)的查询数量的决定的能力,考虑到查询的成本和查询结果的期望值来提炼最终的 回答。 给定一个偏好模型,可以采用最高预期效用的信息提取动作。 以这种方式,可以将问题答案的准确性与信息提取和分析的成本进行平衡,以构成答案。

    Utilizing information redundancy to improve text searches
    2.
    发明申请
    Utilizing information redundancy to improve text searches 失效
    利用信息冗余来改进文本搜索

    公开(公告)号:US20060116996A1

    公开(公告)日:2006-06-01

    申请号:US11336360

    申请日:2006-01-20

    IPC分类号: G06F17/30

    摘要: Architecture for improving text searches using information redundancy. A search component is coupled with an analysis component to rerank documents returned in a search according to a redundancy values. Each returned document is used to develop a corresponding word probability distribution that is further used to rerank the returned documents according to the associated redundancy values. In another aspect thereof, the query component is coupled with a projection component to project answer redundancy from one document search to another. This includes obtaining the benefit of considerable answer redundancy from a second data source by projecting the success of the search of the second data source against a first data source.

    摘要翻译: 使用信息冗余改进文本搜索的架构。 搜索组件与分析组件耦合,以根据冗余值重新排列在搜索中返回的文档。 每个返回的文档用于开发相应的字概率分布,其进一步用于根据相关联的冗余值重新排列返回的文档。 在另一方面,查询组件与投影组件耦合以将答复冗余从一个文档搜索投射到另一个。 这包括通过针对第一数据源投射搜索第二数据源的成功来从第二数据源获得相当多的应答冗余的好处。

    MINING WEB SEARCH USER BEHAVIOR TO ENHANCE WEB SEARCH RELEVANCE
    3.
    发明申请
    MINING WEB SEARCH USER BEHAVIOR TO ENHANCE WEB SEARCH RELEVANCE 审中-公开
    采矿网搜索用户行为来增强网页搜索的相关性

    公开(公告)号:US20070208730A1

    公开(公告)日:2007-09-06

    申请号:US11457733

    申请日:2006-07-14

    IPC分类号: G06F17/30

    CPC分类号: G06F16/337 G06F16/9535

    摘要: Systems and methods that estimate user preference, via automatic interpretation of user behavior. A user behavior component associated with a search engine can automatically interpret collective behavior of users (e.g., web search users). Such feedback component can include user behavior features and predictive models (e.g., from a user behavior component) that are robust to noise, which can be present in observed user interactions with the search results (e.g., malicious and/or irrational user activity.)

    摘要翻译: 通过用户行为的自动解释来估计用户偏好的系统和方法。 与搜索引擎相关联的用户行为组件可以自动解释用户(例如,网络搜索用户)的集体行为。 这样的反馈组件可以包括用户行为特征和对噪声鲁棒的预测模型(例如,来自用户行为组件),其可以存在于观察到的与搜索结果(例如,恶意和/或不合理的用户活动)的用户交互中。

    COST-BENEFIT APPROACH TO AUTOMATICALLY COMPOSING ANSWERS TO QUESTIONS BY EXTRACTING INFORMATION FROM LARGE UNSTRUCTURED CORPORA

    公开(公告)号:US20060294037A1

    公开(公告)日:2006-12-28

    申请号:US11469136

    申请日:2006-08-31

    IPC分类号: G06N5/02 G06F17/00

    摘要: The present invention relates to a system and methodology to facilitate extraction of information from a large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions of the ultimate quality of answers generated by the system provided by Bayesian or other statistical models. Such predictions, when coupled with a utility model can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, information extraction actions can be taken with the highest expected utility. In this manner, the accuracy of answers to questions can be balanced with the cost of information extraction and analysis to compose the answers.

    Systems, methods, and interfaces for providing personalized search and information access
    5.
    发明申请
    Systems, methods, and interfaces for providing personalized search and information access 审中-公开
    用于提供个性化搜索和信息访问的系统,方法和接口

    公开(公告)号:US20060074883A1

    公开(公告)日:2006-04-06

    申请号:US10958560

    申请日:2004-10-05

    IPC分类号: G06F17/30

    CPC分类号: G06F16/9535

    摘要: The present invention relates to systems and methods that employ user models to personalize generalized queries and/or search results according to information that is relevant to respective user characteristics. A system is provided that facilitates generating personalized searches of information. The system includes a user model to determine characteristics of a user. The user model may be assembled automatically via an analysis of a user's content, activities, and overall context. A personalization component automatically modifies queries and/or search results in view of the user model in order to personalize information searches for the user. A user interface receives the queries and displays the search results from one or more local and/or remote search engines, wherein the interface can be adjusted in a range from more personalized searches to more generalized searches.

    摘要翻译: 本发明涉及采用用户模型根据与各个用户特征相关的信息个性化广义查询和/或搜索结果的系统和方法。 提供了一种有助于生成信息的个性化搜索的系统。 该系统包括用于确定用户特征的用户模型。 可以通过对用户内容,活动和整体上下文的分析来自动组合用户模型。 个性化组件根据用户模型自动修改查询和/或搜索结果,以个性化用户的信息搜索。 用户界面接收查询并显示来自一个或多个本地和/或远程搜索引擎的搜索结果,其中可以在从更个性化搜索到更广义搜索的范围内调整界面。

    Automated satisfaction measurement for web search
    6.
    发明申请
    Automated satisfaction measurement for web search 有权
    网页搜索的自动满意度测量

    公开(公告)号:US20050125390A1

    公开(公告)日:2005-06-09

    申请号:US10806271

    申请日:2004-03-22

    摘要: Context-based user behavior data is collected from a search mechanism. This data includes, for a given query, user feedback (implicit and explicit) on the query and context information on the query. A predictive pattern is applied to the context-based user behavior data in order to produce predicted user satisfaction data. Data mining techniques may be used to create and improve one or more predictive patterns. Predicted user satisfaction data can be used to monitor or improve search mechanism performance, via a display reporting the performance or identification of any queries with a shared characteristic and sub-par user satisfaction. A dynamically-improving search mechanism uses the predicted user satisfaction data to improve the performance of the search mechanism.

    摘要翻译: 从搜索机制收集基于上下文的用户行为数据。 对于给定的查询,该数据包括查询上的用户反馈(隐式和显式)以及关于查询的上下文信息。 预测模式被应用于基于上下文的用户行为数据,以便产生预测的用户满意度数据。 数据挖掘技术可用于创建和改进一个或多个预测模式。 预测的用户满意度数据可以用于通过显示器报告具有共享特性和次标准用户满意度的任何查询的性能或标识来监视或改进搜索机制的性能。 动态改进的搜索机制使用预测的用户满意度数据来提高搜索机制的性能。

    IDENTIFYING CHANGES FOR ONLINE DOCUMENTS
    7.
    发明申请
    IDENTIFYING CHANGES FOR ONLINE DOCUMENTS 有权
    识别在线文档的更改

    公开(公告)号:US20100318892A1

    公开(公告)日:2010-12-16

    申请号:US12484607

    申请日:2009-06-15

    IPC分类号: G06F17/00

    摘要: Techniques and systems are disclosed for providing changed content identification for an online document that is accessed by a user or user agent. A reference point for an online document that a user or user agent is interested in accessing is identified, comprising a stored prior version of the document. The prior version of the document is retrieved, when the user or user agent accesses the online document, such as by using the reference point. Elements of the prior version are compared with elements of a current version of the document, to determine whether there are differences between the versions. If changes are identified between the prior version and the current version, the current version is automatically updated with visual or auditory representations that identify those changes of content.

    摘要翻译: 公开了用于为由用户或用户代理访问的在线文档提供改变的内容标识的技术和系统。 识别用户或用户代理感兴趣访问的在线文档的参考点,包括文档的存储的先前版本。 当用户或用户代理访问在线文档时,例如通过使用参考点,检索文档的先前版本。 将先前版本的元素与文档的当前版本的元素进行比较,以确定版本之间是否存在差异。 如果在先前版本和当前版本之间识别到更改,则会自动更新当前版本,以便识别内容更改的视觉或听觉表示。

    Wave lens systems and methods for search results
    8.
    发明申请
    Wave lens systems and methods for search results 有权
    波形透镜系统和搜索结果的方法

    公开(公告)号:US20050216859A1

    公开(公告)日:2005-09-29

    申请号:US10809172

    申请日:2004-03-25

    CPC分类号: G06F3/0481

    摘要: The present invention relates to a system and methodology for dynamic presentation of search result information within a selected area of a display. In one aspect, a computerized interface for data presentation is provided. The system includes a lens component associated with a portion of a user interface display, wherein the lens component defines an area to display information from at least one search result. A layout component displays a detailed subset of information within the lens component based upon the search result.

    摘要翻译: 本发明涉及用于在显示器的选定区域内动态呈现搜索结果信息的系统和方法。 一方面,提供用于数据呈现的计算机化接口。 该系统包括与用户界面显示器的一部分相关联的透镜部件,其中透镜部件限定从至少一个搜索结果显示信息的区域。 布局组件根据搜索结果显示镜头组件内的详细信息子集。

    Identifying changes for online documents
    9.
    发明授权
    Identifying changes for online documents 有权
    识别在线文档的更改

    公开(公告)号:US09330191B2

    公开(公告)日:2016-05-03

    申请号:US12484607

    申请日:2009-06-15

    IPC分类号: G06F17/00 G06F17/30

    摘要: Techniques and systems are disclosed for providing changed content identification for an online document that is accessed by a user or user agent. A reference point for an online document that a user or user agent is interested in accessing is identified, comprising a stored prior version of the document. The prior version of the document is retrieved, when the user or user agent accesses the online document, such as by using the reference point. Elements of the prior version are compared with elements of a current version of the document, to determine whether there are differences between the versions. If changes are identified between the prior version and the current version, the current version is automatically updated with visual or auditory representations that identify those changes of content.

    摘要翻译: 公开了用于为由用户或用户代理访问的在线文档提供改变的内容标识的技术和系统。 识别用户或用户代理感兴趣访问的在线文档的参考点,包括文档的存储的先前版本。 当用户或用户代理访问在线文档时,例如通过使用参考点,检索文档的先前版本。 将先前版本的元素与文档的当前版本的元素进行比较,以确定版本之间是否存在差异。 如果在先前版本和当前版本之间识别到更改,则会自动更新当前版本,以便识别内容更改的视觉或听觉表示。

    Search engine user interface
    10.
    发明申请
    Search engine user interface 有权
    搜索引擎用户界面

    公开(公告)号:US20070005576A1

    公开(公告)日:2007-01-04

    申请号:US11172365

    申请日:2005-06-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3097 G06F17/30979

    摘要: A search engine user interface that reduces the need for explicit search rules; dynamically responds as user input is entered to give immediate feedback to a user; is not limited to searching data residing in a single store; and may be used with a plurality of search engines, is provided. The search engine user interface provides search functions for a plurality of types of file metadata and types of file content. The search engine user interface provides an active query box, query editing, word-wheeling, and query narrowing and broadening. The user interface provides accordion behavior for visual elements of the user interface, integrated custom tagging, multiple independent search parameters, and filtering and integrated custom tagging in a common file dialog box.

    摘要翻译: 搜索引擎用户界面,减少了对显式搜索规则的需求; 当输入用户输入时动态响应,以立即向用户提供反馈; 不限于搜索驻留在单个商店中的数据; 并且可以与多个搜索引擎一起使用。 搜索引擎用户界面为多种类型的文件元数据和文件内容的类型提供搜索功能。 搜索引擎用户界面提供了一个活动的查询框,查询编辑,单词轮询和查询缩小和扩展。 用户界面为用户界面的视觉元素,集成自定义标签,多个独立搜索参数以及在通用文件对话框中过滤和集成自定义标记提供手风琴行为。