SYSTEMS AND METHODS FOR PERSONAL UBIQUITOUS INFORMATION RETRIEVAL AND REUSE
    1.
    Invention application
    Status: Pending (published)

    Publication No.: US20070112742A1

    Publication date: 2007-05-17

    Application No.: US11619949

    Filing date: 2007-01-04

    IPC classification: G06F17/30

    Abstract: The present invention relates to systems and methods providing content-access-based information retrieval. Information items from a plurality of disparate information sources that have been previously accessed or considered are automatically indexed in a data store, whereby a multifaceted user interface is provided to efficiently retrieve the items in a cognitively relevant manner. Various display output arrangements are possible for the retrieved information items including timeline visualizations and multidimensional grid visualizations. Input options include explicit, implicit, and standing queries for retrieving data along with explicit and implicit tagging of items for ease of recall and retrieval. In one aspect, an automated system is provided that facilitates concurrent searching across a plurality of information sources. A usage analyzer determines user-accessed items and a content analyzer stores subsets of data corresponding to the items, wherein at least two of the items are associated with disparate information sources, respectively. An automated indexing component indexes the data subsets according to past data access patterns as determined by the usage analyzer. A search component responds to a search query, initiates a search across the indexed data, and outputs links to locations of a subset and/or provides sparse representations of the subset.

    Systems and methods for constructing and using models of memorability in computing and communications applications
    2.
    Invention application
    Status: Pending (published)

    Publication No.: US20060190440A1

    Publication date: 2006-08-24

    Application No.: US11348096

    Filing date: 2006-02-06

    IPC classification: G06F17/30

    Abstract: One or more models of memorability are provided that facilitate various computer-based applications including those centering on the storage, retrieval, and processing of information, applications that remind people about items they risk not recalling or overlooking, and facilitating communications of reminders. In one application, the models are used to help compose and navigate large personal stores of information about a user's activities, communications, images, and other content. In another application, views of files in directories are extended with the addition of memory landmarks, and a means for controlling the number of landmarks provided via changing a threshold on inferred memorability. Another application centers on the use of models of memorability to select subsets of images from larger sets representing events, for display in a slide show or ambient photo display. In another application, a system is provided that facilitates computer-based searching for information by providing for the design and analysis of timeline visualizations in connection with displaying results to queries based at least in part on an index of content. A query is received by a query component (which can be part of a search engine that provides a unified index of information a user has been exposed to). The query component parses the query into portions relevant to effecting a meaningful search in accordance with the subject invention. The query component can access and populate a data store which may include information searched for. A landmark component receives and/or accesses information from the query component as well as the data store, and anchors public and/or personal landmark events to search results-related information.
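
The abstract mentions controlling the number of landmarks by changing a threshold on inferred memorability. A minimal sketch of that idea follows; the event records, the `memorability` scores, and the function name `visible_landmarks` are hypothetical illustrations and are not taken from the application.

```python
def visible_landmarks(events, threshold):
    """Keep events whose inferred memorability meets the threshold, in date order.

    `events` is a list of dicts with hypothetical keys "name", "date"
    (ISO 8601 string) and "memorability" (a score in [0, 1] that some
    model is assumed to have inferred already).
    """
    kept = [e for e in events if e["memorability"] >= threshold]
    return sorted(kept, key=lambda e: e["date"])


# Made-up example data for illustration only.
events = [
    {"name": "Team offsite", "date": "2005-03-14", "memorability": 0.9},
    {"name": "Weekly status mail", "date": "2005-03-16", "memorability": 0.2},
    {"name": "Product launch", "date": "2005-04-02", "memorability": 0.7},
]

# Raising the threshold shows fewer landmarks; lowering it shows more.
few = [e["name"] for e in visible_landmarks(events, 0.8)]
many = [e["name"] for e in visible_landmarks(events, 0.1)]
```

In this sketch the threshold acts as a single control: a user interface could bind it to a slider so that the view gains or loses landmarks as it moves.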

    Systems and methods for constructing and using models of memorability in computing and communications applications

    Publication No.: US20060129606A1

    Publication date: 2006-06-15

    Application No.: US11348018

    Filing date: 2006-02-06

    IPC classification: G06F17/00

    Abstract: One or more models of memorability are provided that facilitate various computer-based applications including those centering on the storage, retrieval, and processing of information, applications that remind people about items they risk not recalling or overlooking, and facilitating communications of reminders. In one application, the models are used to help compose and navigate large personal stores of information about a user's activities, communications, images, and other content. In another application, views of files in directories are extended with the addition of memory landmarks, and a means for controlling the number of landmarks provided via changing a threshold on inferred memorability. Another application centers on the use of models of memorability to select subsets of images from larger sets representing events, for display in a slide show or ambient photo display. In another application, a system is provided that facilitates computer-based searching for information by providing for the design and analysis of timeline visualizations in connection with displaying results to queries based at least in part on an index of content. A query is received by a query component (which can be part of a search engine that provides a unified index of information a user has been exposed to). The query component parses the query into portions relevant to effecting a meaningful search in accordance with the subject invention. The query component can access and populate a data store which may include information searched for. A landmark component receives and/or accesses information from the query component as well as the data store, and anchors public and/or personal landmark events to search results-related information.

    Search engine user interface
    4.
    Invention application
    Status: Granted

    Publication No.: US20070005576A1

    Publication date: 2007-01-04

    Application No.: US11172365

    Filing date: 2005-06-29

    IPC classification: G06F17/30

    CPC classification: G06F17/3097 G06F17/30979

    Abstract: A search engine user interface that reduces the need for explicit search rules; dynamically responds as user input is entered to give immediate feedback to a user; is not limited to searching data residing in a single store; and may be used with a plurality of search engines, is provided. The search engine user interface provides search functions for a plurality of types of file metadata and types of file content. The search engine user interface provides an active query box, query editing, word-wheeling, and query narrowing and broadening. The user interface provides accordion behavior for visual elements of the user interface, integrated custom tagging, multiple independent search parameters, and filtering and integrated custom tagging in a common file dialog box.

    Systems, methods, and interfaces for providing personalized search and information access
    5.
    Invention application
    Status: Pending (published)

    Publication No.: US20060074883A1

    Publication date: 2006-04-06

    Application No.: US10958560

    Filing date: 2004-10-05

    IPC classification: G06F17/30

    CPC classification: G06F16/9535

    Abstract: The present invention relates to systems and methods that employ user models to personalize generalized queries and/or search results according to information that is relevant to respective user characteristics. A system is provided that facilitates generating personalized searches of information. The system includes a user model to determine characteristics of a user. The user model may be assembled automatically via an analysis of a user's content, activities, and overall context. A personalization component automatically modifies queries and/or search results in view of the user model in order to personalize information searches for the user. A user interface receives the queries and displays the search results from one or more local and/or remote search engines, wherein the interface can be adjusted in a range from more personalized searches to more generalized searches.
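
The adjustable range from generalized to personalized search described in the abstract can be pictured as a blend of two scores. The sketch below is one possible reading, not the method claimed in the application: the linear blend, the topic-keyed `user_interest` dictionary standing in for the user model, and the name `rerank` are all assumptions made for illustration.

```python
def rerank(results, user_interest, personalization):
    """Re-order search results by blending a general score with a user-model score.

    results:         list of (doc_id, general_score, topic) tuples (hypothetical shape)
    user_interest:   dict mapping topic -> interest weight in [0, 1]; a stand-in
                     for the user model
    personalization: 0.0 = fully generalized ranking, 1.0 = fully personalized
    """
    if not 0.0 <= personalization <= 1.0:
        raise ValueError("personalization must be between 0 and 1")

    def blended(result):
        _, general_score, topic = result
        personal_score = user_interest.get(topic, 0.0)
        return (1.0 - personalization) * general_score + personalization * personal_score

    return [doc_id for doc_id, _, _ in sorted(results, key=blended, reverse=True)]


# Made-up example data for illustration only.
results = [("a", 0.9, "cars"), ("b", 0.6, "cooking")]
user_interest = {"cooking": 1.0}

generalized = rerank(results, user_interest, 0.0)
personalized = rerank(results, user_interest, 1.0)
```

With the control at 0.0 the engine's own ordering is kept; moving it toward 1.0 lets the user model dominate the ordering.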

    Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora
    6.
    Invention application
    Status: Granted

    Publication No.: US20050033711A1

    Publication date: 2005-02-10

    Application No.: US10635274

    Filing date: 2003-08-06

    Abstract: The present invention relates to a system and methodology to facilitate extraction of information from large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions of the ultimate quality of answers generated by the system provided by Bayesian or other statistical models. Such predictions, when coupled with a utility model, can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, information extraction actions can be taken with the highest expected utility. In this manner, the accuracy of answers to questions can be balanced with the cost of information extraction and analysis to compose the answers.
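
The trade-off between answer quality and query cost can be illustrated with a small expected-utility calculation. This is a sketch under stated assumptions, not the application's procedure: it assumes a model has already predicted answer quality for each possible number of queries, that cost grows linearly with the number of queries, and that utility is simply quality minus cost. The name `choose_query_count` is hypothetical.

```python
def choose_query_count(predicted_quality, cost_per_query):
    """Pick the number of queries that maximizes predicted quality minus total cost.

    predicted_quality[n] is the (assumed, model-supplied) expected answer
    quality after issuing n queries; index 0 means issuing no queries.
    Returns (best_n, best_net_utility).
    """
    best_n, best_net = 0, predicted_quality[0]
    for n, quality in enumerate(predicted_quality):
        net = quality - n * cost_per_query
        if net > best_net:
            best_n, best_net = n, net
    return best_n, best_net


# Made-up quality curve with diminishing returns.
predicted_quality = [0.0, 0.5, 0.7, 0.78, 0.8]

n_expensive, _ = choose_query_count(predicted_quality, 0.1)
n_cheap, _ = choose_query_count(predicted_quality, 0.01)
```

When queries are expensive the sketch stops early, because the gain from each further query no longer covers its cost; when queries are cheap it keeps issuing them.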

    Sensing, storing, indexing, and retrieving data leveraging measures of user activity, attention, and interest
    7.
    Invention application
    Status: Granted

    Publication No.: US20070016553A1

    Publication date: 2007-01-18

    Application No.: US11172121

    Filing date: 2005-06-29

    IPC classification: G06F17/30 G06F7/00

    Abstract: Various components and processes are provided to enable data processing on multiple data types where aspects of the history of user activity, attention, interest, location, or other interaction with data are determined and employed to enhance information storage and access. In one particular aspect, a data manipulation system is provided. The system includes one or more data items that are associated with one or more tags and indicate at least one user's interaction or activity with the data items. A manipulation tool processes the data items to determine a subset of data items based at least in part on the user's interaction with the data items. Methods are described for using the manipulation tool to weight terms in an index, to compress indexes, to influence the rank of items returned in a search, to generate additional queries for data items either automatically or with user direction, or for improved presentation of data items.

    Principles and methods for personalizing newsfeeds via an analysis of information novelty and dynamics
    8.
    Invention application
    Status: Lapsed

    Publication No.: US20050198056A1

    Publication date: 2005-09-08

    Application No.: US10827729

    Filing date: 2004-04-20

    IPC classification: G06F17/30 G06F17/00

    Abstract: A system and methodology are provided for filtering temporal streams of information such as news stories by statistical measures of information novelty. Various techniques can be applied to custom-tailor news feeds or other types of information based on information that a user has already reviewed. Methods for analyzing information novelty are provided along with a system that personalizes and filters information for users by identifying the novelty of stories in the context of stories they have already reviewed. The system employs novelty-analysis algorithms that represent articles as a bag of words and named entities. The algorithms analyze inter- and intra-document dynamics by considering how information evolves over time from article to article, as well as within individual articles.
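
A bag-of-words novelty measure of the kind the abstract alludes to can be sketched briefly. The specific score below (the fraction of a candidate article's word occurrences not seen in anything the user has read) is an assumption chosen for simplicity; the application's own novelty statistics are not given in this listing, and named-entity extraction is omitted. The function names are hypothetical.

```python
import re
from collections import Counter


def bag_of_words(text):
    """Lowercased word counts; a simplified stand-in for words plus named entities."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def novelty(candidate, already_read):
    """Fraction of the candidate's word occurrences absent from all read articles."""
    seen = set()
    for article in already_read:
        seen.update(bag_of_words(article))
    words = bag_of_words(candidate)
    total = sum(words.values())
    if total == 0:
        return 0.0
    unseen = sum(count for word, count in words.items() if word not in seen)
    return unseen / total


def rank_by_novelty(candidates, already_read):
    """Order candidate articles from most to least novel for this reader."""
    return sorted(candidates, key=lambda c: novelty(c, already_read), reverse=True)


# Made-up headlines for illustration only.
read = ["storm hits coast"]
feed = rank_by_novelty(["storm hits coast", "quake strikes city", "storm hits city"], read)
```

A filter built on this sketch would drop or demote articles whose score falls below some cutoff, so that a reader is not shown stories that repeat what they already reviewed.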

    Analysis of topic dynamics of web search
    9.
    Invention application
    Status: Pending (published)

    Publication No.: US20070005646A1

    Publication date: 2007-01-04

    Application No.: US11171123

    Filing date: 2005-06-30

    IPC classification: G06F17/00

    CPC classification: G06F16/9535 G06F2216/03

    Abstract: The subject invention relates to probabilistic models that are trained from transitions among various topics of pages visited by a sample population of search users. In one aspect, probabilistic models of topic transitions are learned for individual users and groups of users. Topic transitions for individuals versus larger groups are analyzed, wherein the relative accuracies of personal models of topic dynamics with models constructed from sets of pages drawn from similar groups and from a larger population of users are compared. To exploit temporal dynamics, the accuracy of these models is tested for predicting transitions in topics of visits at increasingly more distant times in the future. The models can be applied to search topic dynamics of tagged pages, and then utilized to predict topics of subsequent pages visited by users.
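
A probabilistic model of topic transitions can be illustrated with a first-order Markov chain estimated from counts. This is a sketch under assumptions: the application does not state in this listing which model family it uses, and the fallback from a personal model to a group model is added here only to illustrate the personal-versus-group comparison. The function names are hypothetical.

```python
from collections import Counter, defaultdict


def train_transitions(topic_sequences):
    """Estimate P(next topic | current topic) from sequences of visited-page topics."""
    counts = defaultdict(Counter)
    for sequence in topic_sequences:
        for current, following in zip(sequence, sequence[1:]):
            counts[current][following] += 1
    model = {}
    for current, following_counts in counts.items():
        total = sum(following_counts.values())
        model[current] = {t: c / total for t, c in following_counts.items()}
    return model


def predict_next(model, current, fallback=None):
    """Most probable next topic; consult the fallback model if `current` is unseen."""
    options = model.get(current)
    if not options and fallback is not None:
        options = fallback.get(current)
    if not options:
        return None
    return max(options, key=options.get)


# Made-up browsing histories for illustration only.
personal = train_transitions([["news", "sports", "news", "sports"], ["news", "health"]])
group = train_transitions([["health", "news"], ["health", "news", "sports"]])

next_topic = predict_next(personal, "news")
backed_off = predict_next(personal, "health", fallback=group)
```

Comparing how often the personal model and the group model each predict the true next topic on held-out visits would give the kind of relative-accuracy comparison the abstract describes.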

    COST-BENEFIT APPROACH TO AUTOMATICALLY COMPOSING ANSWERS TO QUESTIONS BY EXTRACTING INFORMATION FROM LARGE UNSTRUCTURED CORPORA

    Publication No.: US20060294037A1

    Publication date: 2006-12-28

    Application No.: US11469136

    Filing date: 2006-08-31

    IPC classification: G06N5/02 G06F17/00

    Abstract: The present invention relates to a system and methodology to facilitate extraction of information from large unstructured corpora such as from the World Wide Web and/or other unstructured sources. Information in the form of answers to questions can be automatically composed from such sources via probabilistic models and cost-benefit analyses to guide resource-intensive information-extraction procedures employed by a knowledge-based question answering system. The analyses can leverage predictions of the ultimate quality of answers generated by the system provided by Bayesian or other statistical models. Such predictions, when coupled with a utility model, can provide the system with the ability to make decisions about the number of queries issued to a search engine (or engines), given the cost of queries and the expected value of query results in refining an ultimate answer. Given a preference model, information extraction actions can be taken with the highest expected utility. In this manner, the accuracy of answers to questions can be balanced with the cost of information extraction and analysis to compose the answers.