Prefetching and caching documents according to probability ranked need S
list
    1.
    发明授权
    Prefetching and caching documents according to probability ranked need S list 失效
    预取和缓存文件按概率排列需要S列表

    公开(公告)号:US6098064A

    公开(公告)日:2000-08-01

    申请号:US83645

    申请日:1998-05-22

    IPC分类号: G06F17/30 H04L29/06 H04L29/08

    摘要: A method is presented for determining whether to prefetch and cache documents on a computer. In one embodiment, documents are prefetched and cached on a client computer from servers located on the Internet in accordance with their computed need probability. Those document with a higher need probability are prefetched and cached before documents with lower need probabilities. The need probability for a document is computed using both a document context factor and a document history factor. The context factor of the need probability of a document is determined by computing the correlation between words in the document and a context Q of the operating environment. The history factor of the need probability of a document is determined by integrating both the recency of document use and the frequency of document use.

    摘要翻译: 提出了一种用于确定是否在计算机上预取和缓存文档的方法。 在一个实施例中,根据其计算出的需求概率,将文档从位于因特网上的服务器预取和缓存在客户端计算机上。 那些具有较高需求概率的文档在具有较低需求概率的文档之前被预取和缓存。 使用文档上下文因素和文档历史因子来计算文档的需求概率。 通过计算文档中的单词和操作环境的上下文Q之间的相关性来确定文档的需要概率的上下文因素。 通过整合文件使用的新近度和文档使用频率来确定文档的需求概率的历史因素。

    System And Method For Providing A Topic-Directed Search
    2.
    发明申请
    System And Method For Providing A Topic-Directed Search 有权
    提供主题搜索的系统和方法

    公开(公告)号:US20100057716A1

    公开(公告)日:2010-03-04

    申请号:US12354681

    申请日:2009-01-15

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/3071

    摘要: A system and method for providing a topic-directed search is provided, which advantageously harnesses user-provided topical indexes and an ability to characterize indexes according to how articles fall under their topical organizations. A corpus of articles and an index that includes topics from the articles is maintained. For each topic, a coarse-grained topic model is built, which includes the characteristic words included in the articles relating to the topic and scores assigned to the characteristic words. A search query is executed against the index. The topics that match the search terms are chosen by their scores. The topics that match the coarse-grained topic models and the articles corresponding to the search query are presented. In contrast to conventional search engines, search results are organized according to topic and search results can be offered across multiple indexes, where part of returned results are selected from most-relevant indexes with their most-relevant topics.

    摘要翻译: 提供了一种用于提供主题定向搜索的系统和方法,其有利地利用用户提供的主题索引和根据文章属于其主题组织的方式来表征索引的能力。 维护文章的语料库和包含文章主题的索引。 对于每个主题,构建了一个粗粒度主题模型,其中包括与主题相关的文章中包含的特征词以及分配给特征词的分数。 针对索引执行搜索查询。 与搜索词匹配的主题由他们的分数来选择。 介绍了与粗粒度主题模型匹配的主题以及与搜索查询相对应的文章。 与常规搜索引擎相比,搜索结果根据主题进行组织,搜索结果可以跨多个索引提供,其中返回结果的一部分从与其最相关的主题的最相关的索引中选择。

    Apparatus and methods for accessing a collection of content portions
    4.
    发明授权
    Apparatus and methods for accessing a collection of content portions 失效
    用于访问内容部分的集合的装置和方法

    公开(公告)号:US07028053B2

    公开(公告)日:2006-04-11

    申请号:US10248408

    申请日:2003-01-16

    IPC分类号: G06F17/30 G06F15/00

    摘要: Techniques for browsing through a large collection of content portions uses a scatter and gather approach where a collection of content portions is clustered into one or more clusters using multi-modal data modeling. Additionally, as part of the multi-modal data modeling, “proximal cues” surrounding links or connections or surrounding “image links” in a content portion are used to quickly identify the user's information needs. Thus, taking into account proximal cues during multi-modal data modeling improves the scattering and the gathering process, as well as personalizes the scattering and the gathering process to most effectively cluster content portions of interest to the user.

    摘要翻译: 用于浏览大量内容部分的技术使用分散和收集方法,其中使用多模态数据建模将内容部分的集合聚类成一个或多个集群。 另外,作为多模式数据建模的一部分,使用内容部分中的链接或连接或周围“图像链接”的“近端提示”来快速识别用户的信息需求。 因此,在多模态数据建模过程中,考虑到近端线索可以改善散射和收集过程,以及个性化散射和收集过程,以最有效地将感兴趣的内容部分聚类到用户。

    System and method for inferring user information need in hypermedia linked document collection
    5.
    发明授权
    System and method for inferring user information need in hypermedia linked document collection 有权
    用于推断用户信息的系统和方法需要超媒体链接的文档收集

    公开(公告)号:US07017110B1

    公开(公告)日:2006-03-21

    申请号:US09540063

    申请日:2000-03-31

    IPC分类号: G06F17/00

    摘要: The present invention provides a system and method for inferring information need in a collection of hypermedia documents that is based on the observation that a user's hypertext link traversal decisions are typically based on the nature of that user's information need. The system identifies the hypermedia linkage structure among the plurality of documents in the collection. The documents include content items that may be relevant to a user information need. The system then accepts a user path item that represents a user's hypermedia link traversal history and applies a network flow model to the user path item in the hypermedia link information in order to create a document vector. The system also determines the distribution of the content items in the document collection, and then compares the document vector to the content item distribution in order to determine an inferred information need.

    摘要翻译: 本发明提供了一种用于在超媒体文档的集合中推断信息需求的系统和方法,其基于用户的超文本链接遍历决定通常基于该用户的信息需要的性质的观察。 系统识别集合中的多个文档之间的超媒体联动结构。 这些文件包括可能与用户信息需要相关的内容项。 然后,系统接受表示用户的超媒体链接遍历历史的用户路径项目,并将网络流模型应用于超媒体链接信息中的用户路径项目,以便创建文档向量。 系统还确定文档收集中的内容项目的分布,然后将文档向量与内容项目分布进行比较,以确定推断的信息需求。

    System and method for identifying users relevant to a topic of interest
    6.
    发明授权
    System and method for identifying users relevant to a topic of interest 有权
    用于识别与感兴趣的主题相关的用户的系统和方法

    公开(公告)号:US08275769B1

    公开(公告)日:2012-09-25

    申请号:US13087308

    申请日:2011-04-14

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/101

    摘要: A system and method for identifying users relevant to a topic of interest is provided. A query comprising one or more topics is executed against a corpus of messages. Voting users associated with the messages matching the query are identified. A set of candidate users comprising users connected to the voting users is generated. A relevancy score is computed for each candidate user. The candidate users are ranked by their respective relevancy score.

    摘要翻译: 提供了一种用于识别与感兴趣的主题相关的用户的系统和方法。 针对消息语料库执行包括一个或多个主题的查询。 识别与匹配查询的消息相关联的投票用户。 生成包括连接到投票用户的用户的一组候选用户。 为每个候选用户计算相关性分数。 候选人按照各自的相关分数进行排名。

    System and method for identifying similarities among objects in a collection
    7.
    发明授权
    System and method for identifying similarities among objects in a collection 失效
    用于识别集合中的对象之间的相似性的系统和方法

    公开(公告)号:US06941321B2

    公开(公告)日:2005-09-06

    申请号:US09421767

    申请日:1999-10-19

    IPC分类号: G06F17/30

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System and method for quantitatively representing data objects in vector space
    8.
    发明授权
    System and method for quantitatively representing data objects in vector space 有权
    用于定量表示向量空间中的数据对象的系统和方法

    公开(公告)号:US06922699B2

    公开(公告)日:2005-07-26

    申请号:US09421416

    申请日:1999-10-19

    IPC分类号: G06F17/30

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System for ranking search results from a collection of documents using spreading activation techniques
    10.
    发明授权
    System for ranking search results from a collection of documents using spreading activation techniques 有权
    用于使用扩展激活技术从文档集合中搜索结果排序的系统

    公开(公告)号:US06272507B1

    公开(公告)日:2001-08-07

    申请号:US09163595

    申请日:1998-09-29

    IPC分类号: G06F1500

    摘要: A system and method for ranking the results of a search on a collection of linked documents. Documents found on the Web are typically referred to as Web pages. The system utilizes various information relating to the collection of linked documents, including the topology, content and historical usage of the linked collections of documents. The ranking is based on historical patterns and information about the current context of interest (e.g. what the user or group seems to be currently interested in doing). A spreading activation technique is used to identify the frequency of activation of the documents in the search results. Spreading activation techniques are based on representations of Web pages as nodes in graph networks representing usage, content, and hypertext relations among Web pages. After performing the spreading activation based on an initial set defined by the search results, each document from the results may be ranked based on their level of activation.

    摘要翻译: 一种用于对链接文档集合进行搜索的结果进行排名的系统和方法。 在Web上找到的文档通常被称为网页。 该系统利用与链接文件的收集有关的各种信息,包括链接的文档集合的拓扑,内容和历史使用。 排名是基于历史模式和关于当前感兴趣的上下文的信息(例如用户或组似乎目前感兴趣的)。 扩展激活技术用于识别搜索结果中文档的激活频率。 扩展激活技术基于网页的表示,作为表示网页中的使用,内容和超文本关系的图形网络中的节点。 在基于由搜索结果定义的初始集合执行扩展激活之后,来自结果的每个文档可以基于其激活水平进行排名。