System and method for inferring user information need in hypermedia linked document collection
    1.
    发明授权
    System and method for inferring user information need in hypermedia linked document collection 有权
    用于推断用户信息的系统和方法需要超媒体链接的文档收集

    公开(公告)号:US07017110B1

    公开(公告)日:2006-03-21

    申请号:US09540063

    申请日:2000-03-31

    IPC分类号: G06F17/00

    摘要: The present invention provides a system and method for inferring information need in a collection of hypermedia documents that is based on the observation that a user's hypertext link traversal decisions are typically based on the nature of that user's information need. The system identifies the hypermedia linkage structure among the plurality of documents in the collection. The documents include content items that may be relevant to a user information need. The system then accepts a user path item that represents a user's hypermedia link traversal history and applies a network flow model to the user path item in the hypermedia link information in order to create a document vector. The system also determines the distribution of the content items in the document collection, and then compares the document vector to the content item distribution in order to determine an inferred information need.

    摘要翻译: 本发明提供了一种用于在超媒体文档的集合中推断信息需求的系统和方法,其基于用户的超文本链接遍历决定通常基于该用户的信息需要的性质的观察。 系统识别集合中的多个文档之间的超媒体联动结构。 这些文件包括可能与用户信息需要相关的内容项。 然后,系统接受表示用户的超媒体链接遍历历史的用户路径项目,并将网络流模型应用于超媒体链接信息中的用户路径项目,以便创建文档向量。 系统还确定文档收集中的内容项目的分布,然后将文档向量与内容项目分布进行比较,以确定推断的信息需求。

    System and method for identifying similarities among objects in a collection
    2.
    发明授权
    System and method for identifying similarities among objects in a collection 失效
    用于识别集合中的对象之间的相似性的系统和方法

    公开(公告)号:US06941321B2

    公开(公告)日:2005-09-06

    申请号:US09421767

    申请日:1999-10-19

    IPC分类号: G06F17/30

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System and method for quantitatively representing data objects in vector space
    3.
    发明授权
    System and method for quantitatively representing data objects in vector space 有权
    用于定量表示向量空间中的数据对象的系统和方法

    公开(公告)号:US06922699B2

    公开(公告)日:2005-07-26

    申请号:US09421416

    申请日:1999-10-19

    IPC分类号: G06F17/30

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System and method for predicting web user flow by determining association strength of hypermedia links
    4.
    发明授权
    System and method for predicting web user flow by determining association strength of hypermedia links 有权
    通过确定超媒体链接的关联强度来预测网络用户流的系统和方法

    公开(公告)号:US06671711B1

    公开(公告)日:2003-12-30

    申请号:US09540976

    申请日:2000-03-31

    IPC分类号: G06F900

    摘要: The present invention also provides a system and method for predicting user traffic flow in a collection of hypermedia documents by determining the association strength of the hypermedia links. Hypermedia links are identified among a plurality of documents, where the documents include content items such as keywords that may or may not be relevant to a user information need. The distribution of the content items in the document collection is then determined. An information item is received as input, and is compared to the content items. In response to the comparison, association strengths are assigned to the hypermedia links. A network flow model uses the association strengths of the hypermedia links to predict user traffic flow in response to an initial condition.

    摘要翻译: 本发明还提供了一种用于通过确定超媒体链路的关联强度来预测超媒体文档集合中的用户业务流的系统和方法。 在多个文档中标识超媒体链接,其中文档包括诸如关键字之类的内容项目,这些关键字可能与用户信息需要相关也可能不相关。 然后确定文档集合中的内容项目的分发。 作为输入接收信息项,并与内容项进行比较。 响应于比较,将关联强度分配给超媒体链接。 网络流模型使用超媒体链路的关联强度来响应于初始条件来预测用户业务流。

    System and method for clustering data objects in a collection
    5.
    发明授权
    System and method for clustering data objects in a collection 有权
    集合中数据对象集群的系统和方法

    公开(公告)号:US06598054B2

    公开(公告)日:2003-07-22

    申请号:US09425039

    申请日:1999-10-19

    IPC分类号: G06F1700

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System and method for providing recommendations based on multi-modal user clusters
    6.
    发明授权
    System and method for providing recommendations based on multi-modal user clusters 有权
    基于多模态用户群提供建议的系统和方法

    公开(公告)号:US06567797B1

    公开(公告)日:2003-05-20

    申请号:US09425038

    申请日:1999-10-19

    IPC分类号: G06F700

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    System and method for visually representing the contents of a multiple data object cluster
    7.
    发明授权
    System and method for visually representing the contents of a multiple data object cluster 有权
    用于可视地表示多数据对象集群的内容的系统和方法

    公开(公告)号:US06564202B1

    公开(公告)日:2003-05-13

    申请号:US09421419

    申请日:1999-10-19

    IPC分类号: G06F1730

    CPC分类号: G06F17/3061 Y10S707/99932

    摘要: A system and method for browsing, retrieving, and recommending information from a collection uses multi-modal features of the documents in the collection, as well as an analysis of users' prior browsing and retrieval behavior. The system and method are premised on various disclosed methods for quantitatively representing documents in a document collection as vectors in multi-dimensional vector spaces, quantitatively determining similarity between documents, and clustering documents according to those similarities. The system and method also rely on methods for quantitatively representing users in a user population, quantitatively determining similarity between users, clustering users according to those similarities, and visually representing clusters of users by analogy to clusters of documents.

    摘要翻译: 用于从集合中浏览,检索和推荐信息的系统和方法使用集合中的文档的多模态特征,以及用户先前浏览和检索行为的分析。 系统和方法以各种公开的方法为前提,用于定量表示文档集合中的文档,作为多维向量空间中的向量,定量地确定文档之间的相似性,并根据这些相似性对文档进行聚类。 系统和方法还依赖于定量表示用户群体中的用户的方法,定量地确定用户之间的相似性,根据这些相似性对用户进行聚类,并且通过类似于文档集合直观地表示用户群。

    Prefetching and caching documents according to probability ranked need S
list
    8.
    发明授权
    Prefetching and caching documents according to probability ranked need S list 失效
    预取和缓存文件按概率排列需要S列表

    公开(公告)号:US6098064A

    公开(公告)日:2000-08-01

    申请号:US83645

    申请日:1998-05-22

    IPC分类号: G06F17/30 H04L29/06 H04L29/08

    摘要: A method is presented for determining whether to prefetch and cache documents on a computer. In one embodiment, documents are prefetched and cached on a client computer from servers located on the Internet in accordance with their computed need probability. Those document with a higher need probability are prefetched and cached before documents with lower need probabilities. The need probability for a document is computed using both a document context factor and a document history factor. The context factor of the need probability of a document is determined by computing the correlation between words in the document and a context Q of the operating environment. The history factor of the need probability of a document is determined by integrating both the recency of document use and the frequency of document use.

    摘要翻译: 提出了一种用于确定是否在计算机上预取和缓存文档的方法。 在一个实施例中,根据其计算出的需求概率,将文档从位于因特网上的服务器预取和缓存在客户端计算机上。 那些具有较高需求概率的文档在具有较低需求概率的文档之前被预取和缓存。 使用文档上下文因素和文档历史因子来计算文档的需求概率。 通过计算文档中的单词和操作环境的上下文Q之间的相关性来确定文档的需要概率的上下文因素。 通过整合文件使用的新近度和文档使用频率来确定文档的需求概率的历史因素。

    Method and apparatus for finding related documents in a collection of linked documents using a bibliographic coupling link analysis
    9.
    发明授权
    Method and apparatus for finding related documents in a collection of linked documents using a bibliographic coupling link analysis 有权
    使用书目耦合链接分析在链接文档的集合中查找相关文档的方法和装置

    公开(公告)号:US06182091B2

    公开(公告)日:2001-01-30

    申请号:US09407789

    申请日:1999-09-29

    IPC分类号: G06F1721

    摘要: A method and apparatus for identifying related documents in a collection of linked documents. In the method the link structure of documents to other documents are analyzed. By analyzing only the link structure, a process intensive content analysis of the documents is avoided. A citation analysis technique, such as bibliographic coupling analysis, is performed on the set of documents to extract link information. For bibliographic coupling analysis that information would include the number of other documents that a given pair of documents link to. By using the link information, related documents are identified using a suitable analysis technique, such as clustering or spreading activation.

    摘要翻译: 一种用于在链接文档的集合中识别相关文档的方法和装置。 在该方法中,分析了文件与其他文档的链接结构。 通过仅分析链接结构,避免了文档的过程密集内容分析。 对文献集执行引文分析技术,如书目耦合分析,以提取链接信息。 对于书目耦合分析,信息将包括给定的一对文档链接到的其他文档的数量。 通过使用链接信息,使用合适的分析技术(例如聚类或扩展激活)来识别相关文档。

    System for categorizing documents in a linked collection of documents
    10.
    发明授权
    System for categorizing documents in a linked collection of documents 失效
    用于对文档的链接集合中的文档进行分类的系统

    公开(公告)号:US5895470A

    公开(公告)日:1999-04-20

    申请号:US842926

    申请日:1997-04-09

    IPC分类号: G06F17/30

    摘要: A system for extracting and analyzing information from a collection of linked documents at a locality to enable categorization of documents and prediction of documents relevant to a focus document. The system obtains and analyzes topology, usage and path information from for a collection at a locality, e.g. a web locality on the world wide web. For categorization, document meta information is represented as document vectors. Predefined criteria is applied to the document vectors to create lists of "similar" types of documents. For relevance prediction, networks representing topology, usage path and text similarity amongst the documents in the collection are created. A spreading activation technique is applied to the networks starting at a focus document to predict the documents relevant to the focus document. Using category and relevance prediction information, tools can be built to enable a user to more efficiently traverse through the collection of linked documents.

    摘要翻译: 一种用于从一个地点的链接文档集合中提取和分析信息的系统,以便对文档进行分类和与焦点文档相关的文档的预测。 该系统从一个地点的集合中获取和分析拓扑,使用和路径信息,例如。 万维网上的网站。 对于分类,文档元信息被表示为文档向量。 将预定义的标准应用于文档向量以创建“类似”类型的文档的列表。 对于相关性预测,创建代表集合中的文档之间的拓扑,使用路径和文本相似性的网络。 传播激活技术应用于从焦点文档开始的网络,以预测与焦点文档相关的文档。 使用类别和相关性预测信息,可以构建工具以使用户能够更有效地遍历链接文档的集合。