专利检索 ap:("Fredrick Baden Holt" OR "Anne Shu-Wan Kao" OR "Daniel John Pierce" OR "Stephen Robert Poteet" OR "Yuan-Jye Wu") AND inv:"Daniel John Pierce" 第 1 页

1.

发明授权
Methods, apparatus and computer program products for information retrieval and document classification utilizing a multidimensional subspace 有权
标题翻译：用于信息检索和利用多维子空间的文档分类的方法，装置和计算机程序产品

公开(公告)号：US06701305B1

公开(公告)日：2004-03-02

申请号：US09693114

申请日：2000-10-20

申请人： Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu

发明人： Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu

IPC分类号： G06F1700

CPC分类号： G06F17/30675 , G06F17/30613 , G06F17/30616 , G06F17/30663 , G06F17/3069 , G06F17/30707

摘要： Methods, apparatus and computer program products are provided for retrieving information from a text data collection and for classifying a document into none, one or more of a plurality of predefined classes. In each aspect, a representation of at least a portion of the original matrix is projected into a lower dimensional subspace and those portions of the subspace representation that relate to the term(s) of the query are weighted following the projection into the lower dimensional subspace. In order to retrieve the documents that are most relevant with respect to a query, the documents are then scored with documents having better scores being of generally greater relevance. Alternatively, in order to classify a document, the relationship of the document to the classes of documents is scored with the document then being classified in those classes, if any, that have the best scores.

摘要翻译： 提供了方法，装置和计算机程序产品，用于从文本数据收集中检索信息，并将文档分类为多个预定类别中的一个或多个。在每个方面，原始矩阵的至少一部分的表示被投影到较低维子空间中，并且与查询的项相关的子空间表示的那些部分被加权后跟随投影到较低维子空间中。为了检索与查询最相关的文档，然后使用具有更好分数的文档具有更大的相关性的文档进行评分。或者，为了对文档进行分类，将文档与文档类的关系进行评分，然后将文档分类为具有最佳分数的那些类别（如果有的话）。

2.

发明授权
Method and system for text mining using multidimensional subspaces 有权
标题翻译：使用多维子空间进行文本挖掘的方法和系统

公开(公告)号：US06611825B1

公开(公告)日：2003-08-26

申请号：US09328888

申请日：1999-06-09

申请人： D. Dean Billheimer , Andrew James Booker , Michelle Keim Condliff , Mark Thomas Greaves , Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu

发明人： D. Dean Billheimer , Andrew James Booker , Michelle Keim Condliff , Mark Thomas Greaves , Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu

IPC分类号： G06N500

CPC分类号： G06F17/30616 , Y10S707/99931

摘要： A text mining program is provided that allows a user to perform text mining operations, such as: information retrieval, term and document visualization, term and document clustering, term and document classification, summarization of individual documents and groups of documents, and document cross-referencing. This is accomplished by representing the text of a document collection using subspace transformations. This subspace transformation representation is performed by: constructing a term frequency matrix of the term frequencies for each of the documents, transforming the term frequencies for statistical purposes, and projecting the documents or the terms into a lower dimensional subspace. As the document collection is updated, the subspace is dynamically updated to reflect the new document collection.

摘要翻译： 提供了一种文本挖掘程序，允许用户执行文本挖掘操作，例如：信息检索，术语和文档可视化，术语和文档聚类，术语和文档分类，单个文档和文档组的摘要，参考。这是通过使用子空间转换表示文档集合的文本来实现的。该子空间变换表示通过以下方式来执行：为每个文档构建术语频率的项频率矩阵，将用于统计目的的术语频率变换，以及将文档或术语投影到较低维子空间中。随着文档集合的更新，子空间将被动态更新以反映新的文档集合。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类