-
Publication No.: US20080021924A1
Publication date: 2008-01-24
Application No.: US11489243
Filing date: 2006-07-18
IPC classification: G06F17/00
CPC classification: G06F17/30864 , Y10S707/99944
Abstract: Embodiments of the present invention are directed to acquiring information from the worldwide web, organizing information acquired from the worldwide web, and using the acquired and organized information to facilitate web-page searching, web-page browsing, and other worldwide-web-based activities. In one embodiment of the present invention, a database of concept objects is created from an initial set of semantic objects and from hyperlink information obtained from web pages by one or more web crawlers. The initial set of semantic objects is processed using hyperlink-based objects created by the web crawler. The processed semantic objects are then associated with additional hyperlink-based objects to create a concept-object database. In certain embodiments of the present invention, the concept-object database can be further refined and supplemented in an automated fashion by additional web crawling, subsequent association of hyperlink-based objects with concept objects, and creation of new concept objects, as well as by user input to, and editing of, the concept-object database. The concept-object database may be employed, in various embodiments of the present invention, to facilitate web browsing, web-page searching, and other worldwide-web-based activities.
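The association step described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the abstract does not say how hyperlink-based objects are matched to semantic objects, so this sketch simply matches anchor text to a semantic object's name, case-insensitively. The names `LinkCollector` and `build_concept_db` and the example URLs are hypothetical and do not come from the patent.

```python
from collections import defaultdict
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (anchor text, href) pairs from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside an <a href=...> element.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def build_concept_db(semantic_objects, pages):
    """Attach crawled link targets to semantic objects.

    Assumption: a link belongs to a semantic object when its anchor text
    equals the object's name, ignoring case. The patent may use any other rule.
    """
    by_text = defaultdict(set)
    for html in pages:
        collector = LinkCollector()
        collector.feed(html)
        for text, href in collector.links:
            by_text[text.lower()].add(href)
    return {name: sorted(by_text.get(name.lower(), ())) for name in semantic_objects}


# Hypothetical crawled pages and initial semantic objects.
pages = [
    '<p>See <a href="https://example.org/jaguar">Jaguar</a> and '
    '<a href="https://example.org/puma">puma</a>.</p>',
    '<a href="https://example.net/cats/jaguar">jaguar</a>',
]
concept_db = build_concept_db(["Jaguar", "Ocelot"], pages)
```

In this sketch, a semantic object with no matching links, such as "Ocelot", stays in the database with an empty list, so later crawls could still attach links to it.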
-
Publication No.: US20100250586A1
Publication date: 2010-09-30
Application No.: US12718774
Filing date: 2010-03-05
IPC classification: G06F17/30
CPC classification: G06F17/30864 , Y10S707/99944
Abstract: Embodiments of the present invention are directed to acquiring information from the worldwide web, organizing information acquired from the worldwide web, and using the acquired and organized information to facilitate web-page searching, web-page browsing, and other worldwide-web-based activities. In one embodiment of the present invention, a database of concept objects is created from an initial set of semantic objects and from hyperlink information obtained from web pages by one or more web crawlers. The initial set of semantic objects is processed using hyperlink-based objects created by the web crawler. The processed semantic objects are then associated with additional hyperlink-based objects to create a concept-object database. In certain embodiments of the present invention, the concept-object database can be further refined and supplemented in an automated fashion by additional web crawling, subsequent association of hyperlink-based objects with concept objects, and creation of new concept objects, as well as by user input to, and editing of, the concept-object database. The concept-object database may be employed, in various embodiments of the present invention, to facilitate web browsing, web-page searching, and other worldwide-web-based activities.
-
Publication No.: US07707161B2
Publication date: 2010-04-27
Application No.: US11489243
Filing date: 2006-07-18
IPC classification: G06F7/00
CPC classification: G06F17/30864 , Y10S707/99944
Abstract: Embodiments of the present invention are directed to acquiring information from the worldwide web, organizing information acquired from the worldwide web, and using the acquired and organized information to facilitate web-page searching, web-page browsing, and other worldwide-web-based activities. In one embodiment of the present invention, a database of concept objects is created from an initial set of semantic objects and from hyperlink information obtained from web pages by one or more web crawlers. The initial set of semantic objects is processed using hyperlink-based objects created by the web crawler. The processed semantic objects are then associated with additional hyperlink-based objects to create a concept-object database. In certain embodiments of the present invention, the concept-object database can be further refined and supplemented in an automated fashion by additional web crawling, subsequent association of hyperlink-based objects with concept objects, and creation of new concept objects, as well as by user input to, and editing of, the concept-object database. The concept-object database may be employed, in various embodiments of the present invention, to facilitate web browsing, web-page searching, and other worldwide-web-based activities.
-
Publication No.: US06611825B1
Publication date: 2003-08-26
Application No.: US09328888
Filing date: 1999-06-09
Applicant: D. Dean Billheimer , Andrew James Booker , Michelle Keim Condliff , Mark Thomas Greaves , Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu
Inventor: D. Dean Billheimer , Andrew James Booker , Michelle Keim Condliff , Mark Thomas Greaves , Fredrick Baden Holt , Anne Shu-Wan Kao , Daniel John Pierce , Stephen Robert Poteet , Yuan-Jye Wu
IPC classification: G06N5/00
CPC classification: G06F17/30616 , Y10S707/99931
Abstract: A text mining program is provided that allows a user to perform text mining operations such as information retrieval, term and document visualization, term and document clustering, term and document classification, summarization of individual documents and groups of documents, and document cross-referencing. This is accomplished by representing the text of a document collection using subspace transformations. The subspace transformation representation is produced by constructing a term frequency matrix of the term frequencies for each of the documents, transforming the term frequencies for statistical purposes, and projecting the documents or the terms into a lower dimensional subspace. As the document collection is updated, the subspace is dynamically updated to reflect the new document collection.
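The three steps named in the abstract (build a term frequency matrix, transform the frequencies, project into a lower dimensional subspace) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the abstract does not specify the transformation or the decomposition, so the sketch substitutes a log(1 + count) damping and a truncated singular value decomposition from NumPy. The function names and the sample documents are hypothetical, and the dynamic subspace update mentioned in the abstract is not shown.

```python
import numpy as np


def term_frequency_matrix(docs):
    """Return (sorted vocabulary, terms-by-documents count matrix)."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    tf = np.zeros((len(vocab), len(docs)))
    for j, d in enumerate(docs):
        for w in d.lower().split():
            tf[index[w], j] += 1
    return vocab, tf


def project_documents(tf, k):
    """Dampen counts, then project each document onto a k-dimensional subspace.

    Assumptions: log(1 + count) stands in for the unspecified statistical
    transformation, and a truncated SVD stands in for the unspecified
    subspace decomposition.
    """
    weighted = np.log1p(tf)
    u, _, _ = np.linalg.svd(weighted, full_matrices=False)
    basis = u[:, :k]                    # orthonormal basis, terms x k
    return basis, basis.T @ weighted    # document coordinates, k x n_docs


# Hypothetical document collection.
docs = [
    "engine failure report",
    "engine maintenance report",
    "wing inspection log",
    "wing inspection report",
]
vocab, tf = term_frequency_matrix(docs)
basis, coords = project_documents(tf, k=2)
```

Each column of `coords` is one document expressed in the reduced subspace; distances or cosine similarities between columns could then serve retrieval or clustering.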
-
Publication No.: US08060538B2
Publication date: 2011-11-15
Application No.: US12718774
Filing date: 2010-03-05
IPC classification: G06F17/30
CPC classification: G06F17/30864 , Y10S707/99944
Abstract: Embodiments of the present invention are directed to acquiring information from the worldwide web, organizing information acquired from the worldwide web, and using the acquired and organized information to facilitate web-page searching, web-page browsing, and other worldwide-web-based activities. In one embodiment of the present invention, a database of concept objects is created from an initial set of semantic objects and from hyperlink information obtained from web pages by one or more web crawlers. The initial set of semantic objects is processed using hyperlink-based objects created by the web crawler. The processed semantic objects are then associated with additional hyperlink-based objects to create a concept-object database. In certain embodiments of the present invention, the concept-object database can be further refined and supplemented in an automated fashion by additional web crawling, subsequent association of hyperlink-based objects with concept objects, and creation of new concept objects, as well as by user input to, and editing of, the concept-object database. The concept-object database may be employed, in various embodiments of the present invention, to facilitate web browsing, web-page searching, and other worldwide-web-based activities.