发明申请
- 专利标题: TEXT ANALYSIS TECHNIQUES
- 专利标题(中): 文本分析技术
-
申请号: US11556437申请日: 2006-11-03
-
公开(公告)号: US20080109454A1公开(公告)日: 2008-05-08
- 发明人: Alan R. Willse , Elizabeth G. Hetzler , Lawrence L. Hope , Theodore E. Tanasse , Susan L. Havre , Alan E. Turner , Margaret MacGregor , Catherine Nancarrow , Grant C. Nakamura
- 申请人: Alan R. Willse , Elizabeth G. Hetzler , Lawrence L. Hope , Theodore E. Tanasse , Susan L. Havre , Alan E. Turner , Margaret MacGregor , Catherine Nancarrow , Grant C. Nakamura
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
One embodiment of the present invention includes means determining a concept representation for a set of text documents based on partial order analysis and modifying this representation if it is determined to be unidentifiable. Furthermore, the embodiment includes means for labeling the representation, mapping documents to it to provide a corresponding document representation, generating a number of document signatures each of a different type, and performing several data processing applications each with a different one of the document signatures of differing types.
信息查询