专利检索 ap:("TELCORDIA TECH INC") AND inv:"BASSU DEVASIS" 第 1 页

1.

发明公开
CONCEPT BASED CROSS MEDIA INDEXING AND RETRIEVAL OF SPEECH DOCUMENTS 审中-公开
标题翻译：网络媒体的条款和语言文档需要建立索引

公开(公告)号：EP2030132A4

公开(公告)日：2010-07-14

申请号：EP07777361

申请日：2007-06-01

申请人： TELCORDIA TECH INC

发明人： BEHRENS CLIFFORD A , EGAN DENNIS , BASSU DEVASIS

IPC分类号： G06F17/30

CPC分类号： G06F17/30746 , G06F17/30681

2.

发明公开
INFORMATION RETRIEVAL AND TEXT MINING USING DISTRIBUTED LATENT SEMANTIC INDEXING 审中-公开
标题翻译：信息读取和文本挖掘利用分布式递延语义索引

公开(公告)号：EP1618467A4

公开(公告)日：2008-09-17

申请号：EP04750497

申请日：2004-04-23

申请人： TELCORDIA TECH INC

发明人： BEHRENS CLIFFORD A , BASSU DEVASIS

IPC分类号： G06F7/00 , G06F17/30 , G11B20060101

CPC分类号： G06F17/3071 , Y10S707/99935 , Y10S707/99943

摘要： The use of latent semantic indexing (LSI) for information retrieval and text mining operations is adapted to work on large heterogeneous data sets by first partitioning the data set into a number of smaller partitions having similar concept domains. A similarity graph network is generated in order to expose links between concept domains which are then exploited in determing which domains to query as well as in expanding the query vector. LSI is performed on those partitioned data sets most likely to contain information related to the user query or text mining operation. In this manner LSI can be applied to datasets that heretofore presented scalability problems. Additionally, the computation of the singular value decomposition of the term-by-document matrix can be accomplished at various distributed computers increasing the robustness of the retrieval and text mining system while decreasing search times.