Invention Application
- Patent Title: Annotating token sequences within documents
- Patent Title (中): 在文档中注释令牌序列
-
Application No.: US11532977Application Date: 2006-09-19
-
Publication No.: US20080072134A1Publication Date: 2008-03-20
- Inventor: Sreeram Viswanath Balakrishnan , Ganesh Ramakrishnan , Sachindra Joshi
- Applicant: Sreeram Viswanath Balakrishnan , Ganesh Ramakrishnan , Sachindra Joshi
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F7/00

Abstract:
Token sequences within a number of documents are annotated. First, a base inverse index for unique tokens within the documents is received. The base inverse index includes a set of the unique tokens within the documents and a set of location lists for each unique token. Second, indices are created for a set of the token sequences within the documents from the base inverse index, to annotate the token sequences.
Information query