Invention Grant
- Patent Title: Processing an electronic document for information extraction
- Patent Title (中): 处理电子文件进行信息提取
-
Application No.: US10835215Application Date: 2004-04-29
-
Publication No.: US07672940B2Publication Date: 2010-03-02
- Inventor: Paul Viola , Hiu Chung Law , James Rinker
- Applicant: Paul Viola , Hiu Chung Law , James Rinker
- Applicant Address: US WA Redmond
- Assignee: Microsoft Corporation
- Current Assignee: Microsoft Corporation
- Current Assignee Address: US WA Redmond
- Agency: Westman, Champlin & Kelly, P.A.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
The present invention relates generally to automatically processing electronic documents. In one aspect, features and/or properties of words are identified from a set of training documents to aid in extracting information from documents to be processed. The features and/or properties relate to text of the words, position of the words and the relationship to other words. A classifier is developed to express these features and/or properties. During information extraction, documents are processed and analyzed based on the classifier and information is extracted based on correspondence of the documents and the features/properties expressed by the classifier.
Public/Granted literature
- US20050125402A1 Processing an electronic document for information extraction Public/Granted day:2005-06-09
Information query