Patent Application
- Title: APPARATUS AND METHODS FOR OPERATOR TRAINING IN INFORMATION EXTRACTION
- Application No.: US12398126
- Filing date: 2009-03-04
- Publication No.: US20100227301A1
- Publication date: 2010-09-09
- Inventors: Cong Yu, Mridul Muralidharan, Arun Shankar Iyer, Philip Lewis Bohannon
- Applicants: Cong Yu, Mridul Muralidharan, Arun Shankar Iyer, Philip Lewis Bohannon
- Applicant address: Sunnyvale, CA, US
- Assignee: YAHOO! INC.
- Current assignee: YAHOO! INC.
- Current assignee address: Sunnyvale, CA, US
- Main classification: G09B19/00
- IPC classification: G09B19/00
Abstract:
Disclosed are methods and apparatus for extracting information from one or more documents. A training and execution plan is received, and such plan specifies invocation of a trainer operator for initiating training of a trainee operator based on a set of training documents so as to generate a new trained operator that is to then be invoked so as to extract information from one or more unknown documents. The trainee operator is configured to extract information from one or more unknown documents, and each training document is associated with classified information. After receipt of the training and execution plan, the trainer operator is automatically executed to train the trainee operator based on the specified training documents so as to generate a new trained operator for extracting information from documents. The new trained operator is a new version of the trainee operator. After receipt of the training and execution plan, both the trainee operator and the new trained operator are automatically retained for later use in extracting information from one or more unknown documents. After receipt of the training and execution plan, the new trained operator is automatically executed on one or more unknown documents so as to extract information from such one or more unknown documents.
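
The flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. Every name here (`Operator`, `TrainingDocument`, `Plan`, `trainer`, `run_plan`, the `registry` dictionary) is hypothetical, and the token-to-label vocabulary is only a stand-in for whatever extraction model a real trainee operator would use. The sketch shows the three steps that follow receipt of a plan: the trainer operator produces a new version of the trainee, both versions are retained, and the new version is executed on the unknown documents.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Operator:
    """Hypothetical extraction operator: labels tokens it already knows."""
    name: str
    version: int
    vocabulary: Dict[str, str] = field(default_factory=dict)  # token -> label

    def extract(self, document: str) -> List[Tuple[str, str]]:
        # Return (token, label) pairs for every known token, in document order.
        return [(tok, self.vocabulary[tok])
                for tok in document.split() if tok in self.vocabulary]


@dataclass(frozen=True)
class TrainingDocument:
    """A training document together with its classified information."""
    text: str
    classified: Dict[str, str]  # token -> label (assumed representation)


@dataclass(frozen=True)
class Plan:
    """Hypothetical training and execution plan."""
    trainee_name: str
    training_documents: List[TrainingDocument]
    unknown_documents: List[str]


def trainer(trainee: Operator, docs: List[TrainingDocument]) -> Operator:
    """Toy trainer operator: returns a NEW version; the trainee is untouched."""
    learned = dict(trainee.vocabulary)
    for doc in docs:
        tokens = set(doc.text.split())
        for token, label in doc.classified.items():
            if token in tokens:  # only learn labels grounded in the text
                learned[token] = label
    return Operator(trainee.name, trainee.version + 1, learned)


def run_plan(plan: Plan, registry: Dict[str, List[Operator]]):
    """Train, retain both versions, then execute the new trained operator."""
    trainee = registry[plan.trainee_name][-1]
    trained = trainer(trainee, plan.training_documents)
    registry[plan.trainee_name].append(trained)  # the old version stays in the list
    return [trained.extract(doc) for doc in plan.unknown_documents]


# Example run with made-up data.
registry = {"city_tagger": [Operator("city_tagger", 1, {"Paris": "CITY"})]}
plan = Plan(
    trainee_name="city_tagger",
    training_documents=[TrainingDocument("Flights to Oslo daily", {"Oslo": "CITY"})],
    unknown_documents=["Paris and Oslo"],
)
results = run_plan(plan, registry)
print(results)  # [[('Paris', 'CITY'), ('Oslo', 'CITY')]]
```

Keeping each operator immutable and appending new versions to a list is one simple way to satisfy the requirement that both the trainee operator and the new trained operator remain available for later use. A production system would more likely persist versions in a store keyed by operator name and version.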