Invention Grant
- Patent Title: Reclassification of training data to improve classifier accuracy
- Patent Title (中): 重新分类训练数据,提高分类精度
-
Application No.: US11764291Application Date: 2007-06-18
-
Publication No.: US09342588B2Publication Date: 2016-05-17
- Inventor: Rajesh Balchandran , Linda M. Boyer , Gregory Purdy
- Applicant: Rajesh Balchandran , Linda M. Boyer , Gregory Purdy
- Applicant Address: US NY Armonk
- Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
- Current Assignee Address: US NY Armonk
- Agency: Cuenot, Forsythe & Kim, LLC
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G06F17/30

Abstract:
A method of creating a statistical classification model for a classifier within a natural language understanding system can include processing training data using an existing statistical classification model. Sentences of the training data correctly classified into a selected class of the statistical classification model can be selected. The selected sentences of the training data can be assigned to a fringe group or a core group according to confidence score. The training data can be updated by associating the fringe group with a fringe subclass of the selected class and the core group with a core subclass of the selected class. A new statistical classification model can be built from the updated training data. The new statistical classification model can be output.
Public/Granted literature
- US20080312906A1 Reclassification of Training Data to Improve Classifier Accuracy Public/Granted day:2008-12-18
Information query