Invention Grant
- Patent Title: Extracting tokens in a natural language understanding application
- Patent Title (中): 在自然语言理解应用中提取令牌
-
Application No.: US11764285Application Date: 2007-06-18
-
Publication No.: US08285539B2Publication Date: 2012-10-09
- Inventor: Rajesh Balchandran , Linda M. Boyer , Gregory Purdy
- Applicant: Rajesh Balchandran , Linda M. Boyer , Gregory Purdy
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Cuenot, Forsythe & Kim, LLC
- Main IPC: G06F17/27
- IPC: G06F17/27

Abstract:
A method of processing text within a natural language understanding system can include applying a first tokenization technique to a sentence using a statistical tokenization model. A second tokenization technique using a named entity can be applied to the sentence when the first tokenization technique does not extract a needed token according to a class of the sentence. A token determined according to at least one of the tokenization techniques can be output.
Public/Granted literature
- US20080312905A1 Extracting Tokens in a Natural Language Understanding Application Public/Granted day:2008-12-18
Information query