Invention Application
- Patent title: METHODS FOR GENERATING NATURAL LANGUAGE PROCESSING SYSTEMS
- Patent title (Chinese): 用于生成自然语言处理系统的方法
- Application number: US14964517
- Filing date: 2015-12-09
- Publication number: US20160162456A1
- Publication date: 2016-06-09
- Inventors: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen
- Applicants: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen
- Applicant address: US CA San Francisco
- Assignee: Idibon, Inc.
- Current assignee: Idibon, Inc.
- Current assignee address: US CA San Francisco
- Main classification: G06F17/24
- IPC classification: G06F17/24; G06F17/22; G06F17/28
Abstract:
Methods are presented for generating a natural language model. The method may comprise: ingesting training data representative of documents to be analyzed by the natural language model; generating a hierarchical data structure comprising at least two topical nodes into which the training data is to be subdivided by the natural language model; selecting a plurality of documents among the training data to be annotated; generating an annotation prompt for each document configured to elicit an annotation about said document indicating which node among the at least two topical nodes said document is to be classified into; receiving the annotation based on the annotation prompt; and generating the natural language model using an adaptive machine learning process configured to determine patterns among the annotations for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.
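The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every name in it (`TopicNode`, `select_for_annotation`, `make_prompt`, `train`, `classify`) is hypothetical, the selection step simply takes the first documents, the annotations are hard-coded where human annotators would supply them, and a small naive Bayes classifier stands in for the unspecified "adaptive machine learning process".

```python
import math
from collections import Counter, defaultdict
from dataclasses import dataclass, field


@dataclass
class TopicNode:
    """One node of a hypothetical topic hierarchy."""
    name: str
    children: list = field(default_factory=list)

    def leaves(self):
        # A node without children is itself a leaf topic.
        if not self.children:
            return [self]
        return [leaf for child in self.children for leaf in child.leaves()]


def tokenize(text):
    return text.lower().split()


def select_for_annotation(documents, k):
    # Placeholder strategy (assumption): take the first k documents.
    return documents[:k]


def make_prompt(document, root):
    # Build an annotation prompt listing the leaf topics as options.
    options = ", ".join(leaf.name for leaf in root.leaves())
    return f"Which topic best fits this document? Options: {options}\n\n{document}"


def train(annotated):
    # Count words per label from (document, label) pairs.
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for doc, label in annotated:
        label_counts[label] += 1
        word_counts[label].update(tokenize(doc))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def classify(model, document):
    # Naive Bayes with add-one smoothing; words outside the vocabulary are skipped.
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in tokenize(document):
            if word in vocab:
                score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy data, invented for illustration only.
root = TopicNode("root", [TopicNode("billing"), TopicNode("shipping")])
docs = [
    "refund my invoice charge",
    "package arrived late delivery",
    "invoice charge wrong",
    "delivery truck lost package",
]
to_annotate = select_for_annotation(docs, 4)
prompts = [make_prompt(d, root) for d in to_annotate]
# Stand-in for annotations received from human annotators.
annotations = ["billing", "shipping", "billing", "shipping"]
model = train(list(zip(to_annotate, annotations)))
```

Calling `classify(model, "late delivery of package")` on this toy model assigns the document to one of the two leaf topics. A real system would presumably choose documents to annotate more selectively and retrain as new annotations arrive, which this sketch does not attempt.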
Published/Granted Documents
- US10127214B2 Methods for generating natural language processing systems. Publication/grant date: 2018-11-13