Patent search: ap:("Robert J. Munro" OR "Schuyler D. Erle" OR "Christopher Walker" OR "Sarah K. Luger" OR "Jason Brenier" OR "Gary C. King" OR "Paul A. Tepper" OR "Ross Mechanic" OR "Andrew Gilchrist-Scott" OR "Jessica D. Long" OR "James B. Robinson" OR "Brendan D. Callahan" OR "Michelle Casbon" OR "Ujjwal Sarin" OR "Aneesh Nair" OR "Veena Basavaraj" OR "Tripti Saxena" OR "Edgar Nunez" OR "Martha G. Hinrichs" OR "Haley Most" OR "Tyler J. Schnoebelen") AND inv:"Tripti Saxena" (page 1)

1.

Invention application
METHODS FOR GENERATING NATURAL LANGUAGE PROCESSING SYSTEMS
Status: under examination (published)

Publication No.: US20160162456A1

Publication date: 2016-06-09

Application No.: US14964517

Filing date: 2015-12-09

Applicants: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen

Inventors: Robert J. Munro, Schuyler D. Erle, Christopher Walker, Sarah K. Luger, Jason Brenier, Gary C. King, Paul A. Tepper, Ross Mechanic, Andrew Gilchrist-Scott, Jessica D. Long, James B. Robinson, Brendan D. Callahan, Michelle Casbon, Ujjwal Sarin, Aneesh Nair, Veena Basavaraj, Tripti Saxena, Edgar Nunez, Martha G. Hinrichs, Haley Most, Tyler J. Schnoebelen

IPC classification: G06F17/24, G06F17/22, G06F17/28

CPC classification: G06F17/30598, G06F3/0482, G06F17/2241, G06F17/241, G06F17/272, G06F17/2785, G06F17/28, G06F17/2809, G06F17/30011, G06F17/30401, G06F17/30445, G06F17/30604, G06F17/30654, G06F17/30705, G06F17/30734, G06F17/30864, G06Q50/01

Abstract: Methods are presented for generating a natural language model. The method may comprise: ingesting training data representative of documents to be analyzed by the natural language model, generating a hierarchical data structure comprising at least two topical nodes within which the training data is to be subdivided into by the natural language model, selecting a plurality of documents among the training data to be annotated, generating an annotation prompt for each document configured to elicit an annotation about said document indicating which node among the at least two topical nodes said document is to be classified into, receiving the annotation based on the annotation prompt; and generating the natural language model using an adaptive machine learning process configured to determine patterns among the annotations for how the documents in the training data are to be subdivided according to the at least two topical nodes of the hierarchical data structure.
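
Illustrative sketch (not taken from the patent): the steps in this abstract can be pictured as a topic hierarchy, a prompt generated per document, and a model fitted to the returned annotations. The names TopicNode, annotation_prompt, train_word_counts and classify are hypothetical, and the per-topic word-count scorer is a deliberately simple stand-in for the unspecified "adaptive machine learning process".

```python
from __future__ import annotations

from collections import Counter, defaultdict
from dataclasses import dataclass, field


@dataclass
class TopicNode:
    """One node of the hierarchical structure (hypothetical name)."""
    name: str
    children: list[TopicNode] = field(default_factory=list)


def annotation_prompt(document: str, nodes: list[TopicNode]) -> str:
    """Build a human-readable prompt asking which topical node fits the document."""
    options = ", ".join(node.name for node in nodes)
    return f"Which topic best fits this document? Options: {options}\n---\n{document}"


def train_word_counts(annotated: list[tuple[str, str]]) -> dict[str, Counter]:
    """Assumed stand-in for the learning step: count words seen under each topic."""
    counts: dict[str, Counter] = defaultdict(Counter)
    for text, label in annotated:
        counts[label].update(text.lower().split())
    return counts


def classify(text: str, counts: dict[str, Counter]) -> str:
    """Assign the topic whose annotated documents share the most words with the text."""
    words = text.lower().split()
    return max(counts, key=lambda label: sum(counts[label][w] for w in words))


root = TopicNode("root", [TopicNode("billing"), TopicNode("shipping")])
prompt = annotation_prompt("package arrived late", root.children)
# Annotations would come from human annotators answering the prompts.
annotated = [("refund my invoice charge", "billing"), ("package arrived late", "shipping")]
model = train_word_counts(annotated)
predicted = classify("late package", model)
```

A production system would replace the word-count scorer with a trained classifier; the sketch only shows how the hierarchy, prompts, and annotations fit together.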

2.

Invention application
OPTIMIZATION TECHNIQUES FOR ARTIFICIAL INTELLIGENCE
Status: under examination (published)

Publication No.: US20160162457A1

Publication date: 2016-06-09

Application No.: US14964520

Filing date: 2015-12-09

Applicants: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj

Inventors: Robert J. Munro, Schuyler D. Erle, Jason Brenier, Paul A. Tepper, Tripti Saxena, Gary C. King, Jessica D. Long, Brendan D. Callahan, Tyler J. Schnoebelen, Stefan Krawczyk, Veena Basavaraj

IPC classification: G06F17/24, G06F17/28

CPC classification: G06F17/241, G06F3/0482, G06F16/243, G06F16/24532, G06F16/285, G06F16/288, G06F16/3329, G06F16/35, G06F16/367, G06F16/93, G06F16/951, G06F17/2241, G06F17/272, G06F17/2785, G06F17/28, G06F17/2809, G06Q50/01

Abstract: Methods, apparatuses and computer readable medium are presented for generating a natural language model. A method for generating a natural language model comprises: selecting from a pool of documents, a first set of documents to be annotated; receiving annotations of the first set of documents elicited by first human readable prompts; training a natural language model using the annotated first set of documents; determining documents in the pool having uncertain natural language processing results according to the trained natural language model and/or the received annotations; selecting from the pool of documents, a second set of documents to be annotated comprising documents having uncertain natural language processing results; receiving annotations of the second set of documents elicited by second human readable prompts; and retraining a natural language model using the annotated second set of documents.
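
Illustrative sketch (not taken from the patent): the select, annotate, train, reselect, retrain loop in this abstract resembles what is commonly called active learning with uncertainty sampling. The code below assumes a small naive Bayes text classifier and least-confidence selection; the patent does not name either, and train, predict_proba, select_uncertain and ask_annotator are hypothetical names.

```python
import math
from collections import Counter, defaultdict


def train(annotated):
    """Fit per-label word counts and label counts (naive Bayes, an assumed model)."""
    word_counts, label_counts = defaultdict(Counter), Counter()
    for text, label in annotated:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    return word_counts, label_counts


def predict_proba(model, text):
    """Return label probabilities using add-one smoothing."""
    word_counts, label_counts = model
    vocab_size = len({w for c in word_counts.values() for w in c})
    total_docs = sum(label_counts.values())
    log_scores = {}
    for label, n_docs in label_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)
        for word in text.lower().split():
            score += math.log((word_counts[label][word] + 1) / (total_words + vocab_size + 1))
        log_scores[label] = score
    peak = max(log_scores.values())
    weights = {label: math.exp(s - peak) for label, s in log_scores.items()}
    norm = sum(weights.values())
    return {label: w / norm for label, w in weights.items()}


def select_uncertain(model, pool, k):
    """Pick the k documents whose most likely label has the lowest probability."""
    return sorted(pool, key=lambda doc: max(predict_proba(model, doc).values()))[:k]


def active_learning_round(annotated, pool, k, ask_annotator):
    """Train, select uncertain documents, collect annotations, and retrain."""
    model = train(annotated)
    second_set = select_uncertain(model, pool, k)
    annotated = annotated + [(doc, ask_annotator(doc)) for doc in second_set]
    return train(annotated), annotated, second_set


first_set = [("great product love it", "pos"), ("terrible broken refund now", "neg")]
pool = ["love it great", "terrible refund", "arrived on tuesday"]
# ask_annotator stands in for a human answering a prompt; here it always says "pos".
model, annotated, second_set = active_learning_round(first_set, pool, 1, lambda doc: "pos")
```

In this toy run the document with no known words is the one the model is least sure about, so it is the one routed back to an annotator before retraining.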

3.

Invention application
INTELLIGENT SYSTEM THAT DYNAMICALLY IMPROVES ITS KNOWLEDGE AND CODE-BASE FOR NATURAL LANGUAGE UNDERSTANDING
Status: under examination (published)

Publication No.: US20190205377A1

Publication date: 2019-07-04

Application No.: US16056263

Filing date: 2018-08-06

Applicants: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

IPC classification: G06F17/27

CPC classification: G06F17/277, G06F17/2715, G06F17/2785

Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
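
Illustrative sketch (not taken from the patent): one way to picture a tokenizer that combines rules with statistics. It assumes the rules are simply a set of known-valid token strings, the statistical model is a relative-frequency table of whitespace-delimited sequences, and the fallback tries a single two-way split. All function names and the threshold value are hypothetical.

```python
from collections import Counter


def build_stats(pool):
    """Estimate how likely each whitespace-delimited sequence is in the pool."""
    counts = Counter(tok for doc in pool for tok in doc.lower().split())
    total = sum(counts.values())
    return {tok: n / total for tok, n in counts.items()}


def promote_rules(stats, rules, threshold):
    """Turn high-likelihood statistical entries into new rules (valid tokens)."""
    return rules | {tok for tok, p in stats.items() if p >= threshold}


def split_with_stats(chunk, stats):
    """Fallback: keep the chunk whole or split it once, whichever scores higher."""
    best, best_score = [chunk], stats.get(chunk, 0.0)
    for i in range(1, len(chunk)):
        left, right = chunk[:i], chunk[i:]
        score = stats.get(left, 0.0) * stats.get(right, 0.0)
        if score > best_score:
            best, best_score = [left, right], score
    return best


def tokenize(text, rules, stats):
    """Apply rules first; use the statistical model only where no rule matches."""
    tokens = []
    for chunk in text.lower().split():
        if chunk in rules:
            tokens.append(chunk)
        else:
            tokens.extend(split_with_stats(chunk, stats))
    return tokens


pool = ["new york is big", "new ideas", "york is old"]
stats = build_stats(pool)
rules = promote_rules(stats, {"big"}, threshold=0.2)
tokens = tokenize("newyork is big", rules, stats)
```

Here "newyork" matches no rule, so the frequency table decides the split, while "is" and "big" are accepted directly by the rules.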

4.

Invention application
INTELLIGENT SYSTEM THAT DYNAMICALLY IMPROVES ITS KNOWLEDGE AND CODE-BASE FOR NATURAL LANGUAGE UNDERSTANDING
Status: granted (in force)

Publication No.: US20160162466A1

Publication date: 2016-06-09

Application No.: US14964512

Filing date: 2015-12-09

Applicants: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

Inventors: Robert J. Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

IPC classification: G06F17/27

CPC classification: G06F17/277, G06F17/2715, G06F17/2785

Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.

5.

Invention application
INTELLIGENT SYSTEM THAT DYNAMICALLY IMPROVES ITS KNOWLEDGE AND CODE-BASE FOR NATURAL LANGUAGE UNDERSTANDING
Status: under examination (published)

Publication No.: US20180095946A1

Publication date: 2018-04-05

Application No.: US15596855

Filing date: 2017-05-16

Applicants: Robert Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

Inventors: Robert Munro, Rob Voigt, Schuyler D. Erle, Brendan D. Callahan, Gary C. King, Jessica D. Long, Jason Brenier, Tripti Saxena, Stefan Krawczyk

IPC classification: G06F17/27

CPC classification: G06F17/277, G06F17/2715, G06F17/2785

Abstract: Systems, methods, and apparatuses are presented for a novel natural language tokenizer and tagger. In some embodiments, a method for tokenizing text for natural language processing comprises: generating from a pool of documents, a set of statistical models comprising one or more entries each indicating a likelihood of appearance of a character/letter sequence in the pool of documents; receiving a set of rules comprising rules that identify character/letter sequences as valid tokens; transforming one or more entries in the statistical models into new rules that are added to the set of rules when the entries indicate a high likelihood; receiving a document to be processed; dividing the document to be processed into tokens based on the set of statistical models and the set of rules, wherein the statistical models are applied where the rules fail to unambiguously tokenize the document; and outputting the divided tokens for natural language processing.
