Cross-Document Intelligent Authoring and Processing, With Arbitration for Semantically-Annotated Documents

Patent application

US20220245335A1 Cross-Document Intelligent Authoring and Processing, With Arbitration for Semantically-Annotated Documents (in force)

Patent title: Cross-Document Intelligent Authoring and Processing, With Arbitration for Semantically-Annotated Documents
Application number: US17724934

Filing date: 2022-04-20
Publication number: US20220245335A1

Publication date: 2022-08-04
Inventors: Andrew Paul Begun, Steven DeRose, Taqi Jaffri, Luis Marti Orosa, Michael B. Palmer, Jean Paoli, Christina Pavlopoulou, Elena Pricoiu, Swagatika Sarangi, Marcin Sawicki, Manar Shehadeh, Michael Taron, Bhaven Toprani, Zubin Rustom Wadia, David Watson, Eric White, Joshua Yongshin Fan, Kush Gupta, Andrew Minh Hoang, Zhanlin Liu, Jerome George Paliakkara, Zhaofeng Wu, Yue Zhang, Xiaoquan Zhou
Applicant: Docugami, Inc.
Applicant address: US WA Kirkland
Patent holder: Docugami, Inc.
Current patent holder: Docugami, Inc.
Current patent holder address: US WA Kirkland
Main classification: G06F40/186
IPC classifications: G06F40/186; G06N20/00; G06F40/30; G06F40/169; G06F40/117; G06F40/106; G06F40/289; G06F40/295; G06F16/93; G06F16/2457; G06F16/248; G06V30/414; G06V30/416

Abstract:

Machine learning, artificial intelligence, and other computer-implemented methods are used to identify various semantically important chunks in documents, automatically label them with appropriate datatypes and semantic roles, and use this enhanced information to assist authors and to support downstream processes. Chunk locations, datatypes, and semantic roles can often be automatically determined from what is here called “context”, to wit, the combination of their formatting, structure, and content; those of adjacent or nearby content; overall patterns of occurrence in a document, and similarities of all these things across documents (mainly but not exclusively among documents in the same document set). Similarity is not limited to exact or fuzzy string or property comparisons, but may include similarity of natural language grammatical structure, ML (machine learning) techniques such as measuring similarity of word, chunk, and other embeddings, and the datatypes and semantic roles of previously-identified chunks.
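As a rough illustration of the similarity idea in the abstract, the sketch below scores a new chunk against previously labeled chunks and borrows the semantic role of the closest one. It is a minimal sketch under stated assumptions, not the method claimed in the patent: the `Chunk` class, the `formatting` feature, the weights, and the function names are all hypothetical, and a bag-of-words vector stands in for the learned embeddings the abstract mentions.

```python
import math
from collections import Counter
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class Chunk:
    """Hypothetical chunk record; field names are assumptions for illustration."""
    text: str
    formatting: str          # e.g. "bold" or "plain"
    semantic_role: str = ""  # empty until a role has been assigned


def bag_of_words(text: str) -> Counter:
    # Stand-in for a learned embedding: whitespace-token counts.
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[token] * b[token] for token in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def similarity(a: Chunk, b: Chunk) -> float:
    # Combine fuzzy string match, vector similarity, and a formatting match.
    # The 0.4 / 0.4 / 0.2 weights are arbitrary illustrative values.
    fuzzy = SequenceMatcher(None, a.text.lower(), b.text.lower()).ratio()
    vector = cosine(bag_of_words(a.text), bag_of_words(b.text))
    same_format = 1.0 if a.formatting == b.formatting else 0.0
    return 0.4 * fuzzy + 0.4 * vector + 0.2 * same_format


def suggest_role(chunk: Chunk, labeled: list[Chunk]) -> str:
    # Propose the role of the most similar previously labeled chunk.
    if not labeled:
        return ""
    best = max(labeled, key=lambda other: similarity(chunk, other))
    return best.semantic_role
```

With two labeled chunks such as "Effective Date: January 5, 2021" (bold, role `EffectiveDate`) and "Governing Law: State of Washington" (plain, role `GoverningLaw`), a new bold chunk "Effective Date: March 1, 2022" would be assigned `EffectiveDate`. A fuller treatment would presumably also weigh nearby content, document-level patterns, and grammatical structure, which this sketch omits.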

Published/granted documents

US11960832B2 Cross-document intelligent authoring and processing, with arbitration for semantically-annotated documents. Publication/grant date: 2024-04-16

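IPC classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F40/00	Handling natural language data (speech analysis or synthesis, speech recognition: see G10L)
G06F40/10	.Text processing (natural language analysis: G06F 40/20; semantic analysis: G06F 40/30; processing or translation of natural language: G06F 40/40)
G06F40/166	..Editing, e.g. inserting or deleting
G06F40/186	...Templates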