Publication No.: US10949661B2
Publication Date: 2021-03-16
Application No.: US16198040
Application Date: 2018-11-21
Applicant: Amazon Technologies, Inc.
Inventor: Rahul Bhotika, Shai Mazor, Amit Adam, Wendy Tse, Andrea Olgiati, Bhavesh Doshi, Gururaj Kosuru, Patrick Ian Wilson, Umar Farooq, Anand Dhandhania
IPC: G06K9/00
Abstract: Techniques for layout-agnostic complex document processing are described. A document processing service can analyze documents that do not adhere to defined layout rules in an automated manner to determine the content and meaning of a variety of types of segments within the documents. The service may chunk a document into multiple chunks, and operate upon the chunks in parallel by identifying segments within each chunk, classifying the segments into segment types, and processing the segments using special-purpose analysis engines adapted for the analysis of particular segment types to generate results that can be aggregated into an overall output for the entire document that captures the meaning and context of the document text.
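
Illustrative sketch (not taken from the patent): the flow the abstract describes, chunking a document, identifying and classifying segments within each chunk, routing each segment to a type-specific engine, and aggregating the results, might look roughly like the Python below. Every name here (chunk_document, identify_segments, classify_segment, the ENGINES table, the "form"/"table"/"text" segment types) is a hypothetical placeholder, and the line-based segmentation and character-based classification are toy stand-ins for whatever models a real service would use.

```python
from concurrent.futures import ThreadPoolExecutor

# All names below are hypothetical; the patent abstract does not specify them.

def chunk_document(pages, pages_per_chunk=2):
    """Split a list of page strings into fixed-size chunks."""
    return [pages[i:i + pages_per_chunk]
            for i in range(0, len(pages), pages_per_chunk)]

def identify_segments(chunk):
    """Toy segmentation: each non-empty line on each page is one segment."""
    return [line.strip()
            for page in chunk
            for line in page.splitlines()
            if line.strip()]

def classify_segment(segment):
    """Toy classifier: '|' marks a table row, ':' marks a form field."""
    if "|" in segment:
        return "table"
    if ":" in segment:
        return "form"
    return "text"

def analyze_form(segment):
    key, _, value = segment.partition(":")
    return {"type": "form", "key": key.strip(), "value": value.strip()}

def analyze_table(segment):
    return {"type": "table", "cells": [c.strip() for c in segment.split("|")]}

def analyze_text(segment):
    return {"type": "text", "text": segment}

# One special-purpose engine per segment type.
ENGINES = {"form": analyze_form, "table": analyze_table, "text": analyze_text}

def process_chunk(chunk):
    """Identify, classify, and analyze every segment in one chunk."""
    return [ENGINES[classify_segment(s)](s) for s in identify_segments(chunk)]

def process_document(pages, pages_per_chunk=2):
    """Process chunks in parallel, then aggregate results in document order."""
    chunks = chunk_document(pages, pages_per_chunk)
    with ThreadPoolExecutor() as pool:
        per_chunk = list(pool.map(process_chunk, chunks))  # map keeps chunk order
    return [result for results in per_chunk for result in results]
```

A thread pool is used here only to show the parallel-per-chunk shape; a production service would more plausibly distribute chunks across separate workers or machines.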