Invention Grant
- Patent Title: Multiple channels of rasterized content for page decomposition using machine learning
-
Application No.: US16655363Application Date: 2019-10-17
-
Publication No.: US11386685B2Publication Date: 2022-07-12
- Inventor: Verena Sabine Kaynig-Fittkau , Smitha Bangalore Naresh , Shawn Alan Gaither , Richard Cohn , Paul John Asente , Eylon Stroh , Emily Seminerio
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: Adobe Inc.
- Current Assignee: Adobe Inc.
- Current Assignee Address: US CA San Jose
- Agency: Finch & Maloney PLLC
- Main IPC: G06V30/413
- IPC: G06V30/413 ; G06N20/00 ; G06V30/412 ; G06V30/414

Abstract:
Techniques are provided for identifying structural elements of a document. One Methodology includes generating a first channel of rasterized content by rasterizing a full page of the document and generating one or more additional channels of rasterized content from the page of the document by rasterizing one or more corresponding content types from the page of the document. Each of the one or more additional channels includes a specific type of content that is different from each of the other one or more additional channels. The methodology further includes inputting the first channel of rasterized content and the one or more additional channels of rasterized content into a machine learning (ML) model. The methodology continues with determining location and classification for each of a plurality of structural elements on the page of the document using the ML model.
Public/Granted literature
- US20210117666A1 MULTIPLE CHANNELS OF RASTERIZED CONTENT FOR PAGE DECOMPOSITION USING MACHINE LEARNING Public/Granted day:2021-04-22
Information query