Invention Grant
- Patent Title: Method for inferring blocks of text in electronic documents
-
Application No.: US15859152Application Date: 2017-12-29
-
Publication No.: US10579707B2Publication Date: 2020-03-03
- Inventor: Tim Prebble
- Applicant: Konica Minolta Laboratory U.S.A., Inc.
- Applicant Address: US CA San Mateo
- Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
- Current Assignee: KONICA MINOLTA LABORATORY U.S.A., INC.
- Current Assignee Address: US CA San Mateo
- Agency: Osha Liang LLP
- Main IPC: G06F17/21
- IPC: G06F17/21 ; G06K9/00 ; G06N20/00 ; G06F17/22

Abstract:
A method for processing an electronic document with characters includes adjusting the characters to identify lines and words; generating a cluster encompassing all of the lines and the words; setting the cluster as a target; determining whether the target can be divided; in response to determining that the target can be divided, dividing the target into a first plurality of sub-clusters; identifying blocks of text based on the first sub-clusters; and generating a new electronic document with paragraphs and sections based on the blocks of text.
Public/Granted literature
- US20190205362A1 METHOD FOR INFERRING BLOCKS OF TEXT IN ELECTRONIC DOCUMENTS Public/Granted day:2019-07-04
Information query