Method for inferring blocks of text in electronic documents
Abstract:
A method for processing an electronic document with characters includes adjusting the characters to identify lines and words; generating a cluster encompassing all of the lines and the words; setting the cluster as a target; determining whether the target can be divided; in response to determining that the target can be divided, dividing the target into a first plurality of sub-clusters; identifying blocks of text based on the first plurality of sub-clusters; and generating a new electronic document with paragraphs and sections based on the blocks of text.
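The divide-the-target loop described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the method decides whether a target can be divided, so the sketch below substitutes a simple whitespace-gap rule in the style of a recursive XY-cut; this is an assumption, not the claimed criterion. The bounding-box representation, the names `split_on_gap` and `infer_blocks`, and the `min_gap` threshold are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical representation: each line or word is a bounding box (x0, y0, x1, y1).
Box = Tuple[float, float, float, float]


def split_on_gap(boxes: List[Box], axis: int, min_gap: float) -> List[List[Box]]:
    """Group boxes along one axis (0 = x, 1 = y), starting a new group
    wherever the whitespace between consecutive boxes exceeds min_gap."""
    ordered = sorted(boxes, key=lambda b: b[axis])
    groups = [[ordered[0]]]
    reach = ordered[0][axis + 2]  # farthest extent covered by the current group
    for box in ordered[1:]:
        if box[axis] - reach > min_gap:
            groups.append([box])
            reach = box[axis + 2]
        else:
            groups[-1].append(box)
            reach = max(reach, box[axis + 2])
    return groups


def infer_blocks(boxes: List[Box], min_gap: float = 10.0) -> List[List[Box]]:
    """Start from one cluster holding every box, set it as the target, and
    keep dividing targets into sub-clusters until none can be divided.
    The gap-based divisibility test is an assumed stand-in."""
    target = list(boxes)
    if not target:
        return []
    for axis in (1, 0):  # try a horizontal cut first, then a vertical cut
        parts = split_on_gap(target, axis, min_gap)
        if len(parts) > 1:  # the target can be divided
            blocks: List[List[Box]] = []
            for part in parts:
                blocks.extend(infer_blocks(part, min_gap))
            return blocks
    return [target]  # indivisible target: treat it as one block of text


# Two columns of two lines each, followed by a full-width footer line.
page = [
    (0, 0, 100, 10), (0, 12, 100, 22),      # left column
    (150, 0, 250, 10), (150, 12, 250, 22),  # right column
    (0, 60, 250, 70),                       # footer
]
blocks = infer_blocks(page)
```

Under these assumptions, each returned list is one inferred block of text; assembling paragraphs and sections for the new electronic document would be a separate step that the sketch does not cover.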