Method for inferring blocks of text in electronic documents

    公开(公告)号:US10579707B2

    公开(公告)日:2020-03-03

    申请号:US15859152

    申请日:2017-12-29

    Inventor: Tim Prebble

    Abstract: A method for processing an electronic document with characters includes adjusting the characters to identify lines and words; generating a cluster encompassing all of the lines and the words; setting the cluster as a target; determining whether the target can be divided; in response to determining that the target can be divided, dividing the target into a first plurality of sub-clusters; identifying blocks of text based on the first sub-clusters; and generating a new electronic document with paragraphs and sections based on the blocks of text.

    Producing a flowchart object from an image

    公开(公告)号:US09934431B2

    公开(公告)日:2018-04-03

    申请号:US15221315

    申请日:2016-07-27

    Inventor: Tim Prebble

    Abstract: A method for image processing. The method includes: reading an image of a flowchart; identifying, within the image, a plurality of paths corresponding to the flowchart; classifying a first path of the plurality of paths as a flowchart element by: calculating, during a solo evaluation phase, a plurality of established likelihood scores for the first path based on a plurality of properties of the first path; calculating, during a neighbor-based evaluation phase, a first plurality of provisional likelihood scores for the first path based on the plurality of established likelihood scores for the first path and a plurality of established likelihood scores for a second path of the plurality of paths; and updating, during the neighbor-based evaluation phase, the plurality of established likelihood scores for the first path based on the first plurality of provisional likelihood scores; and generating a flowchart object based on the classified first path.

    INFERRING TITLES AND SECTIONS IN DOCUMENTS
    4.
    发明申请

    公开(公告)号:US20200311412A1

    公开(公告)日:2020-10-01

    申请号:US16370110

    申请日:2019-03-29

    Inventor: Tim Prebble

    Abstract: A method for processing an electronic document (ED) to infer titles and sections in the ED includes: applying visual analysis to the ED and identifying candidate titles and candidate sections of the ED; filtering the candidate titles based on the candidate sections; filtering the candidate sections based on the filtered candidate titles; applying semantic analysis to the ED and identifying topics and portions of the ED; refining, based on the identified topics and the portions, the filtered candidate titles and the filtered candidate sections; and generating a marked-up version of the ED that identifies the refined candidate titles and the refined candidate sections.

    PRODUCING A FLOWCHART OBJECT FROM AN IMAGE

    公开(公告)号:US20180032806A1

    公开(公告)日:2018-02-01

    申请号:US15221315

    申请日:2016-07-27

    Inventor: Tim Prebble

    Abstract: A method for image processing& The method includes: reading an image of a flowchart; identifying, within the image, a plurality of paths corresponding to the flowchart; classifying a first path of the plurality of paths as a flowchart element by: calculating, during a solo evaluation phase, a plurality of established likelihood scores for the first path based on a plurality of properties of the first path; calculating, during a neighbor-based evaluation phase, a first plurality of provisional likelihood scores for the first path based on the plurality of established likelihood scores for the first path and a plurality of established likelihood scores for a second path of the plurality of paths; and updating, during the neighbor-based evaluation phase, the plurality of established likelihood scores for the first path based on the first plurality of provisional likelihood scores; and generating a flowchart object based on the classified first path.

    Selecting primary groups during production of a flowchart object from an image

    公开(公告)号:US09977956B2

    公开(公告)日:2018-05-22

    申请号:US15223449

    申请日:2016-07-29

    Inventor: Tim Prebble

    CPC classification number: G06K9/00456 G06K9/00449 G06K9/00463 G06K9/00476

    Abstract: A method for image processing. The method includes: reading an image of a flowchart; identifying, within the image, a plurality of paths corresponding to the flowchart; grouping the plurality of paths into a plurality of groups including a first group and a second group; calculating a plurality of likelihood scores corresponding to flowchart elements for each of the plurality of groups; identifying a first path belonging to the first group and the second group; and selecting the first group as the primary group for the first path based on a maximum likelihood score for the first group and a maximum likelihood score for the second group; and generating a flowchart object based on the primary group for the first path.

    SELECTING PRIMARY GROUPS DURING PRODUCTION OF A FLOWCHART OBJECT FROM AN IMAGE

    公开(公告)号:US20180032807A1

    公开(公告)日:2018-02-01

    申请号:US15223449

    申请日:2016-07-29

    Inventor: Tim Prebble

    CPC classification number: G06K9/00456 G06K9/00449 G06K9/00463 G06K9/00476

    Abstract: A method for image processing. The method includes: reading an image of a flowchart; identifying, within the image, a plurality of paths corresponding to the flowchart; grouping the plurality of paths into a plurality of groups including a first group and a second group; calculating a plurality of likelihood scores corresponding to flowchart elements for each of the plurality of groups; identifying a first path belonging to the first group and the second group; and selecting the first group as the primary group for the first path based on a maximum likelihood score for the first group and a maximum likelihood score for the second group; and generating a flowchart object based on the primary group for the first path.

Patent Agency Ranking