Resume Document Parsing using Computer Vision and Optical Character Recognition with Reblocking Feedback

    Publication No.: US20230215206A1

    Publication Date: 2023-07-06

    Application No.: US17331463

    Filing Date: 2021-05-26

    Applicant: Indeed, Inc.

    IPC Classification: G06K9/00 G06K9/62

    Abstract: Systems and methods are disclosed for parsing resume documents using computer vision and optical character recognition technology, combined with a user feedback interface through which users can improve the processing quality of resumes imported into computer resume processing systems. In at least one embodiment, the system and method prompt a user to upload an input resume document, which is processed in a first parsing pass that generates initial resume data by extracting a plurality of resume text blocks. Further processing identifies an initial set of bounding blocks and visually displays the initial resume data for user review and feedback, allowing the user to regroup one or more of the initial set of bounding blocks into a regrouped bounding block. Additional processing then consolidates each of the resume text blocks corresponding to the regrouped bounding blocks into a group text block.
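
The regrouping step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Block` structure, the `regroup` function, and the top-to-bottom, left-to-right reading order are all assumptions made for illustration. The sketch takes the bounding blocks from a first parsing pass, merges the ones a user selected into a single enclosing bounding block, and consolidates their text into one group text block.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Block:
    """A bounding block and its OCR text (hypothetical structure)."""
    x0: float
    y0: float
    x1: float
    y1: float
    text: str


def regroup(blocks: List[Block], selected: List[int]) -> List[Block]:
    """Merge the blocks at the `selected` indices into one group block.

    Returns a new list in which the merged block takes the place of the
    first selected block; unselected blocks keep their relative order.
    """
    chosen = sorted(set(selected))
    if not chosen:
        return list(blocks)

    members = [blocks[i] for i in chosen]
    # Assumed reading order: top to bottom, then left to right.
    members.sort(key=lambda b: (b.y0, b.x0))

    merged = Block(
        # The regrouped bounding block encloses every member block.
        x0=min(b.x0 for b in members),
        y0=min(b.y0 for b in members),
        x1=max(b.x1 for b in members),
        y1=max(b.y1 for b in members),
        # The group text block joins member texts in reading order.
        text="\n".join(b.text for b in members),
    )

    chosen_set = set(chosen)
    result = [b for i, b in enumerate(blocks) if i not in chosen_set]
    result.insert(chosen[0], merged)
    return result
```

For example, if a first pass split a resume's experience section into a heading block and a detail block, a user selecting both would yield one block covering both regions:

```python
initial = [
    Block(0, 0, 10, 5, "Experience"),
    Block(0, 6, 10, 10, "Acme Corp, 2019-2021"),
    Block(0, 20, 10, 25, "Education"),
]
regrouped = regroup(initial, [0, 1])
# regrouped[0].text == "Experience\nAcme Corp, 2019-2021"
```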