MACHINE LEARNING BASED MULTIPAGE SCANNING
    1.
    发明公开

    公开(公告)号:US20230377363A1

    公开(公告)日:2023-11-23

    申请号:US17663785

    申请日:2022-05-17

    Applicant: ADOBE INC.

    CPC classification number: G06V30/41 G06V40/107 G06T7/13 G06T2207/30176

    Abstract: Systems and methods for machine learning based multipage scanning are provided. In one embodiment, one or more processing devices perform operations that include receiving a video stream that includes image frames that capture a plurality of pages of a document. The operations further include detection, via a machine learning model that is trained to infer events from the video stream detects, a new page event. Detection of the new page event indicates that a page of the plurality of pages available for scanning has changed from a first page to a second page. Based on the detection of the new page event, the one or more processing devices capture an image frame of the page from the video stream. In some embodiments, the machine learning model detects events based on a weighted use of video data, inertial data, audio samples, image depth information, image statistics and/or other information.

Patent Agency Ranking