-
公开(公告)号:US20230377363A1
公开(公告)日:2023-11-23
申请号:US17663785
申请日:2022-05-17
Applicant: ADOBE INC.
Inventor: Tong SUN , Nicholas Sergei REWKOWSKI , Nedim LIPKA , Jennifer Anne HEALEY , Curtis Michael WIGINGTON , Anshul MALIK
CPC classification number: G06V30/41 , G06V40/107 , G06T7/13 , G06T2207/30176
Abstract: Systems and methods for machine learning based multipage scanning are provided. In one embodiment, one or more processing devices perform operations that include receiving a video stream that includes image frames that capture a plurality of pages of a document. The operations further include detection, via a machine learning model that is trained to infer events from the video stream detects, a new page event. Detection of the new page event indicates that a page of the plurality of pages available for scanning has changed from a first page to a second page. Based on the detection of the new page event, the one or more processing devices capture an image frame of the page from the video stream. In some embodiments, the machine learning model detects events based on a weighted use of video data, inertial data, audio samples, image depth information, image statistics and/or other information.
-
公开(公告)号:US20230085687A1
公开(公告)日:2023-03-23
申请号:US17991249
申请日:2022-11-21
Applicant: ADOBE INC.
Inventor: Ashutosh MEHRA , Vlad Ion MORARIU , Kajal GUPTA , Jayant Vaibhav SRIVASTAVA , Curtis Michael WIGINGTON , Tushar TIWARI
IPC: G06V30/414 , G06N3/02 , G06K9/62 , G06N20/00
Abstract: Various disclosed embodiments can resolve output inaccuracies produced by many machine learning models. Embodiments use content order as input to machine learning model systems so that they can process documents according to the position or rank of instances in a document or image. In this way, the model is less likely to misclassify or incorrectly detect instances or the ordering between predicted instances. The content order in various embodiments can be used as an additional signal to classify or make predictions.
-