-
公开(公告)号:US12046066B2
公开(公告)日:2024-07-23
申请号:US17252678
申请日:2019-06-21
发明人: Elena Busila , Jerome Pasquero , Tim Beiko , Evelin Fonseca Cruz , Minh-Kim Dao , Majid Laali , Patrick Lazarus
IPC分类号: G06V30/414 , G06F16/906 , G06N20/00 , G06T5/73 , G06T5/80 , G06V30/12 , G06V30/413 , G06V30/416 , G06V30/10
CPC分类号: G06V30/414 , G06F16/906 , G06N20/00 , G06T5/73 , G06T5/80 , G06V30/133 , G06V30/413 , G06V30/416 , G06T2207/20132 , G06V30/10
摘要: Systems and methods for document analysis. An image containing at least one document is received at a pre-processing stage and the image is analyzed for image quality. If the image quality is insufficient for further processing, this is adjusted until the image is suitable for further processing. After the image quality adjustment, the image is then passed to an initial processing stage. At the initial processing stage, the boundaries of one or more documents within the image are determined. In addition, the orientation of the image may be adjusted and the type of document(s) within the image is determined. From the initial processing stage, the adjusted image is then passed to a data extraction stage. At this stage, clusters of data within the document are determined and bounding boxes are placed around the clusters. Data regarding each of the clusters of data is then gathered.
-
公开(公告)号:US12111953B2
公开(公告)日:2024-10-08
申请号:US17287640
申请日:2019-10-25
发明人: Elena Busila , Jerome Pasquero , Patrick Lazarus
IPC分类号: G06F21/62 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
CPC分类号: G06F21/6254 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
摘要: Systems and methods for privacy and sensitive data protection. An image of a document is received at a pre-processing stage and image pre-processing is applied to the image to ensure that the resulting image is sufficient for further processing. Pre-processing may involve processing relating to image quality and image orientation. The image is then passed to an initial processing stage. At the initial processing stage, the relevant data in the document are located and bounding boxes are placed around the data. The resulting image is then passed to a processing stage. At this stage, the type of data within the bounding boxes is determined and suitable replacement data is generated. The replacement data is then inserted into the image to thereby remove and replace the sensitive data in the image.
-