Patent search ap:("SAP SE") AND inv:"Marco Spinaci" Page 1

1.

发明申请
VISUALLY-AWARE ENCODINGS FOR CHARACTERS 有权

公开(公告)号：US20210174141A1

公开(公告)日：2021-06-10

申请号：US16704940

申请日：2019-12-05

Applicant: SAP SE

Inventor： Johannes Hoehne , Marco Spinaci

IPC: G06K9/62 , G06F16/51 , G06N3/08 , G06N3/04

Abstract: In some embodiments, a method inputs a set of images into a network and trains the network based on a classification of the set of images to one or more characters in a set of characters. The method obtains a set of encodings for the one or more characters based on a layer of the network that restricts the output of the layer to a number of values. Then, the method stores the set of encodings for the one or more characters, wherein an encoding in the set of encodings is retrievable when a corresponding character is determined.

2.

发明授权
Visually-aware encodings for characters 有权

公开(公告)号：US11275969B2

公开(公告)日：2022-03-15

申请号：US16704940

申请日：2019-12-05

Applicant: SAP SE

Inventor： Johannes Hoehne , Marco Spinaci

IPC: G06K9/62 , G06F16/51 , G06N3/04 , G06N3/08 , G06F40/53

Abstract: In some embodiments, a method inputs a set of images into a network and trains the network based on a classification of the set of images to one or more characters in a set of characters. The method obtains a set of encodings for the one or more characters based on a layer of the network that restricts the output of the layer to a number of values. Then, the method stores the set of encodings for the one or more characters, wherein an encoding in the set of encodings is retrievable when a corresponding character is determined.

3.

发明申请
DETECTING TABLES IN DOCUMENTS 有权

公开(公告)号：US20250131761A1

公开(公告)日：2025-04-24

申请号：US18381874

申请日：2023-10-19

Applicant: SAP SE

Inventor： Marek Polewczyk , Marco Spinaci

IPC: G06V30/416 , G06V30/10 , G06V30/412

Abstract: Various examples are directed to systems and methods for determining table data from a document image depicting a plurality of words and at least one table comprising at least a portion of the plurality of words. For example, Optical Character Recognition (OCR) data may be determined based on the document image. A table detection model may be executed based at least in part on the OCR data.

4.

发明公开
MACHINE LEARNING ENABLED DOCUMENT DESKEWING 审中-公开

公开(公告)号：US20230222632A1

公开(公告)日：2023-07-13

申请号：US17570678

申请日：2022-01-07

Applicant: SAP SE

Inventor： Marek Polewczyk , Marco Spinaci

IPC: G06T5/00 , G06V30/414 , G06V10/774 , G06V30/16

CPC classification number: G06T5/006 , G06V10/774 , G06V30/414 , G06V30/1607 , G06T2207/10008 , G06T2207/20081 , G06T2207/30176

Abstract: A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.

5.

发明授权
Object detection and candidate filtering system 有权

公开(公告)号：US10915786B2

公开(公告)日：2021-02-09

申请号：US16288357

申请日：2019-02-28

Applicant: SAP SE

Inventor： Johannes Hoehne , Anoop Raveendra Katti , Christian Reisswig , Marco Spinaci

IPC: G06K9/62

Abstract: Disclosed herein are system, method, and computer program product embodiments for providing object detection and filtering operations. An embodiment operates by receiving an image comprising a plurality of pixels and pixel information for each pixel. The pixel information indicates a bounding box corresponding to an object within the image associated with a respective pixel and a confidence score associated with the bounding box for the respective pixel. Pixels that do not correspond to a center of at least one of the bounding boxes are iteratively removed from the plurality of pixels until a subset of pixels each of which correspond to a center of at least one of the bounding boxes remains. Based on the subset, a final bounding box associated with each object of the image is determined based on an overlapping of the bounding boxes of the subset of pixels and the corresponding confidence scores.

6.

发明授权
Machine learning enabled document deskewing 有权

公开(公告)号：US12165298B2

公开(公告)日：2024-12-10

申请号：US17570678

申请日：2022-01-07

Applicant: SAP SE

Inventor： Marek Polewczyk , Marco Spinaci

IPC: G06T5/80 , G06V10/774 , G06V30/16 , G06V30/414

Abstract: A method may include determining, based at least on an image of a document, a plurality of text bounding boxes enclosing lines of text present in the document. A machine learning model may be trained to determine, based at least on the coordinates defining the text bounding boxes, the coordinates of a document bounding box enclosing the text bounding boxes. The document bounding box may encapsulate the visual aberrations that are present in the image of the document. As such, one or more transformations may be determined based on the coordinates of the document bounding box. The image of the document may be deskewed by applying the transformations. One or more downstream tasks may be performed based on the deskewed image of the document. Related methods and articles of manufacture are also disclosed.

7.

发明申请
CHARACTER ENCODING AND DECODING FOR OPTICAL CHARACTER RECOGNITION 有权

公开(公告)号：US20220391637A1

公开(公告)日：2022-12-08

申请号：US17340794

申请日：2021-06-07

Applicant: SAP SE

Inventor： Marco Spinaci , Marek Polewczyk

IPC: G06K9/62

Abstract: The present disclosure provides techniques for encoding and decoding characters for optical character recognition. The techniques involve determining sets of numbers for encoding a character set where each number in a particular set of numbers for encoding a particular character is mapped to a graphical unit (e.g., radical) of the particular character. A mapping between each set of numbers in the possible encodings and the character set may be determined based the closest character already encoded. A machine learning model may be trained to perform optical character recognition using training data labeled using the set of encodings and the mappings.

8.

发明申请
Object Detection and Candidate Filtering System 审中-公开

公开(公告)号：US20200279128A1

公开(公告)日：2020-09-03

申请号：US16288357

申请日：2019-02-28

Applicant: SAP SE

Inventor： Johannes Hoehne , Anoop Raveendra Katti , Christian Reisswig , Marco Spinaci

IPC: G06K9/62

Abstract: Disclosed herein are system, method, and computer program product embodiments for providing object detection and filtering operations. An embodiment operates by receiving an image comprising a plurality of pixels and pixel information for each pixel. The pixel information indicates a bounding box corresponding to an object within the image associated with a respective pixel and a confidence score associated with the bounding box for the respective pixel. Pixels that do not correspond to a center of at least one of the bounding boxes are iteratively removed from the plurality of pixels until a subset of pixels each of which correspond to a center of at least one of the bounding boxes remains. Based on the subset, a final bounding box associated with each object of the image is determined based on an overlapping of the bounding boxes of the subset of pixels and the corresponding confidence scores.

9.

发明授权
Character encoding and decoding for optical character recognition 有权

公开(公告)号：US11816182B2

公开(公告)日：2023-11-14

申请号：US17340794

申请日：2021-06-07

Applicant: SAP SE

Inventor： Marco Spinaci , Marek Polewczyk

IPC: G06F18/214 , G06V30/28

CPC classification number: G06F18/214 , G06V30/287

Abstract: The present disclosure provides techniques for encoding and decoding characters for optical character recognition. The techniques involve determining sets of numbers for encoding a character set where each number in a particular set of numbers for encoding a particular character is mapped to a graphical unit (e.g., radical) of the particular character. A mapping between each set of numbers in the possible encodings and the character set may be determined based the closest character already encoded. A machine learning model may be trained to perform optical character recognition using training data labeled using the set of encodings and the mappings.

10.

发明授权
Rotation and scaling for optical character recognition using end-to-end deep learning 有权

公开(公告)号：US11302108B2

公开(公告)日：2022-04-12

申请号：US16565614

申请日：2019-09-10

Applicant: SAP SE

Inventor： Johannes Hoehne , Marco Spinaci , Anoop Raveendra Katti

IPC: G06K9/34 , G06V30/148 , G06N20/00 , G06T7/10 , G06F16/901 , G06F17/18 , G06V30/10

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition (OCR) pre-processing using machine learning. In an embodiment, a neural network may be trained to identify a standardized document rotation and scale expected by an OCR service performing character recognition. The neural network may then analyze a received document image to identify a corresponding rotation and scale of the document image relative to the expected standardized values. In response to this identification, the document image may be modified in the inverse to standardize the rotation and scale of the document image to match the format expected by the OCR service. In some embodiments, a neural network may perform the standardization as well as the character recognition using a shared computation graph.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification