Patent search ap:("SAP SE") AND inv:"Christian Reisswig" Page 3

21.

发明授权
Robust key value extraction 有权

公开(公告)号：US10824808B2

公开(公告)日：2020-11-03

申请号：US16196153

申请日：2018-11-20

Applicant: SAP SE

Inventor： Christian Reisswig , Eduardo Vellasques , Sohyeong Kim , Darko Velkoski , Hung Tu Dinh

IPC: G10L15/02 , G06F40/295 , G06F40/289 , G06N3/04 , G06N3/08 , G06F40/30

Abstract: Disclosed herein are system, method, and computer program product embodiments for robust key value extraction. In an embodiment, one or more hierarchical concepts units (HCUs) may be configured to extract key value and hierarchical information from text inputs. The HCUs may use a convolutional neural network, a recurrent neural network, and feature selectors to analyze the text inputs using machine learning techniques to extract the key value and hierarchical information. Multiple HCUs may be used together and configured to identify different categories of hierarchical information. While multiple HCUs may be used, each may use a skip connection to transmit extracted information to a feature concatenation layer. This allows an HCU to directly send a concept that has been identified as important to the feature concatenation layer and bypass other HCUs.

22.

发明申请
OPTICAL CHARACTER RECOGNITION USING END-TO-END DEEP LEARNING 审中-公开

公开(公告)号：US20200082218A1

公开(公告)日：2020-03-12

申请号：US16123177

申请日：2018-09-06

Applicant: SAP SE

Inventor： Johannes Hoehne , Anoop Raveendra Katti , Christian Reisswig

IPC: G06K9/62 , G06N3/08 , G06F17/27

Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition using end-to-end deep learning. In an embodiment, an optical character recognition system may train a neural network to identify characters of pixel images and to assign index values to the characters. The neural network may also be trained to identify groups of characters and to generate bounding boxes to group these characters. The optical character recognition system may then analyze documents to identify character information based on the pixel data and produce a segmentation mask and one or more bounding box masks. The optical character recognition system may supply these masks as an output or may combine the masks to generate a version of the received document having optically recognized characters.

23.

发明申请
DATA-DRIVEN STRUCTURE EXTRACTION FROM TEXT DOCUMENTS 有权

公开(公告)号：US20250103815A1

公开(公告)日：2025-03-27

申请号：US18975483

申请日：2024-12-10

Applicant: SAP SE

Inventor： Christian Reisswig

IPC: G06F40/295 , G06F16/35 , G06F40/14 , G06F40/284 , G06N3/04 , G06N20/20

Abstract: Methods and apparatus are disclosed for extracting structured content, as graphs, from text documents. Graph vertices and edges correspond to document tokens and pairwise relationships between tokens. Undirected peer relationships and directed relationships (e.g. key-value or composition) are supported. Vertices can be identified with predefined fields, and thence mapped to database columns for automated storage of document content in a database. A trained neural network classifier determines relationship classifications for all pairwise combinations of input tokens. The relationship classification can differentiate multiple relationship types. A multi-level classifier extracts multi-level graph structure from a document. Disclosed embodiments support arbitrary graph structures with hierarchical and planar relationships. Relationships are not restricted by spatial proximity or document layout. Composite tokens can be identified interspersed with other content. A single token can belong to multiple higher level structures according to its various relationships. Examples and variations are disclosed.

24.

发明申请
TARGETED DOCUMENT INFORMATION EXTRACTION 有权

公开(公告)号：US20220215446A1

公开(公告)日：2022-07-07

申请号：US17142865

申请日：2021-01-06

Applicant: SAP SE

Inventor： YING JIANG , Christian Reisswig

IPC: G06Q30/04 , G06Q30/06 , G06F40/169 , G06F40/186

Abstract: Disclosed herein are various embodiments for targeted document information extraction. An embodiment operates by receiving a document associated with a particular customer of a plurality of customers. It is determined whether to use a global processor or template processor to analyze the document based on whether one or more customer templates are associated with the particular customer. Which of the one or more templates associated with the particular customer correspond to the document is identified. The document is compared to the identified template associated with the customer. Information is extracted from the document based on the identified template and the identified plurality of variations. The extracted information for the document is output.

25.

发明申请
ADAPTIVE HIGH-RESOLUTION DIGITAL IMAGE PROCESSING WITH NEURAL NETWORKS 有权

公开(公告)号：US20210374548A1

公开(公告)日：2021-12-02

申请号：US16890977

申请日：2020-06-02

Applicant: SAP SE

Inventor： Christian Reisswig , Shachar Klaiman

IPC: G06N3/08 , G06N3/04

Abstract: Technologies are described for performing adaptive high-resolution digital image processing using neural networks. For example, a number of different regions can be defined representing portions of a digital image. One of the regions covers the entire digital image at a reduced resolution. The other regions cover less than the entire digital image at resolutions higher than the region covering the entire digital image. Neural networks are then used to process each of the regions. The neural networks share information using prolongation and restriction operations. Prolongation operations propagate activations from a neural network operating on a lower resolution region to context zones of a neural network operating on a higher resolution region. Restriction operations propagate activations from the neural network operating on the higher resolution region back to the neural network operating on the lower resolution region.

26.

发明授权
Contextual vector grids for document processing 有权

公开(公告)号：US11138424B2

公开(公告)日：2021-10-05

申请号：US16689516

申请日：2019-11-20

Applicant: SAP SE

Inventor： Timo Denk , Christian Reisswig

IPC: G06K9/00 , G06T7/70 , G06K9/72

Abstract: Disclosed herein are system, method, and computer program product embodiments for analyzing contextual symbol information for document processing. In an embodiment, a language model system may generate a vector grid that incorporates contextual document information. The language model system may receive a document file and identify symbols of the document file to generate a symbol grid. The language model system may also identify position parameters corresponding to each of the symbols. The language model system may then analyze the symbols using an embedding function and neighboring symbols to determine contextual vector values corresponding to each of the symbols. The language model system may then generate a vector grid mapping the contextual vector values using the position parameters. The contextual information from the vector grid may provide increase document processing accuracy as well as faster processing convergence.

27.

发明申请
POSITIONAL EMBEDDINGS FOR DOCUMENT PROCESSING 有权

公开(公告)号：US20210150201A1

公开(公告)日：2021-05-20

申请号：US16689498

申请日：2019-11-20

Applicant: SAP SE

Inventor： Christian Reisswig , Stefan Klaus Baur

IPC: G06K9/00 , G06T7/70 , G06N3/04

Abstract: Disclosed herein are system, method, and computer program product embodiments for generating document labels using positional embeddings. In an embodiment, a label system may identify tokens, such as words, of a document image. The label system may apply a position vector neural network to the document image to analyze the pixels and determine positional embedding vectors corresponding to the words. The label system may then combine the positional embedding vectors to corresponding word vectors for use as an input to a neural network trained to generate document labels. This combination may embed the positional information with the corresponding word information in a serialized manner for processing by the document label neural network. Using this formatting, the label system may generate document labels in a light-weight and fast manner while still preserving spatial relationships between words.

28.

发明授权
Bi-directional contextualized text description 有权

公开(公告)号：US10963645B2

公开(公告)日：2021-03-30

申请号：US16270328

申请日：2019-02-07

Applicant: SAP SE

Inventor： Christian Reisswig , Darko Velkoski , Sohyeong Kim , Hung Tu Dinh , Faisal El Hussein

IPC: G06F40/30 , G10L15/183

Abstract: Various examples described herein are directed to systems and methods for analyzing text. A computing device may train an autoencoder language model using a plurality of language model training samples. The autoencoder language mode may comprise a first convolutional layer. Also, a first language model training sample of the plurality of language model training samples may comprise a first set of ordered strings comprising a masked string, a first string preceding the masked string in the first set of ordered strings, and a second string after the masked string in the first set of ordered strings. The computing device may generate a first feature vector using an input sample and the autoencoder language model. The computing device may also generate a descriptor of the input sample using a target model, the input sample, and the first feature vector.

29.

发明申请
CONTEXTUALIZED TEXT DESCRIPTION 审中-公开

公开(公告)号：US20200257764A1

公开(公告)日：2020-08-13

申请号：US16275025

申请日：2019-02-13

Applicant: SAP SE

Inventor： Christian Reisswig , Darko Velkoski , Sohyeong Kim , Hung Tu Dinh

IPC: G06F17/27

Abstract: Various examples are directed to systems and methods for classifying text. A computing device may access, from a database, an input sample comprising a first set of ordered words. The computing device may generate a first language model feature vector for the input sample using a word level language model and a second language model feature vector for the input sample using a partial word level language model. The computing device may generate a descriptor of the input sample using a target model, the input sample, the first language model feature vector, and the second language model feature vector and write the descriptor of the input sample to the database.

30.

发明申请
TWO-DIMENSIONAL DOCUMENT PROCESSING 审中-公开

公开(公告)号：US20190354818A1

公开(公告)日：2019-11-21

申请号：US15983489

申请日：2018-05-18

Applicant: SAP SE

Inventor： Christian Reisswig , Anoop Raveendra Katti , Steffen Bickel , Johannes Hoehne , Jean Baptiste Faddoul

IPC: G06K9/72 , G06K9/00 , G06N3/08 , G06N3/04

Abstract: Disclosed herein are system, method, and computer program product embodiments for processing a document. In an embodiment, a document processing system may receive a document. The document processing system may perform optical character recognition to obtain character information and positioning information for the characters. The document processing system may generate a down-sampled two-dimensional character grid for the document. The document processing system may apply a convolutional neural network to the character grid to obtain semantic meaning for the document. The convolutional neural network may produce a segmentation mask and bounding boxes to correspond to the document.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification