-
公开(公告)号:US10824808B2
公开(公告)日:2020-11-03
申请号:US16196153
申请日:2018-11-20
Applicant: SAP SE
Inventor: Christian Reisswig , Eduardo Vellasques , Sohyeong Kim , Darko Velkoski , Hung Tu Dinh
IPC: G10L15/02 , G06F40/295 , G06F40/289 , G06N3/04 , G06N3/08 , G06F40/30
Abstract: Disclosed herein are system, method, and computer program product embodiments for robust key value extraction. In an embodiment, one or more hierarchical concepts units (HCUs) may be configured to extract key value and hierarchical information from text inputs. The HCUs may use a convolutional neural network, a recurrent neural network, and feature selectors to analyze the text inputs using machine learning techniques to extract the key value and hierarchical information. Multiple HCUs may be used together and configured to identify different categories of hierarchical information. While multiple HCUs may be used, each may use a skip connection to transmit extracted information to a feature concatenation layer. This allows an HCU to directly send a concept that has been identified as important to the feature concatenation layer and bypass other HCUs.
-
公开(公告)号:US20200082218A1
公开(公告)日:2020-03-12
申请号:US16123177
申请日:2018-09-06
Applicant: SAP SE
Inventor: Johannes Hoehne , Anoop Raveendra Katti , Christian Reisswig
Abstract: Disclosed herein are system, method, and computer program product embodiments for optical character recognition using end-to-end deep learning. In an embodiment, an optical character recognition system may train a neural network to identify characters of pixel images and to assign index values to the characters. The neural network may also be trained to identify groups of characters and to generate bounding boxes to group these characters. The optical character recognition system may then analyze documents to identify character information based on the pixel data and produce a segmentation mask and one or more bounding box masks. The optical character recognition system may supply these masks as an output or may combine the masks to generate a version of the received document having optically recognized characters.
-
公开(公告)号:US20250103815A1
公开(公告)日:2025-03-27
申请号:US18975483
申请日:2024-12-10
Applicant: SAP SE
Inventor: Christian Reisswig
IPC: G06F40/295 , G06F16/35 , G06F40/14 , G06F40/284 , G06N3/04 , G06N20/20
Abstract: Methods and apparatus are disclosed for extracting structured content, as graphs, from text documents. Graph vertices and edges correspond to document tokens and pairwise relationships between tokens. Undirected peer relationships and directed relationships (e.g. key-value or composition) are supported. Vertices can be identified with predefined fields, and thence mapped to database columns for automated storage of document content in a database. A trained neural network classifier determines relationship classifications for all pairwise combinations of input tokens. The relationship classification can differentiate multiple relationship types. A multi-level classifier extracts multi-level graph structure from a document. Disclosed embodiments support arbitrary graph structures with hierarchical and planar relationships. Relationships are not restricted by spatial proximity or document layout. Composite tokens can be identified interspersed with other content. A single token can belong to multiple higher level structures according to its various relationships. Examples and variations are disclosed.
-
公开(公告)号:US20220215446A1
公开(公告)日:2022-07-07
申请号:US17142865
申请日:2021-01-06
Applicant: SAP SE
Inventor: YING JIANG , Christian Reisswig
IPC: G06Q30/04 , G06Q30/06 , G06F40/169 , G06F40/186
Abstract: Disclosed herein are various embodiments for targeted document information extraction. An embodiment operates by receiving a document associated with a particular customer of a plurality of customers. It is determined whether to use a global processor or template processor to analyze the document based on whether one or more customer templates are associated with the particular customer. Which of the one or more templates associated with the particular customer correspond to the document is identified. The document is compared to the identified template associated with the customer. Information is extracted from the document based on the identified template and the identified plurality of variations. The extracted information for the document is output.
-
公开(公告)号:US20210374548A1
公开(公告)日:2021-12-02
申请号:US16890977
申请日:2020-06-02
Applicant: SAP SE
Inventor: Christian Reisswig , Shachar Klaiman
Abstract: Technologies are described for performing adaptive high-resolution digital image processing using neural networks. For example, a number of different regions can be defined representing portions of a digital image. One of the regions covers the entire digital image at a reduced resolution. The other regions cover less than the entire digital image at resolutions higher than the region covering the entire digital image. Neural networks are then used to process each of the regions. The neural networks share information using prolongation and restriction operations. Prolongation operations propagate activations from a neural network operating on a lower resolution region to context zones of a neural network operating on a higher resolution region. Restriction operations propagate activations from the neural network operating on the higher resolution region back to the neural network operating on the lower resolution region.
-
公开(公告)号:US11138424B2
公开(公告)日:2021-10-05
申请号:US16689516
申请日:2019-11-20
Applicant: SAP SE
Inventor: Timo Denk , Christian Reisswig
Abstract: Disclosed herein are system, method, and computer program product embodiments for analyzing contextual symbol information for document processing. In an embodiment, a language model system may generate a vector grid that incorporates contextual document information. The language model system may receive a document file and identify symbols of the document file to generate a symbol grid. The language model system may also identify position parameters corresponding to each of the symbols. The language model system may then analyze the symbols using an embedding function and neighboring symbols to determine contextual vector values corresponding to each of the symbols. The language model system may then generate a vector grid mapping the contextual vector values using the position parameters. The contextual information from the vector grid may provide increase document processing accuracy as well as faster processing convergence.
-
公开(公告)号:US20210150201A1
公开(公告)日:2021-05-20
申请号:US16689498
申请日:2019-11-20
Applicant: SAP SE
Inventor: Christian Reisswig , Stefan Klaus Baur
Abstract: Disclosed herein are system, method, and computer program product embodiments for generating document labels using positional embeddings. In an embodiment, a label system may identify tokens, such as words, of a document image. The label system may apply a position vector neural network to the document image to analyze the pixels and determine positional embedding vectors corresponding to the words. The label system may then combine the positional embedding vectors to corresponding word vectors for use as an input to a neural network trained to generate document labels. This combination may embed the positional information with the corresponding word information in a serialized manner for processing by the document label neural network. Using this formatting, the label system may generate document labels in a light-weight and fast manner while still preserving spatial relationships between words.
-
公开(公告)号:US10963645B2
公开(公告)日:2021-03-30
申请号:US16270328
申请日:2019-02-07
Applicant: SAP SE
Inventor: Christian Reisswig , Darko Velkoski , Sohyeong Kim , Hung Tu Dinh , Faisal El Hussein
IPC: G06F40/30 , G10L15/183
Abstract: Various examples described herein are directed to systems and methods for analyzing text. A computing device may train an autoencoder language model using a plurality of language model training samples. The autoencoder language mode may comprise a first convolutional layer. Also, a first language model training sample of the plurality of language model training samples may comprise a first set of ordered strings comprising a masked string, a first string preceding the masked string in the first set of ordered strings, and a second string after the masked string in the first set of ordered strings. The computing device may generate a first feature vector using an input sample and the autoencoder language model. The computing device may also generate a descriptor of the input sample using a target model, the input sample, and the first feature vector.
-
公开(公告)号:US20200257764A1
公开(公告)日:2020-08-13
申请号:US16275025
申请日:2019-02-13
Applicant: SAP SE
Inventor: Christian Reisswig , Darko Velkoski , Sohyeong Kim , Hung Tu Dinh
IPC: G06F17/27
Abstract: Various examples are directed to systems and methods for classifying text. A computing device may access, from a database, an input sample comprising a first set of ordered words. The computing device may generate a first language model feature vector for the input sample using a word level language model and a second language model feature vector for the input sample using a partial word level language model. The computing device may generate a descriptor of the input sample using a target model, the input sample, the first language model feature vector, and the second language model feature vector and write the descriptor of the input sample to the database.
-
公开(公告)号:US20190354818A1
公开(公告)日:2019-11-21
申请号:US15983489
申请日:2018-05-18
Applicant: SAP SE
Inventor: Christian Reisswig , Anoop Raveendra Katti , Steffen Bickel , Johannes Hoehne , Jean Baptiste Faddoul
Abstract: Disclosed herein are system, method, and computer program product embodiments for processing a document. In an embodiment, a document processing system may receive a document. The document processing system may perform optical character recognition to obtain character information and positioning information for the characters. The document processing system may generate a down-sampled two-dimensional character grid for the document. The document processing system may apply a convolutional neural network to the character grid to obtain semantic meaning for the document. The convolutional neural network may produce a segmentation mask and bounding boxes to correspond to the document.
-
-
-
-
-
-
-
-
-