-
公开(公告)号:US20240362585A1
公开(公告)日:2024-10-31
申请号:US18764524
申请日:2024-07-05
Applicant: Capital One Services, LLC
Inventor: Thomas S. Poole
IPC: G06Q10/087 , G06Q20/00 , G06Q20/32 , G06Q20/36 , G06Q20/38 , G06Q20/40 , G06V10/10 , G06V20/62 , G06V30/10 , G06V30/142 , G06V30/224 , G06V30/414
CPC classification number: G06Q10/087 , G06Q20/00 , G06Q20/3276 , G06Q20/36 , G06Q20/385 , G06Q20/407 , G06V10/10 , G06V20/63 , G06V30/142 , G06V30/224 , G06V30/414 , G06V30/10
Abstract: A system and method for generating a notification of a price change for a transaction and facilitating an associated price adjustment based on electronic image capture of a paper receipt. An image capture system captures an electronic image of a paper transaction receipt, which is transmitted to a data extraction processor that extracts transaction receipt data from the captured electronic image. A unique key is generated that is matched to a transaction record to generate a reliability score for the OCRed image and to authorize price matching. If the price monitoring server identifies that the price of a purchased transaction item has changed, the price monitoring server automatically generates a notification to the user indicating the price change and an email to the customer service department of the merchant from which the transaction item was purchased, requesting a price adjustment that is credited back to the payment instrument.
-
2.
公开(公告)号:US20240362197A1
公开(公告)日:2024-10-31
申请号:US18763909
申请日:2024-07-03
Applicant: TUNGSTEN AUTOMATION CORPORATION
Inventor: Steve Thompson , Veronika Levdik , Iurii Vymenets , Donghan Lee
IPC: G06F16/22 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414
CPC classification number: G06F16/2282 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414
Abstract: Recent developments in machine learning (commonly coined “artificial intelligence” or “AI”) have vastly expanded applications for this technology, such as myriad “chat” agents adept at understanding natural human language. While state of the art generative models can parse text queries from a user and provide comprehensive, accurate responses (including generating images depicting desired content), current implementations struggle with understanding all information present in images of documents, especially images of business documents. In particular, generative models fail to understand structured and semi-structured information, e.g., as indicated by graphical information such as lines, geometric relationships (e.g., indicated by tables, graphs, figures, etc.), formatting, and other contextual information that human readers easily and implicitly understand. The disclosed inventive concepts transform structured and semi-structured information along with textual content into a textual representation that allows generative models to better understand textual content and non-textual structured information present in document images.
-
公开(公告)号:US12130853B2
公开(公告)日:2024-10-29
申请号:US18500058
申请日:2023-11-01
Applicant: Ancestry.com Operations Inc.
Inventor: Carol Myrick Anderson
IPC: G06F16/35 , G06F40/279 , G06N3/08 , G06V30/413 , G06V30/414
CPC classification number: G06F16/35 , G06F40/279 , G06N3/08 , G06V30/413 , G06V30/414
Abstract: Described herein are systems, methods, and other techniques for segmenting an input text. A set of tokens are extracted from the input text. Token representations are computed for the set of tokens. The token representations are provided to a machine learning model that generates a set of label predictions corresponding to the set of tokens. The machine learning model was previously trained to generate label predictions in response to being provided input token representations. Each of the set of label predictions indicates a position of a particular token of the set of tokens with respect to a particular segment. One or more segments within the input text are determined based on the set of label predictions.
-
公开(公告)号:US12124795B2
公开(公告)日:2024-10-22
申请号:US18061580
申请日:2022-12-05
Applicant: Klatt Works, Inc.
Inventor: Nathan D. Klatt , John David Slack , Divya Prasannan , Vinod Krishnankutty , Edward F. Riehle , Stefan Buckenmaier
IPC: G06F3/0482 , G02B27/01 , G06F3/01 , G06F3/0484 , G06F40/186 , G06V30/414 , G06V30/422
CPC classification number: G06F40/186 , G02B27/0172 , G06F3/011 , G06F3/0482 , G06F3/0484 , G06V30/414 , G06V30/422 , G02B2027/0138 , G02B2027/0141
Abstract: The present patent application describes techniques for generating an enhanced electronic document that may include one or more graphical user interface (GUI) elements that comprise an interactive workflow. An electronic document is automatically processed to identify patterns within the content of the document that indicate individual content items, such as individual steps or instructions associated with a task described in the document, or individual input fields at which information is to be recorded. For each individual content item identified, a data object (e.g., a JSON object) is added to a file, which is ultimately embedded within the original document to create an enhanced electronic document. When the enhanced electronic document is presented via an appropriate document viewing application of a hands-free computing device, the content of the document is presented in combination with the interactive GUI elements so the end-user can interact with the content and GUI elements via audible (spoken) commands.
-
公开(公告)号:US12111953B2
公开(公告)日:2024-10-08
申请号:US17287640
申请日:2019-10-25
Applicant: SERVICENOW CANADA INC.
Inventor: Elena Busila , Jerome Pasquero , Patrick Lazarus
IPC: G06F21/62 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
CPC classification number: G06F21/6254 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20
Abstract: Systems and methods for privacy and sensitive data protection. An image of a document is received at a pre-processing stage and image pre-processing is applied to the image to ensure that the resulting image is sufficient for further processing. Pre-processing may involve processing relating to image quality and image orientation. The image is then passed to an initial processing stage. At the initial processing stage, the relevant data in the document are located and bounding boxes are placed around the data. The resulting image is then passed to a processing stage. At this stage, the type of data within the bounding boxes is determined and suitable replacement data is generated. The replacement data is then inserted into the image to thereby remove and replace the sensitive data in the image.
-
公开(公告)号:US12100234B1
公开(公告)日:2024-09-24
申请号:US17454659
申请日:2021-11-12
Applicant: Lexalytics, Inc.
Inventor: Jeff Catlin , Brian Pinette
IPC: G06V30/10 , G06V30/19 , G06V30/414
CPC classification number: G06V30/414 , G06V30/19073
Abstract: In an exemplary embodiment, the invention comprises a principled edit-distance system that performs a method for determining the probability of character errors. In another exemplary embodiment, the invention comprises a post-OCR error correction system that performs a context-sensitive correction method. In another exemplary embodiment, the invention comprises a post-OCR error correction system that performs a comprehensive, unified correction process based on generalized edit distance analysis, wherein the objective is to find a corrected sentence that has the overall smallest edit distance across all levels. In another exemplary embodiment, the invention comprises a post-OCR error correction system that comprises one or more subjective fractional rank-based dictionaries. In another embodiment, the invention comprises a post-OCR error correction system that performs the automatic assignment of rank to words per-document dictionaries.
-
公开(公告)号:US20240296690A1
公开(公告)日:2024-09-05
申请号:US18664807
申请日:2024-05-15
Applicant: ZenPayroll, Inc.
Inventor: Quentin Louis Raoul Balin
IPC: G06V30/412 , G06V10/24 , G06V10/48 , G06V30/414 , G06V30/416
CPC classification number: G06V30/412 , G06V10/242 , G06V10/48 , G06V30/414 , G06V30/416
Abstract: An image processing system accesses an image of a completed form document. The image of the form document includes one or more features, such as form text, at particular locations within the image. The image processing system accesses a template of the form document and computes a rotation and zoom of the image of the form document relative to the template of the form document based on the locations of the features within the image of the form document relative to the locations of the corresponding features within the template of the form document. The image processing system performs a rotation operation and a zoom operation on the image of the form document, and extracts data entered into fields of the modified image of the form document. The extracted data can be then accessed or stored for subsequent use.
-
8.
公开(公告)号:US12080091B2
公开(公告)日:2024-09-03
申请号:US18331990
申请日:2023-06-09
Applicant: Open Text SA ULC
IPC: G06F17/00 , G06F16/22 , G06F16/25 , G06F16/93 , G06F18/21 , G06F40/174 , G06F40/177 , G06F40/186 , G06F40/216 , G06F40/274 , G06N20/00 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/416
CPC classification number: G06V30/416 , G06F16/2282 , G06F16/258 , G06F16/93 , G06F18/217 , G06F40/174 , G06F40/177 , G06F40/186 , G06F40/216 , G06F40/274 , G06N20/00 , G06V30/1916 , G06V30/412 , G06V30/414
Abstract: A bipartite application implements a table auto-completion (TAC) algorithm on the client side and the server side. A client module runs a local model of the TAC algorithm on a user device and a server module runs a global model of the TAC algorithm on a server machine. The local model is continuously adapted through on-the-fly training, with as few as one negative example, to perform TAC on the client side, one document at a time. Knowledge thus learned by the local model is used to improve the global model on the server side. The global model can be utilized to automatically and intelligently extract table information from a large number of documents with significantly improved accuracy, requiring minimal human intervention even on complex tables.
-
公开(公告)号:US12080089B2
公开(公告)日:2024-09-03
申请号:US17643227
申请日:2021-12-08
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Barton Wayne Emanuel , Nadiya Kochura , Su Liu , Tetsuya Shimada
IPC: G06V30/414 , G06F16/93 , G06F40/58 , G06V30/22 , G06V30/413
CPC classification number: G06V30/414 , G06F16/93 , G06F40/58 , G06V30/22 , G06V30/413
Abstract: A computer-implemented method, a computer system and a computer program product enhance machine translation of a document. The method includes capturing an image of the document. The document includes a plurality of characters that are arranged in a character layout. The method also includes classifying the image by a document type based on the character layout. The method further includes determining a strategy for an intelligent character recognition (ICR) algorithm with the image based on the character layout of the image. Lastly, the method includes generating a translated document by applying the intelligent character recognition (ICR) algorithm to the plurality of characters in the image using the strategy. The translated document includes a plurality of translated characters that are arranged in the character layout.
-
公开(公告)号:US20240265719A1
公开(公告)日:2024-08-08
申请号:US18165125
申请日:2023-02-06
Applicant: International Business Machines Corporation
Inventor: Yuan Yuan DING , Zhong Fang YUAN , Tong LIU , Si Tong ZHAO , Yi Chen ZHONG
IPC: G06V30/19 , G06V10/82 , G06V30/148 , G06V30/414
CPC classification number: G06V30/1914 , G06V10/82 , G06V30/148 , G06V30/19007 , G06V30/414
Abstract: Embodiments of the present disclosure provide systems and methods for implementing enhanced Optical Character Recognition (OCR) of text overlapping scenes through text graph structuring. Text graph structuring is performed to provide a graph data structure for each data character or letter of multiple letters and a library of graph templates from graph structured data of each of the multiple letters. Text graph structuring is performed to convert visual content of an identified overlapping text image region to an overlapping text topology graph. The overlapping text topology graph is split into multiple subgraphs using the graph template library to match recognizable letters in the overlapping text.
-
-
-
-
-
-
-
-
-