Patent search cpc:"G06V30/414" Page 1

1.

发明公开
SYSTEM AND METHOD FOR PRICE MATCHING THROUGH RECEIPT CAPTURE 审中-公开

公开(公告)号：US20240362585A1

公开(公告)日：2024-10-31

申请号：US18764524

申请日：2024-07-05

Applicant: Capital One Services, LLC

Inventor： Thomas S. Poole

IPC: G06Q10/087 , G06Q20/00 , G06Q20/32 , G06Q20/36 , G06Q20/38 , G06Q20/40 , G06V10/10 , G06V20/62 , G06V30/10 , G06V30/142 , G06V30/224 , G06V30/414

CPC classification number: G06Q10/087 , G06Q20/00 , G06Q20/3276 , G06Q20/36 , G06Q20/385 , G06Q20/407 , G06V10/10 , G06V20/63 , G06V30/142 , G06V30/224 , G06V30/414 , G06V30/10

Abstract: A system and method for generating a notification of a price change for a transaction and facilitating an associated price adjustment based on electronic image capture of a paper receipt. An image capture system captures an electronic image of a paper transaction receipt, which is transmitted to a data extraction processor that extracts transaction receipt data from the captured electronic image. A unique key is generated that is matched to a transaction record to generate a reliability score for the OCRed image and to authorize price matching. If the price monitoring server identifies that the price of a purchased transaction item has changed, the price monitoring server automatically generates a notification to the user indicating the price change and an email to the customer service department of the merchant from which the transaction item was purchased, requesting a price adjustment that is credited back to the payment instrument.

2.

发明公开
AUTOMATED TRANSFORMATION OF INFORMATION FROM IMAGES TO TEXTUAL REPRESENTATIONS, AND APPLICATIONS THEREFOR 审中-公开

公开(公告)号：US20240362197A1

公开(公告)日：2024-10-31

申请号：US18763909

申请日：2024-07-03

Applicant: TUNGSTEN AUTOMATION CORPORATION

Inventor： Steve Thompson , Veronika Levdik , Iurii Vymenets , Donghan Lee

IPC: G06F16/22 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414

CPC classification number: G06F16/2282 , G06V10/70 , G06V30/412 , G06V30/413 , G06V30/414

Abstract: Recent developments in machine learning (commonly coined “artificial intelligence” or “AI”) have vastly expanded applications for this technology, such as myriad “chat” agents adept at understanding natural human language. While state of the art generative models can parse text queries from a user and provide comprehensive, accurate responses (including generating images depicting desired content), current implementations struggle with understanding all information present in images of documents, especially images of business documents. In particular, generative models fail to understand structured and semi-structured information, e.g., as indicated by graphical information such as lines, geometric relationships (e.g., indicated by tables, graphs, figures, etc.), formatting, and other contextual information that human readers easily and implicitly understand. The disclosed inventive concepts transform structured and semi-structured information along with textual content into a textual representation that allows generative models to better understand textual content and non-textual structured information present in document images.

3.

发明授权
Topic segmentation of image-derived text 有权

公开(公告)号：US12130853B2

公开(公告)日：2024-10-29

申请号：US18500058

申请日：2023-11-01

Applicant: Ancestry.com Operations Inc.

Inventor： Carol Myrick Anderson

IPC: G06F16/35 , G06F40/279 , G06N3/08 , G06V30/413 , G06V30/414

CPC classification number: G06F16/35 , G06F40/279 , G06N3/08 , G06V30/413 , G06V30/414

Abstract: Described herein are systems, methods, and other techniques for segmenting an input text. A set of tokens are extracted from the input text. Token representations are computed for the set of tokens. The token representations are provided to a machine learning model that generates a set of label predictions corresponding to the set of tokens. The machine learning model was previously trained to generate label predictions in response to being provided input token representations. Each of the set of label predictions indicates a position of a particular token of the set of tokens with respect to a particular segment. One or more segments within the input text are determined based on the set of label predictions.

4.

发明授权
Techniques for enhancing an electronic document with an interactive workflow 有权

公开(公告)号：US12124795B2

公开(公告)日：2024-10-22

申请号：US18061580

申请日：2022-12-05

Applicant: Klatt Works, Inc.

Inventor： Nathan D. Klatt , John David Slack , Divya Prasannan , Vinod Krishnankutty , Edward F. Riehle , Stefan Buckenmaier

IPC: G06F3/0482 , G02B27/01 , G06F3/01 , G06F3/0484 , G06F40/186 , G06V30/414 , G06V30/422

CPC classification number: G06F40/186 , G02B27/0172 , G06F3/011 , G06F3/0482 , G06F3/0484 , G06V30/414 , G06V30/422 , G02B2027/0138 , G02B2027/0141

Abstract: The present patent application describes techniques for generating an enhanced electronic document that may include one or more graphical user interface (GUI) elements that comprise an interactive workflow. An electronic document is automatically processed to identify patterns within the content of the document that indicate individual content items, such as individual steps or instructions associated with a task described in the document, or individual input fields at which information is to be recorded. For each individual content item identified, a data object (e.g., a JSON object) is added to a file, which is ultimately embedded within the original document to create an enhanced electronic document. When the enhanced electronic document is presented via an appropriate document viewing application of a hands-free computing device, the content of the document is presented in combination with the interactive GUI elements so the end-user can interact with the content and GUI elements via audible (spoken) commands.

5.

发明授权
Sensitive data detection and replacement 有权

公开(公告)号：US12111953B2

公开(公告)日：2024-10-08

申请号：US17287640

申请日：2019-10-25

Applicant: SERVICENOW CANADA INC.

Inventor： Elena Busila , Jerome Pasquero , Patrick Lazarus

IPC: G06F21/62 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20

CPC classification number: G06F21/6254 , G06F21/60 , G06F40/166 , G06V30/414 , G06V30/416 , G06V10/20

Abstract: Systems and methods for privacy and sensitive data protection. An image of a document is received at a pre-processing stage and image pre-processing is applied to the image to ensure that the resulting image is sufficient for further processing. Pre-processing may involve processing relating to image quality and image orientation. The image is then passed to an initial processing stage. At the initial processing stage, the relevant data in the document are located and bounding boxes are placed around the data. The resulting image is then passed to a processing stage. At this stage, the type of data within the bounding boxes is determined and suitable replacement data is generated. The replacement data is then inserted into the image to thereby remove and replace the sensitive data in the image.

6.

发明授权
Post-optical character recognition error correction system and methods of use 有权

公开(公告)号：US12100234B1

公开(公告)日：2024-09-24

申请号：US17454659

申请日：2021-11-12

Applicant: Lexalytics, Inc.

Inventor： Jeff Catlin , Brian Pinette

IPC: G06V30/10 , G06V30/19 , G06V30/414

CPC classification number: G06V30/414 , G06V30/19073

Abstract: In an exemplary embodiment, the invention comprises a principled edit-distance system that performs a method for determining the probability of character errors. In another exemplary embodiment, the invention comprises a post-OCR error correction system that performs a context-sensitive correction method. In another exemplary embodiment, the invention comprises a post-OCR error correction system that performs a comprehensive, unified correction process based on generalized edit distance analysis, wherein the objective is to find a corrected sentence that has the overall smallest edit distance across all levels. In another exemplary embodiment, the invention comprises a post-OCR error correction system that comprises one or more subjective fractional rank-based dictionaries. In another embodiment, the invention comprises a post-OCR error correction system that performs the automatic assignment of rank to words per-document dictionaries.

7.

发明公开
DATA EXTRACTION FROM FORM IMAGES 审中-公开

公开(公告)号：US20240296690A1

公开(公告)日：2024-09-05

申请号：US18664807

申请日：2024-05-15

Applicant: ZenPayroll, Inc.

Inventor： Quentin Louis Raoul Balin

IPC: G06V30/412 , G06V10/24 , G06V10/48 , G06V30/414 , G06V30/416

CPC classification number: G06V30/412 , G06V10/242 , G06V10/48 , G06V30/414 , G06V30/416

Abstract: An image processing system accesses an image of a completed form document. The image of the form document includes one or more features, such as form text, at particular locations within the image. The image processing system accesses a template of the form document and computes a rotation and zoom of the image of the form document relative to the template of the form document based on the locations of the features within the image of the form document relative to the locations of the corresponding features within the template of the form document. The image processing system performs a rotation operation and a zoom operation on the image of the form document, and extracts data entered into fields of the modified image of the form document. The extracted data can be then accessed or stored for subsequent use.

8.

发明授权
Table item information extraction with continuous machine learning through local and global models 有权

公开(公告)号：US12080091B2

公开(公告)日：2024-09-03

申请号：US18331990

申请日：2023-06-09

Applicant: Open Text SA ULC

Inventor： Matthias Theodor Middendorf , Gisela Barbara Cäcilie Hammann , Carsten Peust

IPC: G06F17/00 , G06F16/22 , G06F16/25 , G06F16/93 , G06F18/21 , G06F40/174 , G06F40/177 , G06F40/186 , G06F40/216 , G06F40/274 , G06N20/00 , G06V30/19 , G06V30/412 , G06V30/414 , G06V30/416

CPC classification number: G06V30/416 , G06F16/2282 , G06F16/258 , G06F16/93 , G06F18/217 , G06F40/174 , G06F40/177 , G06F40/186 , G06F40/216 , G06F40/274 , G06N20/00 , G06V30/1916 , G06V30/412 , G06V30/414

Abstract: A bipartite application implements a table auto-completion (TAC) algorithm on the client side and the server side. A client module runs a local model of the TAC algorithm on a user device and a server module runs a global model of the TAC algorithm on a server machine. The local model is continuously adapted through on-the-fly training, with as few as one negative example, to perform TAC on the client side, one document at a time. Knowledge thus learned by the local model is used to improve the global model on the server side. The global model can be utilized to automatically and intelligently extract table information from a large number of documents with significantly improved accuracy, requiring minimal human intervention even on complex tables.

9.

发明授权
Enhancing machine translation of handwritten documents 有权

公开(公告)号：US12080089B2

公开(公告)日：2024-09-03

申请号：US17643227

申请日：2021-12-08

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor： Barton Wayne Emanuel , Nadiya Kochura , Su Liu , Tetsuya Shimada

IPC: G06V30/414 , G06F16/93 , G06F40/58 , G06V30/22 , G06V30/413

CPC classification number: G06V30/414 , G06F16/93 , G06F40/58 , G06V30/22 , G06V30/413

Abstract: A computer-implemented method, a computer system and a computer program product enhance machine translation of a document. The method includes capturing an image of the document. The document includes a plurality of characters that are arranged in a character layout. The method also includes classifying the image by a document type based on the character layout. The method further includes determining a strategy for an intelligent character recognition (ICR) algorithm with the image based on the character layout of the image. Lastly, the method includes generating a translated document by applying the intelligent character recognition (ICR) algorithm to the plurality of characters in the image using the strategy. The translated document includes a plurality of translated characters that are arranged in the character layout.

10.

发明公开
OCR OF TEXT OVERLAPPING SCENES THROUGH TEXT GRAPH STRUCTURING 审中-公开

公开(公告)号：US20240265719A1

公开(公告)日：2024-08-08

申请号：US18165125

申请日：2023-02-06

Applicant: International Business Machines Corporation

Inventor： Yuan Yuan DING , Zhong Fang YUAN , Tong LIU , Si Tong ZHAO , Yi Chen ZHONG

IPC: G06V30/19 , G06V10/82 , G06V30/148 , G06V30/414

CPC classification number: G06V30/1914 , G06V10/82 , G06V30/148 , G06V30/19007 , G06V30/414

Abstract: Embodiments of the present disclosure provide systems and methods for implementing enhanced Optical Character Recognition (OCR) of text overlapping scenes through text graph structuring. Text graph structuring is performed to provide a graph data structure for each data character or letter of multiple letters and a library of graph templates from graph structured data of each of the multiple letters. Text graph structuring is performed to convert visual content of an identified overlapping text image region to an overlapping text topology graph. The overlapping text topology graph is split into multiple subgraphs using the graph template library to match recognizable letters in the overlapping text.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification