DATA PROCESSING METHOD AND RELATED DEVICE
    Invention publication

    Publication number: US20240355134A1

    Publication date: 2024-10-24

    Application number: US18761274

    Filing date: 2024-07-01

    CPC classification number: G06V30/412 G06F40/143 G06V10/98 G06V30/414

    Abstract: In a data processing method, a processing device obtains a to-be-processed table image and determines a table recognition result based on the table image and a generative table recognition policy. The generative table recognition policy indicates that the table recognition result of the table image is to be determined using a markup language and a non-overlapping attribute of a bounding box. The bounding box indicates a position of text included in a cell of a table associated with the table image, and the table recognition result indicates the global structure and content of the table. The processing device then outputs the table recognition result.

    METHOD AND SYSTEM FOR TREE-BASED TEXT REPRESENTATION AND COMPARISON

    Publication number: US20240320995A1

    Publication date: 2024-09-26

    Application number: US18124392

    Filing date: 2023-03-21

    CPC classification number: G06V30/414 G06F40/194 G06F40/205

    Abstract: A method for facilitating electronic textual representation and comparison is disclosed. The method includes receiving, via a graphical user interface, a comparison request that includes a first electronic document and a second electronic document; parsing the first electronic document and the second electronic document to classify textual data; generating, by using the classified textual data, a first tree structure for the first electronic document and a second tree structure for the second electronic document; constructing a first hierarchy dictionary for the first tree structure and a second hierarchy dictionary for the second tree structure; determining differences between the first electronic document and the second electronic document by using the first tree structure, the first hierarchy dictionary, the second tree structure, and the second hierarchy dictionary; and generating graphical representations that depict the differences and textual representations that summarize the differences.
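
The abstract does not say how the trees or hierarchy dictionaries are laid out. As a rough illustration only, the sketch below assumes each node carries a label and some text, and that a "hierarchy dictionary" maps a node's path from the root to its text. The names `Node`, `hierarchy_dict`, and `diff_documents` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One classified unit of text, e.g. a heading or a paragraph (assumed layout)."""
    label: str
    text: str = ""
    children: list["Node"] = field(default_factory=list)


def hierarchy_dict(root: Node) -> dict[str, str]:
    """Map each node's path from the root to its text.

    Assumption: sibling labels are unique, so every path is a unique key.
    """
    out: dict[str, str] = {}

    def walk(node: Node, prefix: str) -> None:
        path = f"{prefix}/{node.label}"
        out[path] = node.text
        for child in node.children:
            walk(child, path)

    walk(root, "")
    return out


def diff_documents(first: Node, second: Node) -> dict[str, list[str]]:
    """Report paths added to, removed from, or changed in the second document."""
    a, b = hierarchy_dict(first), hierarchy_dict(second)
    return {
        "added": sorted(b.keys() - a.keys()),
        "removed": sorted(a.keys() - b.keys()),
        "changed": sorted(p for p in a.keys() & b.keys() if a[p] != b[p]),
    }
```

Comparing path-keyed dictionaries keeps the difference step to a few set operations; a real system would also need to handle moved or renamed sections, which this sketch ignores.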

    SYSTEMS AND METHODS FOR GENERATING DOCUMENT TEMPLATES FROM A MIXED SET OF DOCUMENT TYPES

    Publication number: US20240311555A1

    Publication date: 2024-09-19

    Application number: US18671083

    Filing date: 2024-05-22

    CPC classification number: G06F40/186 G06F16/355 G06V30/10 G06V30/414

    Abstract: A template generation system for generating document templates from a mixed set of document types includes a template generation server programmed to receive a batch of documents, identify a plurality of text blocks, generate a plurality of clusters, generate a plurality of document arrays corresponding to the plurality of clusters, and compare each document array to each other document array to determine a percentage match. When the percentage match between two or more frameworks exceeds a threshold, the template generation system defines a subset of documents, and for each subset of documents, the template generation system generates a template for the subset of documents. The template is a collection of the text blocks that are commonly included in each of the documents of the subset.
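
The matching and template steps can be sketched roughly as follows. The abstract does not define the percentage-match metric or the shape of a document array, so this sketch assumes each document is reduced to a set of text-block strings and uses shared blocks over total distinct blocks as the match. The function names and the grouping strategy (union-find over pairs above the threshold) are assumptions for illustration, not the patented method.

```python
from itertools import combinations


def percentage_match(a: set[str], b: set[str]) -> float:
    """Shared text blocks as a percentage of all distinct blocks (assumed metric)."""
    if not a and not b:
        return 100.0
    return 100.0 * len(a & b) / len(a | b)


def group_documents(docs: dict[str, set[str]], threshold: float) -> list[set[str]]:
    """Group documents linked by a pairwise match above the threshold."""
    parent = {name: name for name in docs}

    def find(x: str) -> str:
        # Follow parent links to the group representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for x, y in combinations(docs, 2):
        if percentage_match(docs[x], docs[y]) > threshold:
            parent[find(x)] = find(y)

    groups: dict[str, set[str]] = {}
    for name in docs:
        groups.setdefault(find(name), set()).add(name)
    # A single unmatched document does not define a subset.
    return [g for g in groups.values() if len(g) > 1]


def build_template(docs: dict[str, set[str]], subset: set[str]) -> set[str]:
    """The template is the set of text blocks present in every document of the subset."""
    return set.intersection(*(docs[name] for name in subset))
```

With two invoices that share three of five distinct blocks (a 60% match) and a threshold of 50, the invoices form one subset and the template holds the three shared blocks; an unrelated memo stays ungrouped.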

    Product labeling review
    Granted invention patent

    Publication number: US12067797B2

    Publication date: 2024-08-20

    Application number: US17408181

    Filing date: 2021-08-20

    Applicant: PEPSICO, INC.

    Inventor: Jingting Hui

    Abstract: A label processing engine receives, as inputs, raw data representative of a label and baseline data. The engine detects a raw data object within the raw data, classifies the raw data object, and localizes the raw data object within the raw data; it likewise detects a baseline data object within the baseline data, classifies the baseline data object, and localizes the baseline data object within the baseline data. The engine recognizes and extracts corresponding text within the raw data object and the baseline data object, reassembles the corresponding text of each object into respective lines of text, compares the respective lines of text with one another, and issues a notification based on the comparison.
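
The final comparison and notification steps might look like the sketch below. It assumes detection, recognition, and line reassembly have already produced two lists of strings, and it uses Python's standard `difflib.SequenceMatcher` as a stand-in comparison; the abstract does not name a comparison algorithm. The function names and the notification wording are hypothetical.

```python
import difflib


def compare_label_lines(raw_lines: list[str], baseline_lines: list[str]) -> list[tuple]:
    """Return (tag, baseline_lines, raw_lines) for every span that differs."""
    matcher = difflib.SequenceMatcher(a=baseline_lines, b=raw_lines, autojunk=False)
    mismatches = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            mismatches.append((tag, baseline_lines[i1:i2], raw_lines[j1:j2]))
    return mismatches


def notification(mismatches: list[tuple]) -> str:
    """Build a short notification message from the comparison (assumed wording)."""
    if not mismatches:
        return "Label matches baseline."
    return f"Label differs from baseline in {len(mismatches)} place(s)."
```

Comparing whole lines rather than characters keeps each reported mismatch aligned with a line a reviewer can locate on the label.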
