FORM STRUCTURE EXTRACTION BY PREDICTING ASSOCIATIONS

    公开(公告)号:US20230267345A1

    公开(公告)日:2023-08-24

    申请号:US18135948

    申请日:2023-04-18

    Applicant: Adobe Inc.

    CPC classification number: G06N5/04 G06N3/08 G06N20/00 G06N20/10 G06V10/82

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Form structure extraction by predicting associations

    公开(公告)号:US11657306B2

    公开(公告)日:2023-05-23

    申请号:US16904263

    申请日:2020-06-17

    Applicant: Adobe Inc.

    CPC classification number: G06N5/04 G06N3/08 G06N20/00 G06N20/10 G06V10/82

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Refining Element Associations for Form Structure Extraction

    公开(公告)号:US20230134460A1

    公开(公告)日:2023-05-04

    申请号:US17517434

    申请日:2021-11-02

    Applicant: Adobe Inc.

    Abstract: In implementations of refining element associations for form structure extraction, a computing device implements a structure system to receive estimate data describing estimated associations of elements included in a form and a digital image depicting the form. An image patch is extracted from the digital image, and the image patch depicts a pair of elements of the elements included in the form. The structure system encodes an indication of whether the pair of elements have an association of the estimated associations. An indication is generated that the pair of elements have a particular association based at least partially on the encoded indication, bounding boxes of the pair of elements, and text depicted in the image patch.

    FORM STRUCTURE EXTRACTION BY PREDICTING ASSOCIATIONS

    公开(公告)号:US20210397986A1

    公开(公告)日:2021-12-23

    申请号:US16904263

    申请日:2020-06-17

    Applicant: Adobe Inc.

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Classifying structural features of a digital document by feature type using machine learning

    公开(公告)号:US11003862B2

    公开(公告)日:2021-05-11

    申请号:US16359402

    申请日:2019-03-20

    Applicant: Adobe Inc.

    Abstract: Classifying structural features of a digital document by feature type using machine learning is leveraged in a digital medium environment. A document analysis system is leveraged to extract structural features from digital documents, and to classifying the structural features by respective feature types. To do this, the document analysis system employs a character analysis model and a classification model. The character analysis model takes text content from a digital document and generates text vectors that represent the text content. A vector sequence is generated based on the text vectors and position information for structural features of the digital document, and the classification model processes the vector sequence to classify the structural features into different feature types. The document analysis system can generate a modifiable version of the digital document that enables its structural features to be modified based on their respective feature types.

    USING NATURAL LANGUAGE PROCESSING AND DEEP LEARNING FOR MAPPING ANY SCHEMA DATA TO A HIERARCHICAL STANDARD DATA MODEL (XDM)

    公开(公告)号:US20190286978A1

    公开(公告)日:2019-09-19

    申请号:US15921369

    申请日:2018-03-14

    Applicant: Adobe Inc.

    Abstract: Systems and techniques map an input field from a data schema to a hierarchical standard data model (XDM). The XDM includes a tree of single XDM fields and each of the single XDM fields includes a composition of single level XDM fields. An input field from a data schema is processed by an unsupervised learning algorithm to obtain a sequence of vectors representing the input field and a sequence of vectors representing single level hierarchical standard data model (XDM) fields. These vectors are processed by a neural network to obtain a similarity score between the input field and each of the single level XDM fields. A probability of a match is determined using the similarity score between the input field and each of the single level XDM fields. The input field is mapped to the XDM field having the probability of the match with a highest score.

Patent Agency Ranking