METHOD AND APPARATUS FOR PERFORMING STRUCTURED EXTRACTION OF TEXT, DEVICE AND STORAGE MEDIUM

    Publication No.: EP3839818A3

    Publication Date: 2021-10-06

    Application No.: EP21162002.6

    Filing Date: 2021-03-11

    Abstract: Embodiments of the present disclosure disclose a method and apparatus for performing a structured extraction on a text, a device and a storage medium, and relate to the field of artificial intelligence, such as computer vision, deep learning, and natural language processing. A specific implementation of the method includes: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line. According to the implementation, a method for performing a structured extraction on a text based on category and relationship reasoning is provided, which is suitable for large-scale and automated processing and has a wide application range and strong versatility.
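
The last step of the abstract above (constructing structured information from the categories and the relationship probability matrix) can be illustrated with a minimal sketch. The category names `"field"` and `"value"`, the 0.5 threshold, the function name `build_structured_info` and the sample data are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import Dict, List


def build_structured_info(
    texts: List[str],
    categories: List[str],
    rel_prob: List[List[float]],
    threshold: float = 0.5,  # assumed cut-off, not taken from the abstract
) -> Dict[str, str]:
    """Pair each 'field' text line with its most probable 'value' text line."""
    result: Dict[str, str] = {}
    for i, cat_i in enumerate(categories):
        if cat_i != "field":
            continue
        # Candidate values: lines categorised as 'value' whose relationship
        # probability with line i exceeds the threshold.
        candidates = [
            (rel_prob[i][j], j)
            for j, cat_j in enumerate(categories)
            if cat_j == "value" and rel_prob[i][j] > threshold
        ]
        if candidates:
            _, best = max(candidates)
            result[texts[i]] = texts[best]
    return result


# Hypothetical text lines, categories and relationship probabilities.
texts = ["Name", "Zhang San", "Date", "2021-03-11"]
categories = ["field", "value", "field", "value"]
rel_prob = [
    [0.0, 0.9, 0.1, 0.2],
    [0.9, 0.0, 0.1, 0.1],
    [0.1, 0.1, 0.0, 0.8],
    [0.2, 0.1, 0.8, 0.0],
]
structured = build_structured_info(texts, categories, rel_prob)
print(structured)  # {'Name': 'Zhang San', 'Date': '2021-03-11'}
```

In this sketch a field with no value above the threshold is simply left out of the result; a real system might instead record it as empty.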

    IMAGE QUESTIONING AND ANSWERING METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

    Publication No.: EP3885935A1

    Publication Date: 2021-09-29

    Application No.: EP21275029.3

    Filing Date: 2021-03-16

    IPC Classification: G06F16/583 G06F16/332

    Abstract: The present application discloses an image questioning and answering method, apparatus, device and storage medium, relating to the technical fields of image processing, computer vision, deep learning and natural language processing. The specific implementation solution is as follows: constructing a question graph with a topological structure and extracting a question feature of a query sentence, according to the query sentence; constructing a visual graph with a topological structure and a text graph with a topological structure according to a target image corresponding to the query sentence; performing fusion on the visual graph, the text graph and the question graph by using a fusion model, to obtain a final fusion graph; and determining reply information of the query sentence according to a reasoning feature extracted from the final fusion graph and the question feature.

    METHOD AND APPARATUS FOR EXTRACTING INFORMATION ABOUT A NEGOTIABLE INSTRUMENT, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: EP3968287A2

    Publication Date: 2022-03-16

    Application No.: EP22151884.8

    Filing Date: 2022-01-17

    IPC Classification: G06V30/41

    Abstract: Provided are a method and apparatus for extracting information about a negotiable instrument, an electronic device and a storage medium. The method includes inputting (S101) a to-be-recognized negotiable instrument into a pretrained deep learning network and obtaining a visual image corresponding to the to-be-recognized negotiable instrument through the deep learning network; matching (S102) the visual image corresponding to the to-be-recognized negotiable instrument with a visual image corresponding to each negotiable-instrument template in a preconstructed base template library; and in response to the visual image corresponding to the to-be-recognized negotiable instrument successfully matching a visual image corresponding to one negotiable-instrument template in the base template library, extracting (S103) structured information of the to-be-recognized negotiable instrument by using the negotiable-instrument template.
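
The matching step (S102) in the abstract above can be sketched as a nearest-template lookup. This is only an illustration under stated assumptions: each "visual image" is represented here as a plain feature vector, cosine similarity is used as the matching score, and the 0.9 acceptance threshold, the function names and the template names are hypothetical. The abstract does not state how the match is actually computed.

```python
import math
from typing import Dict, Optional, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def match_template(
    query: Sequence[float],
    library: Dict[str, Sequence[float]],
    min_score: float = 0.9,  # assumed acceptance threshold
) -> Optional[str]:
    """Return the best-matching template name, or None if nothing matches."""
    best_name, best_score = None, min_score
    for name, vector in library.items():
        score = cosine(query, vector)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical base template library with two templates.
library = {"cheque_v1": [1.0, 0.0, 0.0], "draft_v2": [0.0, 1.0, 0.0]}
matched = match_template([0.9, 0.1, 0.0], library)
unmatched = match_template([0.5, 0.5, 0.7], library)
print(matched, unmatched)  # cheque_v1 None
```

A successful match would then select the template used for extraction in step S103; the `None` case corresponds to no template in the library matching.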

    METHOD, APPARATUS AND DEVICE FOR RECOGNIZING BILL AND STORAGE MEDIUM

    Publication No.: EP3882817A3

    Publication Date: 2022-01-05

    Application No.: EP21180801.9

    Filing Date: 2021-06-22

    IPC Classification: G06K9/00 G06K9/62 G06K9/46

    Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.

    METHOD, APPARATUS AND DEVICE FOR RECOGNIZING BILL AND STORAGE MEDIUM

    Publication No.: EP3882817A2

    Publication Date: 2021-09-22

    Application No.: EP21180801.9

    Filing Date: 2021-06-22

    IPC Classification: G06K9/00 G06K9/62

    Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.

    METHOD AND APPARATUS FOR PERFORMING STRUCTURED EXTRACTION OF TEXT, DEVICE AND STORAGE MEDIUM

    Publication No.: EP3839818A2

    Publication Date: 2021-06-23

    Application No.: EP21162002.6

    Filing Date: 2021-03-11

    Abstract: Embodiments of the present disclosure disclose a method and apparatus for performing a structured extraction on a text, a device and a storage medium, and relate to the field of artificial intelligence, such as computer vision, deep learning, and natural language processing. A specific implementation of the method includes: performing a text detection on an entity text image to obtain a position and content of a text line of the entity text image; extracting multivariate information of the text line based on the position and the content of the text line; performing a feature fusion on the multivariate information of the text line to obtain a multimodal fusion feature of the text line; performing category and relationship reasoning based on the multimodal fusion feature of the text line to obtain a category and a relationship probability matrix of the text line; and constructing structured information of the entity text image based on the category and the relationship probability matrix of the text line. According to the implementation, a method for performing a structured extraction on a text based on category and relationship reasoning is provided, which is suitable for large-scale and automated processing and has a wide application range and strong versatility.

    TABLE GENERATING METHOD AND APPARATUS, ELECTRONIC DEVICE, STORAGE MEDIUM AND PRODUCT

    Publication No.: EP4138050A1

    Publication Date: 2023-02-22

    Application No.: EP22178006.7

    Filing Date: 2022-06-09

    Abstract: The present disclosure provides a table generating method and apparatus, an electronic device, a storage medium and a product, and relates to the field of artificial intelligence, specifically to the field of computer vision and deep learning technology, which can be applied to scenarios of smart cities and AiFinance. A specific implementation is: recognizing at least one table object in a to-be-recognized image and obtaining a table property respectively corresponding to the at least one table object, where the table property of any table object includes a cell property or a non-cell property; determining at least one target object with the cell property in the at least one table object; determining a cell region respectively corresponding to the at least one target object to obtain cell position information respectively corresponding to the at least one target object; and generating a spreadsheet corresponding to the to-be-recognized image according to the cell position information respectively corresponding to the at least one target object. The technical solution of the present disclosure improves the accuracy of table generation.
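
The final step of the abstract above (generating a spreadsheet from cell position information) can be sketched as follows. This is an illustrative simplification, not the method claimed: cells are assumed to be axis-aligned boxes `(x0, y0, x1, y1, text)`, rows and columns are inferred by grouping top-left coordinates within an assumed pixel tolerance, merged cells are ignored, and the spreadsheet is emitted as CSV text. All names and sample values are hypothetical.

```python
import csv
import io
from typing import List, Tuple

Cell = Tuple[float, float, float, float, str]  # (x0, y0, x1, y1, text)


def cluster(values: List[float], tol: float) -> List[float]:
    """Keep one representative per group of coordinates.

    A new group starts when a value is more than tol above the
    representative (first value) of the current group.
    """
    centers: List[float] = []
    for v in sorted(values):
        if not centers or v - centers[-1] > tol:
            centers.append(v)
    return centers


def nearest(value: float, centers: List[float]) -> int:
    """Index of the representative closest to value."""
    return min(range(len(centers)), key=lambda k: abs(centers[k] - value))


def cells_to_grid(cells: List[Cell], tol: float = 5.0) -> List[List[str]]:
    """Place each cell's text into a row/column grid by its top-left corner."""
    rows = cluster([c[1] for c in cells], tol)
    cols = cluster([c[0] for c in cells], tol)
    grid = [["" for _ in cols] for _ in rows]
    for x0, y0, _x1, _y1, text in cells:
        grid[nearest(y0, rows)][nearest(x0, cols)] = text
    return grid


def grid_to_csv(grid: List[List[str]]) -> str:
    """Serialise the grid as CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(grid)
    return buf.getvalue()


# Hypothetical cell regions with slightly noisy coordinates.
cells = [
    (0, 0, 50, 20, "Item"), (52, 1, 100, 20, "Price"),
    (1, 22, 50, 40, "Tea"), (51, 21, 100, 40, "3.50"),
]
grid = cells_to_grid(cells)
csv_text = grid_to_csv(grid)
print(csv_text)
```

The tolerance absorbs small detection jitter so that cells whose corners differ by a pixel or two still land in the same row or column.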

    IMAGE CLASSIFICATION METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: EP3923185A3

    Publication Date: 2022-04-27

    Application No.: EP21202754.4

    Filing Date: 2021-10-14

    IPC Classification: G06K9/00 G06K9/62

    Abstract: Provided are an image classification method and apparatus, an electronic device and a storage medium, relating to the field of artificial intelligence and, in particular, to computer vision and deep learning. The method includes inputting (S101, S201, S301) a to-be-classified document image into a pretrained neural network and obtaining a feature submap of each text box of the to-be-classified document image by use of the neural network; inputting (S102, S202, S302) the feature submap of each text box, a semantic feature corresponding to preobtained text information of each text box and a position feature corresponding to preobtained position information of each text box into a pretrained multimodal feature fusion model and fusing, by use of the multimodal feature fusion model, the three into a multimodal feature corresponding to each text box; and classifying (S103) the to-be-classified document image based on the multimodal feature corresponding to each text box. The semantic feature and the position feature in the document image are well utilized, thereby improving the classification accuracy of the document image.
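
The data flow in the abstract above (three per-box features fused into one multimodal feature, then a document-level classification) can be illustrated with a toy sketch. It deliberately swaps in simple stand-ins: plain concatenation replaces the pretrained multimodal feature fusion model, mean pooling over text boxes and a hand-set linear scorer replace the classifier. The feature values, weights and class labels are hypothetical.

```python
from typing import Dict, List, Sequence


def fuse_box(
    visual: Sequence[float],
    semantic: Sequence[float],
    position: Sequence[float],
) -> List[float]:
    """Concatenate the three per-box features (stand-in for a learned fusion)."""
    return [*visual, *semantic, *position]


def mean_pool(vectors: List[List[float]]) -> List[float]:
    """Average the fused box features into one document-level feature."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classify(
    doc_feature: Sequence[float],
    class_weights: Dict[str, Sequence[float]],
) -> str:
    """Return the label whose weight vector has the highest dot product."""
    scores = {
        label: sum(w * x for w, x in zip(weights, doc_feature))
        for label, weights in class_weights.items()
    }
    return max(scores, key=scores.get)


# Hypothetical features for two text boxes of one document image.
boxes = [
    {"visual": [0.2, 0.8], "semantic": [1.0, 0.0], "position": [0.1, 0.1]},
    {"visual": [0.4, 0.6], "semantic": [1.0, 0.0], "position": [0.1, 0.5]},
]
fused = [fuse_box(b["visual"], b["semantic"], b["position"]) for b in boxes]
doc_feature = mean_pool(fused)

# Hand-set weights: each class looks at one semantic dimension only.
class_weights = {
    "invoice": [0, 0, 1, 0, 0, 0],
    "contract": [0, 0, 0, 1, 0, 0],
}
label = classify(doc_feature, class_weights)
print(label)  # invoice
```

The point of the sketch is only the shape of the pipeline: semantic and position features travel alongside the visual feature for every text box and all three influence the final decision.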

    IMAGE CLASSIFICATION METHOD AND APPARATUS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication No.: EP3923185A2

    Publication Date: 2021-12-15

    Application No.: EP21202754.4

    Filing Date: 2021-10-14

    IPC Classification: G06K9/00 G06K9/62

    Abstract: Provided are an image classification method and apparatus, an electronic device and a storage medium, relating to the field of artificial intelligence and, in particular, to computer vision and deep learning. The method includes inputting (S101, S201, S301) a to-be-classified document image into a pretrained neural network and obtaining a feature submap of each text box of the to-be-classified document image by use of the neural network; inputting (S102, S202, S302) the feature submap of each text box, a semantic feature corresponding to preobtained text information of each text box and a position feature corresponding to preobtained position information of each text box into a pretrained multimodal feature fusion model and fusing, by use of the multimodal feature fusion model, the three into a multimodal feature corresponding to each text box; and classifying (S103) the to-be-classified document image based on the multimodal feature corresponding to each text box. The semantic feature and the position feature in the document image are well utilized, thereby improving the classification accuracy of the document image.

    RECOGNIZING INVOICE IMAGES
    Document Type: Invention publication

    Publication No.: EP3836016A1

    Publication Date: 2021-06-16

    Application No.: EP21162692.4

    Filing Date: 2021-03-15

    Abstract: The present disclosure discloses a method, apparatus, device and storage medium for recognizing a bill image and a computer program product, and relates to the fields of artificial intelligence, deep learning and image processing. A specific implementation is: performing text detection on a bill image, and determining an attribute information set and a relationship information set of each text box of at least two text boxes in the bill image; determining a type of the text box and an associated text box that has a structural relationship with the text box based on the attribute information set and the relationship information set of the text box; and extracting structured bill data of the bill image, based on the type of the text box and the associated text box that has the structural relationship with the text box. The solution of embodiments of the present disclosure can support automatic recognition of a variety of bill images in different formats, and the recognition process does not require use of a template, which improves the versatility and accuracy of bill image recognition.