IMAGE QUESTIONING AND ANSWERING METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

    Publication number: EP3885935A1

    Publication date: 2021-09-29

    Application number: EP21275029.3

    Application date: 2021-03-16

    IPC classification: G06F16/583 G06F16/332

    Abstract: The present application discloses an image questioning and answering method, apparatus, device and storage medium, relating to the technical fields of image processing, computer vision, deep learning and natural language processing. The specific implementation solution is as follows: constructing a question graph with a topological structure and extracting a question feature of a query sentence, according to the query sentence; constructing a visual graph with a topological structure and a text graph with a topological structure according to a target image corresponding to the query sentence; performing fusion on the visual graph, the text graph and the question graph by using a fusion model, to obtain a final fusion graph; and determining reply information of the query sentence according to a reasoning feature extracted from the final fusion graph and the question feature.

    METHOD AND APPARATUS FOR EXTRACTING INFORMATION ABOUT A NEGOTIABLE INSTRUMENT, ELECTRONIC DEVICE AND STORAGE MEDIUM

    Publication number: EP3968287A2

    Publication date: 2022-03-16

    Application number: EP22151884.8

    Application date: 2022-01-17

    IPC classification: G06V30/41

    Abstract: Provided are a method and apparatus for extracting information about a negotiable instrument, an electronic device and a storage medium. The method includes inputting (S101) a to-be-recognized negotiable instrument into a pretrained deep learning network and obtaining a visual image corresponding to the to-be-recognized negotiable instrument through the deep learning network; matching (S102) the visual image corresponding to the to-be-recognized negotiable instrument with a visual image corresponding to each negotiable-instrument template in a preconstructed base template library; and in response to the visual image corresponding to the to-be-recognized negotiable instrument successfully matching a visual image corresponding to one negotiable-instrument template in the base template library, extracting (S103) structured information of the to-be-recognized negotiable instrument by using the negotiable-instrument template.
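
Steps S102 and S103 can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the "visual image" produced in S101 is available as a plain feature vector, uses cosine similarity with a hypothetical threshold of 0.9 as the matching criterion, and assumes each template stores rectangular field regions. The names `match_template`, `extract_fields` and `LIBRARY`, and all sample values, are invented for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_template(feature, library, threshold=0.9):
    """S102 (assumed form): return the best-matching template name,
    or None when no template reaches the threshold."""
    best_name, best_score = None, threshold
    for name, template in library.items():
        score = cosine(feature, template["feature"])
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


def extract_fields(tokens, template):
    """S103 (assumed form): assign each recognized token to the template
    field whose box (x0, y0, x1, y1) contains the token's center point."""
    result = {}
    for text, (cx, cy) in tokens:
        for field, (x0, y0, x1, y1) in template["fields"].items():
            if x0 <= cx <= x1 and y0 <= cy <= y1:
                result[field] = text
                break
    return result


# Hypothetical base template library with two templates.
LIBRARY = {
    "cheque": {
        "feature": [1.0, 0.0, 0.2],
        "fields": {"payee": (0, 0, 100, 20), "amount": (101, 0, 200, 20)},
    },
    "deposit_slip": {
        "feature": [0.0, 1.0, 0.1],
        "fields": {"account": (0, 0, 200, 20)},
    },
}

if __name__ == "__main__":
    name = match_template([0.9, 0.1, 0.2], LIBRARY)
    if name is not None:
        tokens = [("ACME Ltd", (50, 10)), ("1,250.00", (150, 10))]
        print(name, extract_fields(tokens, LIBRARY[name]))
```

When no template matches, `match_template` returns `None` and nothing is extracted, which mirrors the abstract's "in response to ... successfully matching" condition.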

    METHOD, APPARATUS AND DEVICE FOR RECOGNIZING BILL AND STORAGE MEDIUM

    Publication number: EP3882817A3

    Publication date: 2022-01-05

    Application number: EP21180801.9

    Application date: 2021-06-22

    IPC classification: G06K9/00 G06K9/62 G06K9/46

    Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.

    METHOD, APPARATUS AND DEVICE FOR RECOGNIZING BILL AND STORAGE MEDIUM

    Publication number: EP3882817A2

    Publication date: 2021-09-22

    Application number: EP21180801.9

    Application date: 2021-06-22

    IPC classification: G06K9/00 G06K9/62

    Abstract: The present disclosure discloses a method, apparatus and device for recognizing a bill, and a storage medium. The method comprises: acquiring a bill image; inputting the bill image into a feature extraction network layer of a pre-trained bill recognition model, to obtain a bill key field feature map and a bill key field value feature map of the bill image; inputting the bill key field feature map into a first head network layer of the bill recognition model, to obtain a bill key field; processing the bill key field value feature map by a second head network layer of the bill recognition model, to obtain a bill key field value, the feature extraction network layer being respectively connected with the first head network layer and the second head network layer; and generating structured information of the bill image based on the bill key field and the bill key field value.

    RECOGNIZING INVOICE IMAGES

    Publication number: EP3836016A1

    Publication date: 2021-06-16

    Application number: EP21162692.4

    Application date: 2021-03-15

    Abstract: The present disclosure discloses a method, apparatus, device and storage medium for recognizing a bill image and a computer program product, and relates to the fields of artificial intelligence, deep learning and image processing. A specific implementation is: performing text detection on a bill image, and determining an attribute information set and a relationship information set of each text box of at least two text boxes in the bill image; determining a type of the text box and an associated text box that has a structural relationship with the text box based on the attribute information set and the relationship information set of the text box; and extracting structured bill data of the bill image, based on the type of the text box and the associated text box that has the structural relationship with the text box. The solution of embodiments of the present disclosure can support automatic recognition of a variety of bill images in different formats, and the recognition process does not require use of a template, which improves the versatility and accuracy of bill image recognition.
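
The final extraction step can be sketched as follows, assuming the earlier steps have already assigned each text box a type and the index of its associated text box. The box representation (`text`, `type`, `linked_to`), the type labels `"key"`, `"value"` and `"other"`, and the function name `build_structured_data` are hypothetical; the abstract does not specify them.

```python
def build_structured_data(boxes):
    """Pair each key box with the value box it is linked to.

    Each box is a dict with:
      "text"      - recognized text content
      "type"      - assumed label: "key", "value" or "other"
      "linked_to" - index of the associated box, or None
    """
    record = {}
    for box in boxes:
        if box["type"] != "key" or box["linked_to"] is None:
            continue  # only key boxes with an association start a pair
        partner = boxes[box["linked_to"]]
        if partner["type"] == "value":
            record[box["text"]] = partner["text"]
    return record


# Hypothetical output of the type and relationship prediction steps.
BOXES = [
    {"text": "Invoice No.", "type": "key", "linked_to": 1},
    {"text": "INV-0042", "type": "value", "linked_to": 0},
    {"text": "Total", "type": "key", "linked_to": 3},
    {"text": "99.50", "type": "value", "linked_to": 2},
    {"text": "Thank you", "type": "other", "linked_to": None},
]

if __name__ == "__main__":
    print(build_structured_data(BOXES))
```

Because the pairs come from predicted links rather than fixed positions, the same function applies to bills with different layouts, which is the template-free property the abstract emphasizes.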

    IMAGE PROCESSING METHOD AND APPARATUS, DEVICE AND STORAGE MEDIUM

    Publication number: EP4040401A1

    Publication date: 2022-08-10

    Application number: EP21197863.0

    Application date: 2021-09-21

    IPC classification: G06V10/82 G06V30/413

    Abstract: The present disclosure discloses an image processing method and apparatus, a device and a storage medium, and relates to the field of artificial intelligence technologies, and particularly to the fields of computer vision technologies, deep learning technologies, or the like. The image processing method includes: acquiring a multi-modal feature of each of at least one text region in an image, the multi-modal feature including features in plural dimensions; performing a global attention processing operation on the multi-modal feature of each text region to obtain a global attention feature of each text region; determining a category of each text region based on the global attention feature of each text region; and constructing structured information based on text content and the category of each text region. The present disclosure may provide a more universal construction scheme for structured information in an image.
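
One way to picture the global attention and categorization steps is the sketch below. It is an assumption-laden illustration, not the disclosed model: the global attention operation is taken to be plain scaled dot-product self-attention with the region features serving as queries, keys and values, and the category is chosen by the highest dot product with hypothetical class prototypes. The feature values, the prototypes and the names `global_attention` and `categorize` are all invented.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def global_attention(features):
    """Scaled dot-product self-attention across all text regions.

    Every region attends to every region, so each output vector mixes
    information from the whole image.
    """
    dim = len(features[0])
    attended = []
    for query in features:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in features
        ]
        weights = softmax(scores)
        attended.append(
            [sum(w * value[i] for w, value in zip(weights, features)) for i in range(dim)]
        )
    return attended


def categorize(attended, prototypes):
    """Pick, for each region, the category whose prototype scores highest."""
    categories = []
    for vector in attended:
        best = max(
            prototypes,
            key=lambda name: sum(v * p for v, p in zip(vector, prototypes[name])),
        )
        categories.append(best)
    return categories


# Hypothetical two-dimensional multi-modal features for three text regions.
TEXTS = ["Name", "Date", "2021-09-21"]
FEATURES = [[2.0, 0.0], [1.8, 0.1], [0.0, 2.0]]
PROTOTYPES = {"key": [1.0, 0.0], "value": [0.0, 1.0]}

if __name__ == "__main__":
    labels = categorize(global_attention(FEATURES), PROTOTYPES)
    print(list(zip(TEXTS, labels)))
```

Pairing each region's text with its predicted category, as the last line does, is the simplest form of the structured information the abstract describes; a trained model would replace both the raw features and the prototype scoring.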