-
公开(公告)号:US20220092353A1
公开(公告)日:2022-03-24
申请号:US17540207
申请日:2021-12-01
Inventor: Ruixue Liu , Xiameng Qin , Mengyi En , Kun Yao , Chengquan Zhang , Shengxian Zhu , Yunhao Li , Junyu Han , Hao Sun
Abstract: A computer-implemented method includes: acquiring training data, the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type ; building a basic model, the basic model includes a deep learning network, and the deep learning network is configured to recognize the training images to extract text data in the training image; and training the basic model by using the training data to obtain the image recognition model.
-
公开(公告)号:US11775574B2
公开(公告)日:2023-10-03
申请号:US17182987
申请日:2021-02-23
Inventor: Yulin Li , Xiameng Qin , Ju Huang , Qunyi Xie , Junyu Han
IPC: G06F16/00 , G06F16/36 , G06F40/279 , G06F18/25 , G06V10/764 , G06V10/80 , G06V10/82 , G06V10/44 , G06V10/426 , G06N3/02
CPC classification number: G06F16/367 , G06F18/253 , G06F40/279 , G06V10/426 , G06V10/454 , G06V10/764 , G06V10/811 , G06V10/82 , G06N3/02
Abstract: A method for visual question answering, a computer device implementing the method and a medium for storing instructions on performing the method are provided. The method includes: acquiring an input image and an input question; constructing a visual graph based on the input image, wherein the visual graph comprises a first node feature and a first edge feature; constructing a question graph based on the input question, wherein the question graph comprises a second node feature and a second edge feature; performing a multimodal fusion on the visual graph and the question graph to obtain an updated visual graph and an updated question graph; determining a question feature based on the input question; determining a fusion feature based on the updated visual graph, the updated question graph and the question feature; and generating a predicted answer for the input image and the input question.
-
13.
公开(公告)号:US20220139096A1
公开(公告)日:2022-05-05
申请号:US17578735
申请日:2022-01-19
Inventor: Pengyuan Lv , Chengquan Zhang , Kun Yao , Junyu Han
Abstract: A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.
-
-