-
Publication No.: US20230042234A1
Publication date: 2023-02-09
Application No.: US17972253
Filing date: 2022-10-24
Inventor: Yangliu XU, Qunyi Xie, Yi Chen, Xiameng Qin, Chengquan Zhang, Kun Yao
IPC: G06N3/08
Abstract: A method for training a model includes: obtaining a scene image, second actual characters in the scene image and a second construct image; obtaining first features and first recognition characters of characters obtained by performing character recognition on the scene image using the model to be trained; obtaining second features of characters obtained by performing character recognition on the second construct image using the training auxiliary model; and obtaining a character recognition model by adjusting model parameters of the model to be trained based on the first recognition characters, the second actual characters, the first features and the second features.
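As an illustration of the final step in the abstract above, here is a minimal sketch of how a recognition term and a feature-alignment term might be combined into one training objective. The abstract does not name the loss functions, so the use of cross-entropy and mean squared error, the weight `alpha`, and all function names are assumptions made purely for illustration.

```python
import math


def cross_entropy(probs, target_index):
    """Negative log-likelihood of the actual character under the predicted distribution."""
    return -math.log(probs[target_index])


def mse(a, b):
    """Mean squared error between two equal-length feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def combined_loss(char_probs, actual_chars, first_features, second_features, alpha=0.5):
    """Hypothetical objective: recognition loss plus weighted feature-alignment loss.

    char_probs      -- per-character probability distributions from the model being trained
    actual_chars    -- index of the actual character at each position
    first_features  -- per-character features from the model being trained
    second_features -- per-character features from the auxiliary model
    alpha           -- assumed weight of the feature-alignment term
    """
    # Recognition term: compares recognized characters with the actual characters.
    rec = sum(cross_entropy(p, t) for p, t in zip(char_probs, actual_chars)) / len(actual_chars)
    # Alignment term: pulls the first features toward the second features.
    feat = sum(mse(f1, f2) for f1, f2 in zip(first_features, second_features)) / len(first_features)
    return rec + alpha * feat
```

In a real training loop the returned value would be minimized with respect to the parameters of the model being trained only; the auxiliary model's features act as fixed targets in this sketch.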
-
Publication No.: US20220092353A1
Publication date: 2022-03-24
Application No.: US17540207
Filing date: 2021-12-01
Inventor: Ruixue Liu, Xiameng Qin, Mengyi En, Kun Yao, Chengquan Zhang, Shengxian Zhu, Yunhao Li, Junyu Han, Hao Sun
Abstract: A computer-implemented method includes: acquiring training data, where the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type; building a basic model, where the basic model includes a deep learning network configured to recognize the training images to extract text data in the training images; and training the basic model by using the training data to obtain an image recognition model.
-
Publication No.: US20220148324A1
Publication date: 2022-05-12
Application No.: US17581047
Filing date: 2022-01-21
Inventor: Xiameng QIN, Yulin Li, Ju Huang, Qunyi Xie, Chengquan Zhang, Kun Yao, Jingtuo Liu, Junyu Han
IPC: G06V30/18, G06V30/24, G06V30/148, G06V30/19
Abstract: Provided are a method and apparatus for extracting information about a negotiable instrument, an electronic device and a storage medium. The method includes inputting a to-be-recognized negotiable instrument into a pretrained deep learning network and obtaining a visual image corresponding to the to-be-recognized negotiable instrument through the deep learning network; matching the visual image corresponding to the to-be-recognized negotiable instrument with a visual image corresponding to each negotiable-instrument template in a preconstructed base template library; and in response to the visual image corresponding to the to-be-recognized negotiable instrument successfully matching a visual image corresponding to one negotiable-instrument template in the base template library, extracting structured information of the to-be-recognized negotiable instrument by using the negotiable-instrument template.
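The matching step described in this abstract can be sketched as a nearest-template lookup. The abstract does not say how the visual images are compared, so representing each one as a plain feature vector, using cosine similarity, the `threshold` value, and the names `match_template` and `library` are all assumptions for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_template(query, library, threshold=0.9):
    """Return the name of the best-matching template, or None if none reaches the threshold.

    query     -- assumed vector representation of the instrument to be recognized
    library   -- dict mapping a template name to its assumed vector representation
    threshold -- assumed minimum similarity for a successful match
    """
    best_name, best_score = None, threshold
    for name, vector in library.items():
        score = cosine_similarity(query, vector)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name
```

When `match_template` returns a name, the corresponding template would then drive the extraction of structured fields; a `None` result corresponds to the case in which no template in the base library matches.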
-
Publication No.: US20220139096A1
Publication date: 2022-05-05
Application No.: US17578735
Filing date: 2022-01-19
Inventor: Pengyuan Lv, Chengquan Zhang, Kun Yao, Junyu Han
Abstract: A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.
-
Publication No.: US11854283B2
Publication date: 2023-12-26
Application No.: US17169112
Filing date: 2021-02-05
Inventor: Pengyuan Lv, Xiaoqiang Zhang, Shanshan Liu, Chengquan Zhang, Qiming Peng, Sijin Wu, Hua Lu, Yongfeng Chen
IPC: G06V30/262, G06T7/70, G06V30/413, G06V20/62, G06F16/33, G06V30/19, G06V10/82, G06V30/416
CPC classification number: G06V30/274, G06F16/3344, G06T7/70, G06V10/82, G06V20/62, G06V30/19173, G06V30/413, G06V30/416, G06T2207/30176
Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.
-
Publication No.: US20230123327A1
Publication date: 2023-04-20
Application No.: US18068149
Filing date: 2022-12-19
Inventor: Chengquan Zhang, Pengyuan Lv, Kun Yao, Junyu Han, Jingtuo Liu
Abstract: A method for recognizing text includes: obtaining an image sequence feature of an image to be recognized; obtaining a full text string of the image to be recognized by decoding the image sequence feature; obtaining a text sequence feature by performing a semantic enhancement process on the full text string, in which the image sequence feature, the full text string and the text sequence feature are of the same length; and determining text content of the image to be recognized based on the full text string and the text sequence feature.
-