Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Chengquan Zhang"

1.

发明申请
METHOD FOR TRAINING MODEL, DEVICE, AND STORAGE MEDIUM 有权

公开(公告)号：US20230042234A1

公开(公告)日：2023-02-09

申请号：US17972253

申请日：2022-10-24

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Yangliu XU , Qunyi Xie , Yi Chen , Xiameng Qin , Chengquan Zhang , Kun Yao

IPC: G06N3/08

Abstract: A method for training a model includes: obtaining a scene image, second actual characters in the scene image and a second construct image; obtaining first features and first recognition characters of characters obtained by performing character recognition on the scene image using the model to be trained; obtaining second features of characters obtained by performing character recognition on the second construct image using the training auxiliary model; and obtaining a character recognition model by adjusting model parameters of the model to be trained based on the first recognition characters, the second actual characters, the first features and the second features.

2.

发明申请
METHOD AND DEVICE FOR TRAINING IMAGE RECOGNITION MODEL, EQUIPMENT AND MEDIUM 有权

公开(公告)号：US20220092353A1

公开(公告)日：2022-03-24

申请号：US17540207

申请日：2021-12-01

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Ruixue Liu , Xiameng Qin , Mengyi En , Kun Yao , Chengquan Zhang , Shengxian Zhu , Yunhao Li , Junyu Han , Hao Sun

IPC: G06K9/62 , G06V30/30 , G06V30/14

Abstract: A computer-implemented method includes: acquiring training data, the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type ; building a basic model, the basic model includes a deep learning network, and the deep learning network is configured to recognize the training images to extract text data in the training image; and training the basic model by using the training data to obtain the image recognition model.

3.

发明申请
METHOD AND APPARATUS FOR EXTRACTING INFORMATION ABOUT A NEGOTIABLE INSTRUMENT, ELECTRONIC DEVICE AND STORAGE MEDIUM 有权

公开(公告)号：US20220148324A1

公开(公告)日：2022-05-12

申请号：US17581047

申请日：2022-01-21

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Xiameng QIN , Yulin Li , Ju Huang , Qunyi Xie , Chengquan Zhang , Kun Yao , Jingtuo Liu , Junyu Han

IPC: G06V30/18 , G06V30/24 , G06V30/148 , G06V30/19

Abstract: Provided are a method and apparatus for extracting information about a negotiable instrument, an electronic device and a storage medium. The method includes inputting a to-be-recognized negotiable instrument into a pretrained deep learning network and obtaining a visual image corresponding to the to-be-recognized negotiable instrument through the deep learning network;
matching the visual image corresponding to the to-be-recognized negotiable instrument with a visual image corresponding to each negotiable-instrument template in a preconstructed base template library; and in response to the visual image corresponding to the to-be-recognized negotiable instrument successfully matching a visual image corresponding to one negotiable-instrument template in the base template library, extracting structured information of the to-be-recognized negotiable instrument by using the negotiable-instrument template.

4.

发明申请
CHARACTER RECOGNITION METHOD, MODEL TRAINING METHOD, RELATED APPARATUS AND ELECTRONIC DEVICE 有权

公开(公告)号：US20220139096A1

公开(公告)日：2022-05-05

申请号：US17578735

申请日：2022-01-19

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Pengyuan Lv , Chengquan Zhang , Kun Yao , Junyu Han

IPC: G06V30/19 , G06V30/18 , G06V20/70

Abstract: A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.

5.

发明授权
Method and apparatus for visual question answering, computer device and medium 有权

公开(公告)号：US11854283B2

公开(公告)日：2023-12-26

申请号：US17169112

申请日：2021-02-05

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Pengyuan Lv , Xiaoqiang Zhang , Shanshan Liu , Chengquan Zhang , Qiming Peng , Sijin Wu , Hua Lu , Yongfeng Chen

IPC: G06V30/262 , G06T7/70 , G06V30/413 , G06V20/62 , G06F16/33 , G06V30/19 , G06V10/82 , G06V30/416

CPC classification number: G06V30/274 , G06F16/3344 , G06T7/70 , G06V10/82 , G06V20/62 , G06V30/19173 , G06V30/413 , G06V30/416 , G06T2207/30176

Abstract: The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a medium.

6.

发明申请
METHOD AND DEVICE FOR RECOGNIZING TEXT, AND METHOD AND DEVICE FOR TRAINING TEXT RECOGNITION MODEL 有权

公开(公告)号：US20230123327A1

公开(公告)日：2023-04-20

申请号：US18068149

申请日：2022-12-19

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Chengquan Zhang , Pengyuan Lv , Kun Yao , Junyu Han , Jingtuo Liu

IPC: G06V30/19 , G06V10/82

Abstract: A method for recognizing text includes: obtaining an image sequence feature of an image to be recognized; obtaining a full text string of the image to be recognized by decoding the image sequence feature; obtaining a text sequence feature by performing a semantic enhancement process on the full text string, in which the image sequence feature, the full text string and the text sequence feature are of the same length; and determining text content of the image to be recognized based on the full text string and the text sequence feature.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification