Patent search ap:("BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO. Page LTD.") AND inv:"Junyu Han"

11.

发明申请
METHOD AND DEVICE FOR TRAINING IMAGE RECOGNITION MODEL, EQUIPMENT AND MEDIUM 有权

公开(公告)号：US20220092353A1

公开(公告)日：2022-03-24

申请号：US17540207

申请日：2021-12-01

Applicant: BEIJING BAIDU NETCOM SCIENCE TECHNOLOGY CO., LTD.

Inventor： Ruixue Liu , Xiameng Qin , Mengyi En , Kun Yao , Chengquan Zhang , Shengxian Zhu , Yunhao Li , Junyu Han , Hao Sun

IPC: G06K9/62 , G06V30/30 , G06V30/14

Abstract: A computer-implemented method includes: acquiring training data, the training data includes training images for a preset vertical type, and the training images include a first training image containing real data of the preset vertical type and a second training image containing virtual data of the preset vertical type ; building a basic model, the basic model includes a deep learning network, and the deep learning network is configured to recognize the training images to extract text data in the training image; and training the basic model by using the training data to obtain the image recognition model.

12.

发明授权
Method and apparatus for visual question answering, computer device and medium 有权

公开(公告)号：US11775574B2

公开(公告)日：2023-10-03

申请号：US17182987

申请日：2021-02-23

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Yulin Li , Xiameng Qin , Ju Huang , Qunyi Xie , Junyu Han

IPC: G06F16/00 , G06F16/36 , G06F40/279 , G06F18/25 , G06V10/764 , G06V10/80 , G06V10/82 , G06V10/44 , G06V10/426 , G06N3/02

CPC classification number: G06F16/367 , G06F18/253 , G06F40/279 , G06V10/426 , G06V10/454 , G06V10/764 , G06V10/811 , G06V10/82 , G06N3/02

Abstract: A method for visual question answering, a computer device implementing the method and a medium for storing instructions on performing the method are provided. The method includes: acquiring an input image and an input question; constructing a visual graph based on the input image, wherein the visual graph comprises a first node feature and a first edge feature; constructing a question graph based on the input question, wherein the question graph comprises a second node feature and a second edge feature; performing a multimodal fusion on the visual graph and the question graph to obtain an updated visual graph and an updated question graph; determining a question feature based on the input question; determining a fusion feature based on the updated visual graph, the updated question graph and the question feature; and generating a predicted answer for the input image and the input question.

13.

发明申请
CHARACTER RECOGNITION METHOD, MODEL TRAINING METHOD, RELATED APPARATUS AND ELECTRONIC DEVICE 有权

公开(公告)号：US20220139096A1

公开(公告)日：2022-05-05

申请号：US17578735

申请日：2022-01-19

Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.

Inventor： Pengyuan Lv , Chengquan Zhang , Kun Yao , Junyu Han

IPC: G06V30/19 , G06V30/18 , G06V20/70

Abstract: A character recognition method, a model training method, a related apparatus and an electronic device are provided. The specific solution is: obtaining a target picture; performing feature encoding on the target picture to obtain a visual feature of the target picture; performing feature mapping on the visual feature to obtain a first target feature of the target picture, where the first target feature is a feature that has a matching space with a feature of character semantic information of the target picture; inputting the first target feature into a character recognition model for character recognition to obtain a first character recognition result of the target picture.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification