Invention Publication
- Patent Title: METHOD OF TRAINING TEXT DETECTION MODEL, METHOD OF DETECTING TEXT, AND DEVICE
- Application No.: US18041370
- Application Date: 2022-04-22
- Publication No.: US20240265718A1
- Publication Date: 2024-08-08
- Inventor: Xiaoqiang ZHANG, Xiameng QIN, Chengquan ZHANG, Kun YAO
- Applicant: Beijing Baidu Netcom Science Technology Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee: Beijing Baidu Netcom Science Technology Co., Ltd.
- Current Assignee Address: CN Beijing
- Priority: CN 2110934294.5 2021.08.13
- International Application: PCT/CN2022/088393 2022.04.22
- Date entered country: 2023-02-10
- Main IPC: G06V30/19
- IPC: G06V30/19; G06V10/77; G06V10/82

Abstract:
A method of training a text detection model and a method of detecting a text are provided. The training method includes: inputting a sample image into a text feature extraction sub-model of a text detection model to obtain a text feature of a text in the sample image, the sample image having a label indicating actual position information and an actual category; inputting a predetermined text vector into a text encoding sub-model of the text detection model to obtain a text reference feature; inputting the text feature and the text reference feature into a decoding sub-model of the text detection model to obtain a text sequence vector; inputting the text sequence vector into an output sub-model of the text detection model to obtain predicted position information and a predicted category; and training the text detection model based on the predicted category, the actual category, the predicted position information, and the actual position information.
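
The data flow in the abstract can be sketched as follows. This is a minimal illustration only, not the claimed implementation. The four sub-models are passed in as hypothetical callables (`extract`, `encode`, `decode`, `output`). The loss is also an assumption: the abstract says only that training uses the predicted and actual categories and position information, so cross-entropy plus a weighted L1 distance is used here as one common choice.

```python
import math
from typing import Callable, Sequence, Tuple

Vector = Sequence[float]


def forward(
    sample_image: Vector,
    text_vector: Vector,
    extract: Callable,  # hypothetical text feature extraction sub-model
    encode: Callable,   # hypothetical text encoding sub-model
    decode: Callable,   # hypothetical decoding sub-model
    output: Callable,   # hypothetical output sub-model
) -> Tuple[Vector, Vector]:
    """Chain the four sub-models in the order given in the abstract."""
    text_feature = extract(sample_image)
    text_reference = encode(text_vector)
    sequence_vector = decode(text_feature, text_reference)
    # Returns (category scores, predicted position information).
    return output(sequence_vector)


def cross_entropy(logits: Vector, target: int) -> float:
    """Negative log-softmax of the target class, computed stably."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[target]


def l1_distance(pred: Vector, actual: Vector) -> float:
    """Sum of absolute coordinate differences between two boxes."""
    return sum(abs(p - a) for p, a in zip(pred, actual))


def detection_loss(
    pred_logits: Vector,
    pred_box: Vector,
    actual_category: int,
    actual_box: Vector,
    box_weight: float = 1.0,  # assumed weighting, not from the source
) -> float:
    """Assumed training objective: category loss plus position loss."""
    category_loss = cross_entropy(pred_logits, actual_category)
    position_loss = l1_distance(pred_box, actual_box)
    return category_loss + box_weight * position_loss
```

In an actual training loop, the value returned by `detection_loss` would be minimized with respect to the parameters of all four sub-models. The abstract does not specify the optimizer or the exact form of the loss.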