MULTI-DIRECTIONAL SCENE TEXT RECOGNITION METHOD AND SYSTEM BASED ON MULTI-ELEMENT ATTENTION MECHANISM

    Publication No.: US20220121871A1

    Publication Date: 2022-04-21

    Application No.: US17502533

    Filing Date: 2021-10-15

    Abstract: A method and a system for multi-directional scene text recognition based on a multi-element attention mechanism are provided. The method includes: performing, by a feature extractor, normalization of a text row/column image I output from an external text detection module, extracting features from the normalized image using a deep convolutional neural network to obtain an initial feature map F0, and adding a 2-dimensional directional positional encoding P to the initial feature map F0 to output a multi-channel feature map F; converting, by an encoder, the multi-channel feature map F output from the feature extractor into a hidden representation H; and converting, by a decoder, the hidden representation H output from the encoder into recognized text, which is used as the output result. The method and system provided by the present invention are applicable to multi-oriented scene text images, including horizontal text, vertical text, and curved text.
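    The feature-extractor step F = F0 + P can be illustrated with a minimal sketch. The abstract does not specify how the directional positional encoding P is constructed, so the code below assumes a standard sinusoidal encoding in which the first half of the channels encodes the row index and the second half the column index. The function names (`sinusoid`, `positional_encoding_2d`, `add_encoding`) and the nested-list feature-map layout [height][width][channels] are hypothetical and chosen only for illustration; this is not the patent's actual formulation.

```python
import math


def sinusoid(position, dim):
    """Encode one integer position as `dim` sinusoidal values (assumed scheme)."""
    vec = []
    for i in range(dim):
        # Each sin/cos pair shares one frequency, as in common Transformer encodings.
        freq = 1.0 / (10000 ** ((i // 2 * 2) / dim))
        angle = position * freq
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec


def positional_encoding_2d(height, width, channels):
    """Build P as a nested list of shape [height][width][channels].

    Assumption: the first half of the channels encodes the row index y,
    the remaining channels encode the column index x.
    """
    half = channels // 2
    return [
        [sinusoid(y, half) + sinusoid(x, channels - half) for x in range(width)]
        for y in range(height)
    ]


def add_encoding(f0, p):
    """Element-wise sum F = F0 + P over feature maps of identical shape."""
    return [
        [[a + b for a, b in zip(f_cell, p_cell)] for f_cell, p_cell in zip(f_row, p_row)]
        for f_row, p_row in zip(f0, p)
    ]


# Toy initial feature map F0: 2 rows, 3 columns, 4 channels, all zeros.
H, W, C = 2, 3, 4
F0 = [[[0.0] * C for _ in range(W)] for _ in range(H)]
P = positional_encoding_2d(H, W, C)
F = add_encoding(F0, P)
```

    Because the encoding depends on both the row and the column index, every location in F carries its 2-dimensional position, which is what lets the later encoder and decoder stages handle horizontal and vertical text with the same feature map.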