METHOD AND APPARATUS OF RECTIFYING TEXT IMAGE, TRAINING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND MEDIUM

    Publication No.: EP4123595A3

    Publication date: 2023-05-24

    Application No.: EP22212042.0

    Filing date: 2022-12-07

    Abstract: The present disclosure provides a method and an apparatus of rectifying a text image, a training method and apparatus, an electronic device, and a medium, which relate to the field of artificial intelligence technology, in particular to the fields of computer vision, deep learning, intelligent transportation, and high-precision maps. A specific implementation includes: performing, based on a gating strategy, a plurality of first layer-wise processing operations on a text image to be rectified, so as to obtain respective feature maps of a plurality of layer levels, wherein each of the feature maps includes a text structural feature related to the text image to be rectified, and the gating strategy is configured to increase attention to the text structural feature; and performing a plurality of second layer-wise processing operations on the respective feature maps of the plurality of layer levels, so as to obtain a rectified text image corresponding to the text image to be rectified.

    METHOD AND APPARATUS OF RECTIFYING TEXT IMAGE, TRAINING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND MEDIUM

    Publication No.: EP4123595A2

    Publication date: 2023-01-25

    Application No.: EP22212042.0

    Filing date: 2022-12-07

    IPC classification: G06V10/82 G06V30/40 G06V30/16

    Abstract: The present disclosure provides a method and an apparatus of rectifying a text image, a training method and apparatus, an electronic device, and a medium, which relate to the field of artificial intelligence technology, in particular to the fields of computer vision, deep learning, intelligent transportation, and high-precision maps. A specific implementation includes: performing, based on a gating strategy, a plurality of first layer-wise processing operations on a text image to be rectified, so as to obtain respective feature maps of a plurality of layer levels, wherein each of the feature maps includes a text structural feature related to the text image to be rectified, and the gating strategy is configured to increase attention to the text structural feature; and performing a plurality of second layer-wise processing operations on the respective feature maps of the plurality of layer levels, so as to obtain a rectified text image corresponding to the text image to be rectified.
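
The two-stage flow described in the abstract (gated layer-wise processing that yields one feature map per layer level, followed by a second pass that combines those maps) can be illustrated with a toy sketch. This is not the claimed implementation: the abstract does not specify the gate, the layers, or how the rectified image is produced. The sigmoid gate, the 2x2 average pooling, the nearest-neighbour upsampling, and all function names below are assumptions chosen only to make the data flow concrete, and the output is a merged feature map rather than a rectified image.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gate_features(fmap):
    # Hypothetical gate (an assumption, not from the source): weight each cell
    # by a sigmoid of its deviation from the map mean, so that cells standing
    # out from the background receive more weight.
    mean = sum(map(sum, fmap)) / (len(fmap) * len(fmap[0]))
    return [[v * sigmoid(v - mean) for v in row] for row in fmap]


def downsample(fmap):
    # 2x2 average pooling; assumes even height and width.
    h, w = len(fmap), len(fmap[0])
    return [
        [(fmap[i][j] + fmap[i][j + 1] + fmap[i + 1][j] + fmap[i + 1][j + 1]) / 4.0
         for j in range(0, w, 2)]
        for i in range(0, h, 2)
    ]


def upsample(fmap):
    # Nearest-neighbour 2x upsampling.
    out = []
    for row in fmap:
        wide = [v for v in row for _ in (0, 1)]
        out.append(wide)
        out.append(list(wide))
    return out


def encode(image, levels=3):
    # "First layer-wise processing": one gated feature map per layer level.
    maps, current = [], image
    for _ in range(levels):
        current = gate_features(current)
        maps.append(current)
        current = downsample(current)
    return maps


def decode(maps):
    # "Second layer-wise processing": merge from the coarsest level upward,
    # adding each finer map as a skip connection.
    merged = maps[-1]
    for skip in reversed(maps[:-1]):
        up = upsample(merged)
        merged = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(up, skip)]
    return merged


# Toy 8x8 checkerboard standing in for a text image.
image = [[float((i + j) % 2) for j in range(8)] for i in range(8)]
feature_maps = encode(image, levels=3)
output = decode(feature_maps)
```

In this sketch `encode` returns maps of size 8x8, 4x4, and 2x2, and `decode` returns an 8x8 map. A real system would replace the hand-written gate and pooling with learned layers and would add a final step that produces the rectified image.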