TEXT LOCATION METHOD AND APPARATUS

    Publication No.: US20210303901A1

    Publication Date: 2021-09-30

    Application No.: US16836662

    Application Date: 2020-03-31

    Inventor: Junchao Wei

    Abstract: Aspects of the present invention provide a new text location technique, which can be applied to general handwriting detection at a variety of levels, including characters, words, and sentences. The inventive technique is efficient in training deep learning systems to locate text. The technique works for different languages, for text in different orientations, and for overlapping text. In one aspect, the technique's ability to separate overlapping text also makes the technique useful in application to overlapping objects. Embodiments take advantage of a so-called skyline appearance that text tends to have. Recognizing a skyline appearance for text can facilitate the proper identification of bounding boxes for the text. Even in the case of overlapping text, discernment of a skyline appearance for words can help with the proper identification of bounding boxes for each of the overlapping text words/phrases, thereby facilitating the separation of the text for purposes of recognition.

    DIGITAL STAMP LOCALIZATION AND OVERLAPPING TEXT REMOVAL METHOD AND APPARATUS

    Publication No.: US20250111688A1

    Publication Date: 2025-04-03

    Application No.: US18478979

    Application Date: 2023-09-29

    Inventor: Junchao Wei

    Abstract: In a form recognition system, a deep learning system may be trained to perform stamp localization for stamp removal to facilitate form recognition. In embodiments, a stamp mask identifies locations of stamps or seals on forms, and a line mask identifies pixels of the stamps. Where a stamp or seal overlaps with underlying text on a form, and a color or grayscale of the stamp or seal is sufficiently similar to that of the underlying text, a combination of the stamp mask and the line mask may enable removal of the stamp or seal without degrading the underlying text in the form, and facilitate form recognition.
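
    The mask combination described in this abstract can be illustrated with a minimal sketch. It assumes that "combination" means a per-pixel logical AND of the two masks, and that removal means overwriting the selected pixels with a background value; the abstract confirms neither detail. The function name remove_stamp, the background parameter, and the toy 3x4 grayscale image are hypothetical, and in the described system both masks would be predicted by a trained deep learning model rather than written by hand.

```python
def remove_stamp(image, stamp_mask, line_mask, background=255):
    """Overwrite pixels flagged by BOTH masks with the background value.

    Assumption (not stated in the abstract): the two masks are combined
    with a per-pixel logical AND. All names here are illustrative.
    """
    cleaned = []
    for img_row, stamp_row, line_row in zip(image, stamp_mask, line_mask):
        cleaned.append([
            background if (in_stamp and is_stroke) else pixel
            for pixel, in_stamp, is_stroke in zip(img_row, stamp_row, line_row)
        ])
    return cleaned


# Toy grayscale form: 255 = paper, 40 = text ink, 90 = stamp ink.
image = [
    [255, 90, 90, 255],
    [40, 40, 90, 255],
    [255, 90, 90, 255],
]
# Stamp mask: the region where the stamp sits (columns 1 to 3).
stamp_mask = [
    [0, 1, 1, 1],
    [0, 1, 1, 1],
    [0, 1, 1, 1],
]
# Line mask: pixels taken to be stamp strokes. The entry at row 1,
# column 0 is a deliberate false positive lying outside the stamp region.
line_mask = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [0, 1, 1, 0],
]

cleaned = remove_stamp(image, stamp_mask, line_mask)
for row in cleaned:
    print(row)
```

    In this toy case the text pixel inside the stamp region is kept because the line mask does not flag it, and the false positive outside the stamp region is kept because the stamp mask does not cover it. A pixel where stamp ink and text ink coincide would need handling that this sketch does not attempt.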

    Deep-learning based text correction method and apparatus

    Publication No.: US12061869B2

    Publication Date: 2024-08-13

    Application No.: US17515230

    Application Date: 2021-10-29

    Inventor: Junchao Wei

    Abstract: A text correction method and apparatus can take advantage of a greatly reduced number of error-ground truth pairs to train a deep learning model. To generate these error-ground truth pairs, different characters in a ground truth word are replaced with a symbol, not appearing in any ground truth words, to generate error words which are paired with that ground truth word to provide error-ground truth word pairs. This process may be repeated for all ground truth words for which training is to be performed. In embodiments, pairs of characters in a ground truth word may be replaced with a symbol to generate the error words which are paired with that ground truth word to provide error-ground truth word pairs. Again, this process may be repeated for all ground truth words for which training is to be performed.
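
    The pair-generation step described in this abstract can be sketched as follows. The function name make_error_pairs, the placeholder symbol "#", and the n_replace parameter are hypothetical. The sketch also assumes that "pairs of characters" means any two positions in the word; the abstract does not say whether the two characters must be adjacent.

```python
from itertools import combinations


def make_error_pairs(word, symbol="#", n_replace=1):
    """Return (error_word, ground_truth_word) pairs for one word.

    Each error word is the ground truth word with n_replace characters
    replaced by a placeholder symbol. Names and defaults are illustrative.
    """
    if symbol in word:
        # The abstract requires a symbol that appears in no ground truth word.
        raise ValueError("placeholder symbol appears in the ground truth word")
    pairs = []
    for positions in combinations(range(len(word)), n_replace):
        chars = list(word)
        for i in positions:
            chars[i] = symbol
        pairs.append(("".join(chars), word))
    return pairs


# One character replaced at a time.
single = make_error_pairs("text")
# Two characters replaced at a time (assumed: any two positions).
double = make_error_pairs("text", n_replace=2)

for error_word, truth in single + double:
    print(error_word, "->", truth)
```

    Repeating the call for every word in the training vocabulary yields one error word per position (or per position pair) instead of one per possible wrong character, which is consistent with the reduced number of pairs the abstract mentions.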

    METHOD AND APPARATUS FOR CUSTOMIZED DEEP LEARNING-BASED TEXT CORRECTION

    Publication No.: US20230096700A1

    Publication Date: 2023-03-30

    Application No.: US17491122

    Application Date: 2021-09-30

    Inventor: Junchao Wei

    Abstract: A text correction engine meets different and changing end user requirements, with the ability to change a desired output by providing sufficient amounts of data, and by finetuning the appropriate text correction engine at the point of origin of the data. It is possible to retain confidentiality of data by retraining the base deep learning model at the base deep learning model's point of origin, to improve the base deep learning model's performance, making the base deep learning model more accurate for different contexts. Separate training of an end user model, leaving the base deep learning model intact, streamlines end user model training, and highlights desirable changes in the base deep learning model for further training or retraining.

    Text location method and apparatus

    Publication No.: US11270146B2

    Publication Date: 2022-03-08

    Application No.: US16836662

    Application Date: 2020-03-31

    Inventor: Junchao Wei

    Abstract: Aspects of the present invention provide a new text location technique, which can be applied to general handwriting detection at a variety of levels, including characters, words, and sentences. The inventive technique is efficient in training deep learning systems to locate text. The technique works for different languages, for text in different orientations, and for overlapping text. In one aspect, the technique's ability to separate overlapping text also makes the technique useful in application to overlapping objects. Embodiments take advantage of a so-called skyline appearance that text tends to have. Recognizing a skyline appearance for text can facilitate the proper identification of bounding boxes for the text. Even in the case of overlapping text, discernment of a skyline appearance for words can help with the proper identification of bounding boxes for each of the overlapping text words/phrases, thereby facilitating the separation of the text for purposes of recognition.
