-
公开(公告)号:US11537605B2
公开(公告)日:2022-12-27
申请号:US17218028
申请日:2021-03-30
Inventor: Junchao Wei
IPC: G06F16/242 , G06F16/22 , G06N3/08 , G06F16/25
Abstract: In some forms containing keywords and content, there may be nested levels of keywords, also referred to as a hierarchy. Content in the forms may be associated with one or more keywords in one or more of the nested levels, or in the hierarchy. Identifying keywords in adjacent cells in a table (with a nested keyword being either to the right of or below another keyword) enables distinguishing between keywords and content in filled forms, and enables correct association of content with respective keywords.
-
公开(公告)号:US20210303901A1
公开(公告)日:2021-09-30
申请号:US16836662
申请日:2020-03-31
Inventor: Junchao Wei
Abstract: Aspects of the present invention provide a new text location technique, which can be applied to general handwriting detection at a variety of levels, including characters, words, and sentences. The inventive technique is efficient in training deep learning systems to locate text. The technique works for different languages, for text in different orientations, and for overlapping text. In one aspect, the technique's ability to separate overlapping text also makes the technique useful in application to overlapping objects. Embodiments take advantage of a so-called skyline appearance that text tends to have. Recognizing a skyline appearance for text can facilitate the proper identification of bounding boxes for the text. Even in the case of overlapping text, discernment of a skyline appearance for words can help with the proper identification of bounding boxes for each of the overlapping text words/phrases, thereby facilitating the separation of the text for purposes of recognition.
-
公开(公告)号:US20250111688A1
公开(公告)日:2025-04-03
申请号:US18478979
申请日:2023-09-29
Inventor: Junchao Wei
IPC: G06V30/148 , G06V30/18
Abstract: In a form recognition system, a deep learning system may be trained to perform stamp localization for stamp removal to facilitate form recognition. In embodiments, a stamp mask identifies locations of stamps or seals on forms, and a line mask identifies pixels of the stamps. Where a stamp or seal overlaps with underlying text on a form, and a color or grayscale of the stamp or seal is sufficiently similar to that of the underlying text, a combination of the stamp mask and the line mask may enable removal of the stamp or seal without degrading the underlying text in the form, and facilitate form recognition.
-
公开(公告)号:US11748341B2
公开(公告)日:2023-09-05
申请号:US17218026
申请日:2021-03-30
Inventor: Junchao Wei
IPC: G06F16/242 , G06F16/2453 , G06N3/08 , G06F16/2452 , G06F16/22 , G06F16/2458
CPC classification number: G06F16/2428 , G06F16/2282 , G06F16/2462 , G06F16/24522 , G06F16/24534 , G06N3/08
Abstract: In different kinds of forms with incomplete lines, or with different color cells in lieu of lines, virtually completing or providing the lines enables formation of tables from which keywords and content in the forms may be identified. Where a form may have one or more such tables, as can be the case with forms with irregular formats, multiple tables may be identified, to facilitate identification of keywords and content in each such table. In embodiments, deep learning techniques may be applied. Cost analysis involving minimum distances between keywords and content may be performed, with the cost analysis also facilitating formation of a keyword dictionary and a content dictionary.
-
公开(公告)号:US12008826B2
公开(公告)日:2024-06-11
申请号:US17491122
申请日:2021-09-30
Inventor: Junchao Wei
IPC: G06V30/12 , G06F18/20 , G06F18/214 , G06N3/045 , G06V30/18
CPC classification number: G06V30/12 , G06F18/214 , G06F18/285 , G06N3/045 , G06V30/18
Abstract: A text correction engine meets different and changing end user requirements, with the ability to change a desired output by providing sufficient amounts of data, and by finetuning the appropriate text correction engine at the point of origin of the data. It is possible to retain confidentiality of data by retraining the base deep learning model at the base deep learning model's point of origin, to improve the base deep learning model's performance, making the base deep learning model more accurate for different contexts. Separate training of an end user model, leaving the base deep learning model intact, streamlines end user model training, and highlights desirable changes in the base deep learning model for further training or retraining.
-
公开(公告)号:US11354940B2
公开(公告)日:2022-06-07
申请号:US16836525
申请日:2020-03-31
Inventor: Junchao Wei
Abstract: A method and system to detect visual spoofing of a process of authenticating a person's identity employs computer vision techniques to define characteristics of different kinds of spoofing. Embodiments identify a foreground object within an image and by examining positions and/or orientations of that foreground object within the image, determine whether the presentation of the foreground object is an attempt to spoof the authentication process.
-
公开(公告)号:US12061869B2
公开(公告)日:2024-08-13
申请号:US17515230
申请日:2021-10-29
Inventor: Junchao Wei
IPC: G06K9/00 , G06F18/214 , G06F40/232 , G06V30/12 , G06V30/196 , G06V30/10
CPC classification number: G06F40/232 , G06F18/214 , G06V30/12 , G06V30/1983 , G06V30/10
Abstract: A text correction method and apparatus can take advantage of a greatly reduced number of error-ground truth pairs to train a deep learning model. To generate these error-ground truth pairs, different characters in a ground truth word are replaced with a symbol, not appearing in any ground truth words, to generate error words which are paired with that ground truth word to provide error-ground truth word pairs. This process may be repeated for all ground truth words for which training is to be performed. In embodiments, pairs of characters in a ground truth word may be replaced with a symbol to generate the error words which are paired with that ground truth word to provide error-ground truth word pairs. Again, this process may be repeated for all ground truth words for which training is to be performed.
-
公开(公告)号:US20230096700A1
公开(公告)日:2023-03-30
申请号:US17491122
申请日:2021-09-30
Inventor: Junchao Wei
Abstract: A text correction engine meets different and changing end user requirements, with the ability to change a desired output by providing sufficient amounts of data, and by finetuning the appropriate text correction engine at the point of origin of the data. It is possible to retain confidentiality of data by retraining the base deep learning model at the base deep learning model's point of origin, to improve the base deep learning model's performance, making the base deep learning model more accurate for different contexts. Separate training of an end user model, leaving the base deep learning model intact, streamlines end user model training, and highlights desirable changes in the base deep learning model for further training or retraining.
-
公开(公告)号:US11270146B2
公开(公告)日:2022-03-08
申请号:US16836662
申请日:2020-03-31
Inventor: Junchao Wei
IPC: G06K9/46 , G06T11/20 , G02B27/46 , G06K9/72 , G06F40/10 , H04N1/00 , G06F40/20 , G06T9/00 , G06F40/00
Abstract: Aspects of the present invention provide a new text location technique, which can be applied to general handwriting detection at a variety of levels, including characters, words, and sentences. The inventive technique is efficient in training deep learning systems to locate text. The technique works for different languages, for text in different orientations, and for overlapping text. In one aspect, the technique's ability to separate overlapping text also makes the technique useful in application to overlapping objects. Embodiments take advantage of a so-called skyline appearance that text tends to have. Recognizing a skyline appearance for text can facilitate the proper identification of bounding boxes for the text. Even in the case of overlapping text, discernment of a skyline appearance for words can help with the proper identification of bounding boxes for each of the overlapping text words/phrases, thereby facilitating the separation of the text for purposes of recognition.
-
公开(公告)号:US20210303890A1
公开(公告)日:2021-09-30
申请号:US16836525
申请日:2020-03-31
Inventor: Junchao Wei
Abstract: A method and system to detect visual spoofing of a process of authenticating a person's identity employs computer vision techniques to define characteristics of different kinds of spoofing. Embodiments identify a foreground object within an image and by examining positions and/or orientations of that foreground object within the image, determine whether the presentation of the foreground object is an attempt to spoof the authentication process.
-
-
-
-
-
-
-
-
-