Text recognition in image
    34.
    发明授权

    公开(公告)号:US11823471B2

    公开(公告)日:2023-11-21

    申请号:US17795446

    申请日:2021-01-20

    Abstract: According to implementations of the subject matter described herein, there is provided a solution for text recognition in an image. In this solution, a target text line area, which is expected to include a text to be recognized, is determined from an image. Probability distribution information of a character model element(s) present in the target text line area is determined using a single character model. The single character model is trained based on training text line areas and respective ground-truth texts in the training text line areas. Texts in the training text line areas are arranged in different orientations, and/or the ground-truth texts comprise texts are related to various languages (e.g., texts related to a Latin and an Eastern languages). The text in the target text line area can be determined based on the determined probability distribution information. The single character model enables more efficient and convenient text recognition.

    Text feature guided visual based document classifier

    公开(公告)号:US11720605B1

    公开(公告)日:2023-08-08

    申请号:US17876069

    申请日:2022-07-28

    Applicant: Intuit Inc.

    CPC classification number: G06F16/287 G06F16/24578 G06F16/93 G06V30/1444

    Abstract: A visual-based classification model influenced by text features as a result of the outputs of a text-based classification model is disclosed. A system receives one or more documents to be classified based on one or more visual features and provides the one or more documents to a student classification model, which is a visual-based classification model. The system also classifies, by the student classification model, the one or more documents into one or more document types based on one or more visual features. The one or more visual features are generated by the student classification model that is trained based on important text identified by a teacher classification model for the one or more document types, with the teacher classification model being a text-based classification model. Generating training data and training the student classification model based on the training data are also described.

Patent Agency Ranking