UTILIZING MACHINE-LEARNING BASED OBJECT DETECTION TO IMPROVE OPTICAL CHARACTER RECOGNITION

    公开(公告)号:US20230094787A1

    公开(公告)日:2023-03-30

    申请号:US17490610

    申请日:2021-09-30

    Applicant: Adobe Inc.

    Abstract: The present disclosure relates to systems, methods, and non-transitory computer readable media for accurately enhancing optical character recognition with a machine learning approach for determining words from reverse text, vertical text, and atypically-sized text. For example, the disclosed systems segment a digital image into text regions and non-text regions utilizing an object detection machine learning model. Within the text regions, the disclosed systems can determine reverse text glyphs, vertical text glyphs, and/or atypically-sized text glyphs utilizing an edge based adaptive binarization model. Additionally, the disclosed systems can utilize respective modification techniques to manipulate reverse text glyphs, vertical text glyphs, and/or atypically-sized glyphs for analysis by an optical character recognition model. The disclosed systems can further utilize an optical character recognition model to determine words from the modified versions of the reverse text glyphs, the vertical text glyphs, and/or the atypically-sized text glyphs.

    Analytics based on scalable hierarchical categorization of web content

    公开(公告)号:US11507551B2

    公开(公告)日:2022-11-22

    申请号:US16536061

    申请日:2019-08-08

    Applicant: ADOBE INC.

    Abstract: Various methods and systems for performing analytics based on hierarchical categorization of content are provided. Analytics can be performed using an index building workflow and a classification workflow. In the index building workflow, documents are received and analyzed to extract features from the documents. Hierarchical category paths can be identified for the features. The documents are indexed to support searching the documents for the hierarchical category paths. In the classification workflow, a query, that includes or references content, may be received and analyzed to extract features from the content. The features are executed against a search engine that returns search result documents associated with hierarchical category paths. The hierarchical category paths from the search result documents may be used to generate a topic model of the content associated with the query. The topic model, used for web analytics, includes scores for the hierarchical category paths and for enumerated category topics.

    IMAGE SHADOW DETECTION USING MULTIPLE IMAGES

    公开(公告)号:US20200242388A1

    公开(公告)日:2020-07-30

    申请号:US16260762

    申请日:2019-01-29

    Applicant: Adobe Inc.

    Abstract: While a user holds a camera positioned relative to an object, a first image of the object and a second image of the object, as captured by the camera, may be obtained. Intensity variations between a first intensity map of the first image and a combination intensity map obtained from the first intensity map and a second intensity map of the second image may be compared. Then, a shadow may be identified within the first image, based on the intensity variations.

    Real Time Shadow and Glare Analysis for Document Capture

    公开(公告)号:US20200077005A1

    公开(公告)日:2020-03-05

    申请号:US16116764

    申请日:2018-08-29

    Applicant: Adobe Inc.

    Abstract: There are described mobile devices, and methods thereof, for real time analyses of captured images. The device determines whether a light level of a first image meets or exceeds a low light threshold. The device also determines whether a shadow level of the first image meets or exceeds a shadow threshold in response to determining the light level does not meet or exceed the low light threshold. The light source is activated in response to determining the light level meets or exceeds the low light threshold or the shadow level meets or exceeds the shadow threshold. The device further determines whether a glare level of a second image meets or exceeds a glare threshold in response to identifying the second image. If so, then a final image is captured with the light source inactive. Otherwise, the final image is captured with the light source active.

    Image shadow detection using multiple images

    公开(公告)号:US10997453B2

    公开(公告)日:2021-05-04

    申请号:US16260762

    申请日:2019-01-29

    Applicant: Adobe Inc.

    Abstract: While a user holds a camera positioned relative to an object, a first image of the object and a second image of the object, as captured by the camera, may be obtained. Intensity variations between a first intensity map of the first image and a combination intensity map obtained from the first intensity map and a second intensity map of the second image may be compared. Then, a shadow may be identified within the first image, based on the intensity variations.

    Analytics based on scalable hierarchical categorization of web content

    公开(公告)号:US10417301B2

    公开(公告)日:2019-09-17

    申请号:US14482514

    申请日:2014-09-10

    Applicant: ADOBE INC.

    Abstract: Various methods and systems for performing analytics based on hierarchical categorization of content are provided. Analytics can be performed using an index building workflow and a classification workflow. In the index building workflow, documents are received and analyzed to extract features from the documents. Hierarchical category paths can be identified for the features. The documents are indexed to support searching the documents for the hierarchical category paths. In the classification workflow, a query, that includes or references content, may be received and analyzed to extract features from the content. The features are executed against a search engine that returns search result documents associated with hierarchical category paths. The hierarchical category paths from the search result documents may be used to generate a topic model of the content associated with the query. The topic model, used for web analytics, includes scores for the hierarchical category paths and for enumerated category topics.

Patent Agency Ranking