Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks

    Publication No.: US10783400B2

    Publication date: 2020-09-22

    Application No.: US16224698

    Filing date: 2018-12-18

    Applicant: Dropbox, Inc.

    Abstract: The present disclosure relates to generating computer searchable text from digital images that depict documents utilizing an orientation neural network and/or text prediction neural network. For example, one or more embodiments detect digital images that depict documents, identify the orientation of the depicted documents, and generate computer searchable text from the depicted documents in the detected digital images. In particular, one or more embodiments train an orientation neural network to identify the orientation of a depicted document in a digital image. Additionally, one or more embodiments train a text prediction neural network to analyze a depicted document in a digital image to generate computer searchable text from the depicted document. By utilizing the identified orientation of the depicted document before analyzing the depicted document with a text prediction neural network, the disclosed systems can efficiently and accurately generate computer searchable text for a digital image that depicts a document.
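The order of operations in the abstract above (detect a depicted document, identify its orientation, then generate text) can be sketched as follows. This is a minimal illustration, not the patented implementation. The three callables `depicts_document`, `predict_orientation`, and `predict_text` are hypothetical stand-ins for the detection step and the two neural networks. The toy list-of-rows image and the restriction of orientation to multiples of 90 degrees are also assumptions made for brevity.

```python
from typing import Callable, List, Optional

Image = List[List[int]]  # toy grayscale image: rows of pixel values (assumption)


def rotate_90_clockwise(image: Image) -> Image:
    """Rotate a row-major image by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def make_upright(image: Image, orientation_degrees: int) -> Image:
    """Undo a predicted clockwise rotation of 0, 90, 180, or 270 degrees."""
    if orientation_degrees not in (0, 90, 180, 270):
        raise ValueError("orientation must be 0, 90, 180, or 270")
    # Rotating clockwise by (360 - orientation) cancels the predicted rotation.
    turns = ((360 - orientation_degrees) % 360) // 90
    for _ in range(turns):
        image = rotate_90_clockwise(image)
    return image


def searchable_text(
    image: Image,
    depicts_document: Callable[[Image], bool],   # hypothetical document detector
    predict_orientation: Callable[[Image], int],  # hypothetical orientation network
    predict_text: Callable[[Image], str],         # hypothetical text prediction network
) -> Optional[str]:
    """Return searchable text for a document image, or None for other images."""
    if not depicts_document(image):
        return None
    upright = make_upright(image, predict_orientation(image))
    # The text model only ever sees upright input.
    return predict_text(upright)
```

Correcting orientation before text prediction means the text model does not have to handle rotated input, which is the efficiency and accuracy argument the abstract makes.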

    ENHANCING DOCUMENTS PORTRAYED IN DIGITAL IMAGES

    Publication No.: US20180024974A1

    Publication date: 2018-01-25

    Application No.: US15658289

    Filing date: 2017-07-24

    Applicant: DROPBOX, INC.

    Abstract: The present disclosure is directed toward systems and methods that efficiently and effectively generate an enhanced document image of a displayed document in an image frame captured from a live image feed. For example, systems and methods described herein apply a document enhancement process to a displayed document in an image frame that results in an enhanced document image that is cropped, rectified, un-shadowed, and with dark text against a mostly white background. Additionally, systems and methods described herein determine whether a stored digital content item includes a displayed document. In response to determining that a stored digital content item does include a displayed document, systems and methods described herein generate an enhanced document image of a displayed document included in the stored digital content item.

    GENERATING SEARCHABLE TEXT FOR DOCUMENTS PORTRAYED IN A REPOSITORY OF DIGITAL IMAGES UTILIZING ORIENTATION AND TEXT PREDICTION NEURAL NETWORKS

    Publication No.: US20190311227A1

    Publication date: 2019-10-10

    Application No.: US16224698

    Filing date: 2018-12-18

    Applicant: Dropbox, Inc.

    Abstract: The present disclosure relates to generating computer searchable text from digital images that depict documents utilizing an orientation neural network and/or text prediction neural network. For example, one or more embodiments detect digital images that depict documents, identify the orientation of the depicted documents, and generate computer searchable text from the depicted documents in the detected digital images. In particular, one or more embodiments train an orientation neural network to identify the orientation of a depicted document in a digital image. Additionally, one or more embodiments train a text prediction neural network to analyze a depicted document in a digital image to generate computer searchable text from the depicted document. By utilizing the identified orientation of the depicted document before analyzing the depicted document with a text prediction neural network, the disclosed systems can efficiently and accurately generate computer searchable text for a digital image that depicts a document.
