Invention Grant
- Patent Title: Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
- Application No.: US17020519
- Application Date: 2020-09-14
- Publication No.: US11645826B2
- Publication Date: 2023-05-09
- Inventor: David J. Kriegman, Peter N. Belhumeur, Bradley Neuberg, Leonard Fink
- Applicant: Dropbox, Inc.
- Applicant Address: US CA San Francisco
- Assignee: Dropbox, Inc.
- Current Assignee: Dropbox, Inc.
- Current Assignee Address: US CA San Francisco
- Agency: Keller Preece PLLC
- Main IPC: G06K9/62
- IPC: G06K9/62; G06V10/24; G06F18/214; G06F18/21; G06V30/40; G06V30/19; G06V10/82; G06V20/62

Abstract:
The present disclosure relates to generating computer searchable text from digital images that depict documents utilizing an orientation neural network and/or text prediction neural network. For example, one or more embodiments detect digital images that depict documents, identify the orientation of the depicted documents, and generate computer searchable text from the depicted documents in the detected digital images. In particular, one or more embodiments train an orientation neural network to identify the orientation of a depicted document in a digital image. Additionally, one or more embodiments train a text prediction neural network to analyze a depicted document in a digital image to generate computer searchable text from the depicted document. By utilizing the identified orientation of the depicted document before analyzing the depicted document with a text prediction neural network, the disclosed systems can efficiently and accurately generate computer searchable text for a digital image that depicts a document.
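The ordering described in the abstract (detect a document, identify its orientation, then predict text from the corrected image) can be illustrated with a minimal Python sketch. This is not the patented implementation: the names `rotate_cw` and `searchable_text`, the toy list-of-rows image type, and the convention that the orientation predictor returns a clockwise angle of 0, 90, 180, or 270 degrees are all assumptions made for illustration. The detector and the two neural networks are stood in for by plain callables.

```python
from typing import Callable, List, Optional

# Toy image type (assumption): rows of grayscale pixel values.
Image = List[List[int]]


def rotate_cw(image: Image, quarter_turns: int) -> Image:
    """Rotate a row-major image clockwise by quarter_turns * 90 degrees."""
    for _ in range(quarter_turns % 4):
        # Reversing the rows and transposing gives one clockwise quarter turn.
        image = [list(row) for row in zip(*image[::-1])]
    return image


def searchable_text(
    image: Image,
    is_document: Callable[[Image], bool],
    predict_orientation: Callable[[Image], int],
    predict_text: Callable[[Image], str],
) -> Optional[str]:
    """Hypothetical pipeline: detect, correct orientation, then predict text.

    predict_orientation is assumed to return how far the depicted document
    is rotated clockwise from upright, as one of 0, 90, 180, or 270.
    """
    if not is_document(image):
        return None  # nothing to index for non-document images
    degrees = predict_orientation(image)
    # Undo the detected rotation by completing the full turn.
    upright = rotate_cw(image, (360 - degrees) // 90)
    return predict_text(upright)
```

In this sketch the text predictor only ever sees upright input, which is the benefit the abstract attributes to identifying orientation before text prediction.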