Using deep learning techniques to determine the contextual reading order in a form document

Invention Grant

US10423828B2 Using deep learning techniques to determine the contextual reading order in a form document (Active)

Patent Title: Using deep learning techniques to determine the contextual reading order in a form document
Application No.: US15843953

Application Date: 2017-12-15
Publication No.: US10423828B2

Publication Date: 2019-09-24
Inventor: Shagun Sodhani, Kartikay Garg, Balaji Krishnamurthy
Applicant: Adobe Inc.
Applicant Address: US CA San Jose
Assignee: Adobe Inc.
Current Assignee: Adobe Inc.
Current Assignee Address: US CA San Jose
Agency: Finch & Maloney PLLC
Main IPC: G06K9/62
IPC: G06K9/62; G06K9/00

Abstract:

Techniques are disclosed for determining reading order in a document. A current labeled text run (R1), a RIGHT text run (R2) and a DOWN text run (R3) are generated. The R1 labeled text run is processed by a first LSTM, the R2 labeled text run is processed by a second LSTM, and the R3 labeled text run is processed by a third LSTM, wherein each of the LSTMs generates a respective internal representation (R1′, R2′ and R3′). Deep learning tools other than LSTMs can be used, as will be appreciated. The respective internal representations R1′, R2′ and R3′ are concatenated or otherwise combined into a vector or tensor representation and provided to a classifier network that generates a predicted label for a next text run as RIGHT, DOWN or EOS in the reading order of the document.
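
The data flow described in the abstract (three text runs, three LSTMs, concatenation, classifier) can be illustrated with a minimal, untrained sketch. Everything below is an assumption made for illustration and is not taken from the patent: the names TinyLSTM, ReadingOrderModel and char_features are hypothetical, text runs are treated as plain character strings, each character is encoded as 8 bits, biases are omitted, and the classifier is a single linear layer followed by a softmax. The weights are random, so the predicted label carries no meaning until the model is trained.

```python
import math
import random

LABELS = ["RIGHT", "DOWN", "EOS"]
INPUT_SIZE = 8  # assumed encoding: low 8 bits of each character code


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def softmax(scores):
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vec):
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def char_features(ch):
    # Hypothetical character encoding, chosen only to keep the sketch small.
    return [float((ord(ch) >> b) & 1) for b in range(INPUT_SIZE)]


class TinyLSTM:
    """Single-layer LSTM over characters (no biases); returns the last hidden state."""

    def __init__(self, rng, hidden_size):
        self.hidden_size = hidden_size
        # One weight matrix per gate, applied to [input ; previous hidden state].
        self.gates = {
            name: rand_matrix(rng, hidden_size, INPUT_SIZE + hidden_size)
            for name in "ifog"
        }

    def encode(self, text):
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        for ch in text:
            z = char_features(ch) + h  # list concatenation
            i = [sigmoid(a) for a in matvec(self.gates["i"], z)]    # input gate
            f = [sigmoid(a) for a in matvec(self.gates["f"], z)]    # forget gate
            o = [sigmoid(a) for a in matvec(self.gates["o"], z)]    # output gate
            g = [math.tanh(a) for a in matvec(self.gates["g"], z)]  # candidate
            c = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
            h = [oj * math.tanh(cj) for oj, cj in zip(o, c)]
        return h


class ReadingOrderModel:
    """Three LSTMs (one per text run) feeding a linear softmax classifier."""

    def __init__(self, hidden_size=16, seed=0):
        rng = random.Random(seed)
        self.lstms = [TinyLSTM(rng, hidden_size) for _ in range(3)]
        self.classifier = rand_matrix(rng, len(LABELS), 3 * hidden_size)

    def predict(self, r1, r2, r3):
        # R1', R2', R3': one internal representation per text run.
        reps = [lstm.encode(run) for lstm, run in zip(self.lstms, (r1, r2, r3))]
        combined = reps[0] + reps[1] + reps[2]  # concatenation
        probs = softmax(matvec(self.classifier, combined))
        best = max(range(len(LABELS)), key=probs.__getitem__)
        return LABELS[best], probs


if __name__ == "__main__":
    model = ReadingOrderModel()
    label, probs = model.predict("First name:", "Last name:", "Address:")
    print(label, [round(p, 3) for p in probs])
```

Using a separate LSTM per text run mirrors the abstract's "first", "second" and "third" LSTM; whether the patented system shares weights, uses a different encoding, or combines the representations in some way other than concatenation is not stated in this record.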

Public/Granted literature

US20190188463A1 USING DEEP LEARNING TECHNIQUES TO DETERMINE THE CONTEXTUAL READING ORDER IN A FORM DOCUMENT Public/Granted day: 2019-06-20

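IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)
G06K9/62	.Methods or arrangements for recognition using electronic means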