Invention Grant
- Patent Title: System and method for identifying document genres
- Patent Title (中): 识别文件类型的系统和方法
- Application No.: US12437526
- Application Date: 2009-05-07
- Publication No.: US08260062B2
- Publication Date: 2012-09-04
- Inventor: Francine R. Chen, Yijuan Lu, Matthew Cooper
- Applicant: Francine R. Chen, Yijuan Lu, Matthew Cooper
- Applicant Address: JP Tokyo
- Assignee: Fuji Xerox Co., Ltd.
- Current Assignee: Fuji Xerox Co., Ltd.
- Current Assignee Address: JP Tokyo
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06K9/62
- IPC: G06K9/62; G06K9/46; H04N1/36; G06F12/00

Abstract:
A system, a computer-readable storage medium including instructions, and a method for generating genre models used to identify genres of a document. For each document image in a set of document images that are associated with one or more genres, the document image is segmented into a plurality of tiles, wherein the tiles in the plurality of tiles are sized so that document page features are identifiable, and features of the document image and of the plurality of tiles are computed. At least one genre classifier is trained to classify document images as being associated with one or more genres based on the features of the document images in the set of document images, the features of the plurality of tiles of the set of document images, and the one or more genres associated with each document image in the set of document images.
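
The pipeline the abstract describes (segment each page image into tiles, compute page-level and tile-level features, then train a classifier per genre) can be sketched roughly as below. This is an illustrative toy, not the patented implementation. The abstract names neither the features nor the classifier, so the `ink_density` feature, the 2x2 tile grid, the nearest-centroid classifier, and every function name here are assumptions chosen to keep the example dependency-free.

```python
from statistics import mean


def make_page(dark_rows, size=4):
    """Build a toy grayscale page: rows listed in dark_rows are 0, all others 255."""
    return [[0 if r in dark_rows else 255] * size for r in range(size)]


def segment_into_tiles(image, rows, cols):
    """Split a 2-D image (list of equal-length rows) into rows x cols tiles."""
    height, width = len(image), len(image[0])
    tiles = []
    for r in range(rows):
        for c in range(cols):
            y0, y1 = r * height // rows, (r + 1) * height // rows
            x0, x1 = c * width // cols, (c + 1) * width // cols
            tiles.append([row[x0:x1] for row in image[y0:y1]])
    return tiles


def ink_density(pixels, threshold=128):
    """Fraction of pixels darker than threshold (a hypothetical stand-in feature)."""
    flat = [p for row in pixels for p in row]
    return sum(1 for p in flat if p < threshold) / len(flat)


def compute_features(image, rows=2, cols=2):
    """One page-level feature followed by one feature per tile."""
    tiles = segment_into_tiles(image, rows, cols)
    return [ink_density(image)] + [ink_density(t) for t in tiles]


def centroid(vectors):
    """Component-wise mean of a list of feature vectors."""
    return [mean(col) for col in zip(*vectors)]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_genre_classifiers(images, genre_labels):
    """Train one binary nearest-centroid model per genre.

    genre_labels[i] is the set of genres associated with images[i], so a
    page may count as a positive example for more than one genre.
    """
    feats = [compute_features(img) for img in images]
    models = {}
    for genre in set().union(*genre_labels):
        pos = [f for f, labels in zip(feats, genre_labels) if genre in labels]
        neg = [f for f, labels in zip(feats, genre_labels) if genre not in labels]
        if pos and neg:  # a genre needs both positive and negative examples
            models[genre] = (centroid(pos), centroid(neg))
    return models


def classify(models, image):
    """Return every genre whose positive centroid is closer than its negative one."""
    f = compute_features(image)
    return {g for g, (pos, neg) in models.items() if sq_dist(f, pos) < sq_dist(f, neg)}


# Toy training set: "memo" pages carry ink at the top, "flyer" pages at the bottom.
train_images = [make_page({0, 1}), make_page({0}), make_page({2, 3}), make_page({3})]
train_labels = [{"memo"}, {"memo"}, {"flyer"}, {"flyer"}]
models = train_genre_classifiers(train_images, train_labels)
print(classify(models, make_page({1})))
```

Training a separate binary model per genre, rather than a single multi-class model, is one simple way to let a page be associated with several genres at once, as the abstract allows. A practical system would substitute real layout or texture features and a stronger classifier.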
Public/Granted literature
- US20100284623A1 SYSTEM AND METHOD FOR IDENTIFYING DOCUMENT GENRES (Public/Granted day: 2010-11-11)