Invention Grant
- Patent Title: Removal of extraneous text from electronic documents
- Patent Title (中): 从电子文件中删除外来文本
- Application No.: US10346795
- Application Date: 2003-01-13
- Publication No.: US07310773B2
- Publication Date: 2007-12-18
- Inventor: Xiaofan Lin
- Applicant: Xiaofan Lin
- Applicant Address: US TX Houston
- Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee: Hewlett-Packard Development Company, L.P.
- Current Assignee Address: US TX Houston
- Main IPC: G06F17/00
- IPC: G06F17/00

Abstract:
Method and apparatus for removing lines of extraneous text from a document. Similarities are identified between lines of text on each page and corresponding lines on a selected subset of pages. Different weight values are associated with different line numbers of text on a page, each weight value indicating a degree of likelihood that a line of text contains extraneous text. One or more lines of text are selectively removed from a page as a function of the similarities and associated weight values of line numbers of the lines of text.
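The approach summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the neighbouring-page window used as the "selected subset", the two-level weights, the `difflib` similarity measure, and the threshold are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def line_weight(index, total, edge_weight=1.0, body_weight=0.2, margin=2):
    """Hypothetical weighting: lines near the top or bottom of a page are
    treated as more likely to be extraneous (headers, footers) than body lines."""
    near_edge = index < margin or index >= total - margin
    return edge_weight if near_edge else body_weight


def similarity(a, b):
    """Similarity in [0, 1] between two lines (assumed measure: difflib ratio)."""
    return SequenceMatcher(None, a.strip(), b.strip()).ratio()


def remove_extraneous(pages, window=2, threshold=0.8):
    """pages: list of pages, each page a list of text lines.
    Returns new pages with likely extraneous lines removed."""
    cleaned = []
    for p, page in enumerate(pages):
        # Assumed "selected subset": pages within `window` of the current page.
        lo, hi = max(0, p - window), min(len(pages), p + window + 1)
        neighbours = [pages[q] for q in range(lo, hi) if q != p]
        kept = []
        for i, line in enumerate(page):
            if not line.strip():
                kept.append(line)  # blank lines are left alone in this sketch
                continue
            best = 0.0
            for other in neighbours:
                # Compare with the same line number counted from the top
                # and from the bottom of the other page.
                for j in (i, i - len(page)):
                    if -len(other) <= j < len(other):
                        best = max(best, similarity(line, other[j]))
            # Removal decision combines similarity with the line-number weight.
            if best * line_weight(i, len(page)) < threshold:
                kept.append(line)
        cleaned.append(kept)
    return cleaned
```

With these assumed defaults, a running title repeated at the top of every page and a footer such as "Page 1" / "Page 2" score above the threshold and are dropped, while lines in the middle of a page are kept even if they happen to repeat, because their weight caps the score below the threshold.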
Public/Granted literature
- US20040139384A1 Removal of extraneous text from electronic documents (Public/Granted day: 2004-07-15)