Invention Grant
- Patent Title: Identification of content in an electronic document
- Patent Title (中): 电子文件内容的识别
-
Application No.: US13663070Application Date: 2012-10-29
-
Publication No.: US09355087B2Publication Date: 2016-05-31
- Inventor: Jean-David Ruvini
- Applicant: eBay Inc.
- Applicant Address: US CA San Jose
- Assignee: eBay Inc.
- Current Assignee: eBay Inc.
- Current Assignee Address: US CA San Jose
- Agency: Schwegman, Lundberg & Woessner, P.A.
- Main IPC: G06F3/00
- IPC: G06F3/00 ; G06F17/27

Abstract:
In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking the plurality of sections as a content section or a non-content section using a visual attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking other plurality of sections of the electronic document in a machine-readable medium.
Public/Granted literature
- US20130055075A1 IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT Public/Granted day:2013-02-28
Information query