Invention Grant
- Patent Title: Learning facts from semi-structured text
- Patent Title (Chinese, translated): Learning facts from semi-structured text
- Application No.: US11142853
- Application Date: 2005-05-31
- Publication No.: US07769579B2
- Publication Date: 2010-08-03
- Inventor: Shubin Zhao, Jonathan T. Betz
- Applicant: Shubin Zhao, Jonathan T. Betz
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Morgan, Lewis & Bockius LLP
- Main IPC: G06F17/30
- IPC: G06F17/30; G06F17/28

Abstract:
A method and system for learning, or bootstrapping, facts from semi-structured text are described. Starting with a set of seed facts associated with an object, documents associated with the object are identified. Each identified document is checked to determine whether it contains at least a first predefined number of the seed facts. If it does, a contextual pattern associated with the seed facts is identified, and other instances of content in the document that match the contextual pattern are located. If the document includes at least a second predefined number of these other instances, facts may be extracted from them.
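
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that a document is plain text with one candidate fact per line, that a fact is an attribute/value pair, and that the "contextual pattern" is simply the literal text surrounding the attribute and value on a line. The names `find_pattern` and `learn_facts` and the thresholds `min_seeds` and `min_matches` are hypothetical, chosen to mirror the "first" and "second" predefined numbers.

```python
import re
from collections import Counter


def find_pattern(line, attr, value):
    """Return the (prefix, separator, suffix) text around attr and value, or None."""
    a = line.find(attr)
    if a < 0:
        return None
    v = line.find(value, a + len(attr))
    if v < 0:
        return None
    return (line[:a], line[a + len(attr):v], line[v + len(value):])


def learn_facts(document, seed_facts, min_seeds=2, min_matches=2):
    """Bootstrap new attribute/value facts from one document (illustrative only)."""
    lines = [ln.strip() for ln in document.splitlines() if ln.strip()]

    # Step 1: locate seed facts in the document and record their context.
    patterns = Counter()
    seeds_found = set()
    for attr, value in seed_facts.items():
        for ln in lines:
            pattern = find_pattern(ln, attr, value)
            if pattern is not None:
                patterns[pattern] += 1
                seeds_found.add(attr)
                break

    # Step 2: require at least the first predefined number of seed facts.
    if len(seeds_found) < min_seeds or not patterns:
        return {}

    # Step 3: take the most common context as the contextual pattern.
    (prefix, sep, suffix), _ = patterns.most_common(1)[0]
    if not sep:
        return {}  # without a separator, attribute and value cannot be split
    regex = re.compile(
        re.escape(prefix) + r"(.+?)" + re.escape(sep) + r"(.+)" + re.escape(suffix)
    )

    # Step 4: collect other instances of content matching the pattern.
    candidates = {}
    for ln in lines:
        m = regex.fullmatch(ln)
        if m and m.group(1) not in seed_facts:
            candidates[m.group(1)] = m.group(2)

    # Step 5: extract only if at least the second predefined number match.
    if len(candidates) < min_matches:
        return {}
    return candidates


doc = """Name: Ada Lovelace
Born: 1815
Died: 1852
Field: Mathematics
Some unrelated sentence here."""

learned = learn_facts(doc, {"Born": "1815", "Died": "1852"})
```

In this made-up example the two seed facts share the context `attribute: value`, so the lines for `Name` and `Field` are extracted while the free-text sentence is ignored. The two thresholds act as a guard: a document that mentions too few seed facts, or that yields too few pattern matches, contributes nothing.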
Public/Granted literature:
- US20060293879A1: Learning facts from semi-structured text (Public/Granted day: 2006-12-28)