Invention Application
- Patent Title: FINDING AND DISAMBIGUATING REFERENCES TO ENTITIES ON WEB PAGES
- Application No.: US14457869
- Application Date: 2014-08-12
- Publication No.: US20140379743A1
- Publication Date: 2014-12-25
- Inventor: Leonardo A. Laroco, Jr., Nikola Jevtic, Nikolai V. Yakovenko, Jeffrey Reynar
- Applicant: Google Inc.
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number of occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity.
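
The iterative process described in the abstract can be illustrated with a minimal Python sketch. This is not the claimed implementation: the function names (`score`, `identify`, `build_model`, `disambiguate`), the additive scoring rule, the fixed threshold, the stopping condition, and the sample documents are all assumptions chosen for illustration. Each document is represented as a set of feature strings, and a model is a mapping from feature to weight.

```python
from collections import Counter

def score(doc_features, model):
    """Sum the model weights of the features present in the document."""
    return sum(model.get(f, 0.0) for f in doc_features)

def identify(docs, model, threshold):
    """Return indices of documents whose score meets the threshold."""
    return [i for i, feats in enumerate(docs) if score(feats, model) >= threshold]

def build_model(docs, matched):
    """Weight each feature by the fraction of matched documents containing it."""
    counts = Counter(f for i in matched for f in docs[i])
    return {f: c / len(matched) for f, c in counts.items()}

def disambiguate(docs, initial_model, threshold, max_rounds=5):
    """Alternate between identifying documents and rebuilding the model."""
    model = dict(initial_model)
    matched = identify(docs, model, threshold)
    for _ in range(max_rounds):
        if not matched:
            break  # nothing to learn from; keep the current model
        model = build_model(docs, matched)
        new_matched = identify(docs, model, threshold)
        if new_matched == matched:
            break  # identifications are stable (assumed stopping rule)
        matched = new_matched
    return model, matched

# Hypothetical documents mentioning an ambiguous name.
docs = [
    {"michael jordan", "basketball", "bulls", "nba"},
    {"michael jordan", "basketball", "nba", "chicago"},
    {"michael jordan", "bulls", "chicago"},
    {"michael jordan", "machine learning", "berkeley"},
]
initial_model = {"michael jordan": 0.5, "basketball": 1.0}

model, matched = disambiguate(docs, initial_model, threshold=1.5)
print(matched)
print(sorted(set(model) - set(initial_model)))
```

In this toy run the initial model only matches documents that mention "basketball"; the rebuilt model picks up co-occurring features such as "bulls" and "chicago", which lets a later round match a document that never uses the seed feature. The features in the final model that were absent from the initial one loosely correspond to the "additional features of the entity" mentioned in the abstract. Reusing one threshold across models of different scale is a simplification.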