Publication number: US20140359409A1
Publication date: 2014-12-04
Application number: US14286770
Application date: 2014-05-23
Applicant: Google Inc.
Inventor: Krzysztof W. Czuba, Jonathan Betz, Jeffrey C. Reynar
IPC: G06F17/22
CPC classification number: G06F17/2235, G06F16/951
Abstract: A repository contains objects representing entities. Each object includes facts about the entity it represents, and the facts are derived from source documents. A synonymous name of an object is determined by (1) identifying a source document from which one or more facts of the entity represented by the object were derived; (2) identifying a plurality of linking documents that link to the source document through hyperlinks, each hyperlink having an anchor text; (3) processing the anchor texts in the plurality of linking documents to generate a collection of synonym candidates for the entity; and (4) selecting a synonymous name for the entity from the collection of synonym candidates.
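The procedure in the abstract (source document, linking documents, anchor texts, synonym candidates, selection) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the `Hyperlink` type, the `normalize`, `synonym_candidates`, and `select_synonyms` helpers, the in-memory `linking_docs` mapping, the example URLs, and in particular the selection rule (a simple frequency threshold, `min_count`), since the abstract does not say how a synonym is chosen from the candidates.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperlink:
    """Hypothetical record for one hyperlink found in a linking document."""
    target_url: str   # document the link points to
    anchor_text: str  # visible text of the link


def normalize(anchor: str) -> str:
    """Lowercase and collapse whitespace so trivially different anchors match."""
    return " ".join(anchor.lower().split())


def synonym_candidates(source_url: str, linking_docs: dict) -> Counter:
    """Count normalized anchor texts of all links that point at source_url."""
    counts = Counter()
    for links in linking_docs.values():
        for link in links:
            if link.target_url == source_url:
                text = normalize(link.anchor_text)
                if text:
                    counts[text] += 1
    return counts


def select_synonyms(candidates: Counter, known_name: str, min_count: int = 2) -> list:
    """Assumed selection rule: keep candidates seen at least min_count times,
    excluding the name the object already has."""
    known = normalize(known_name)
    return [
        text
        for text, n in candidates.most_common()
        if n >= min_count and text != known
    ]


# Usage with made-up data: four linking documents, one source document.
SOURCE = "http://src.example/ibm"
linking_docs = {
    "http://a.example/": [
        Hyperlink(SOURCE, "IBM"),
        Hyperlink("http://other.example/", "click here"),
    ],
    "http://b.example/": [Hyperlink(SOURCE, "Big Blue")],
    "http://c.example/": [Hyperlink(SOURCE, "big  blue")],
    "http://d.example/": [Hyperlink(SOURCE, "ibm")],
}
candidates = synonym_candidates(SOURCE, linking_docs)
synonyms = select_synonyms(candidates, known_name="International Business Machines")
print(synonyms)
```

A frequency threshold is only one plausible way to filter noisy anchors such as "click here"; a real system would likely need additional signals, which this sketch does not attempt to model.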