Learning synonymous object names from anchor texts
    1.
    发明授权
    Learning synonymous object names from anchor texts 有权
    从锚文本学习同义对象名称

    公开(公告)号:US08738643B1

    公开(公告)日:2014-05-27

    申请号:US11833180

    申请日:2007-08-02

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/2235 G06F17/30864

    摘要: A repository contains objects representing entities. The objects also include facts about the represented entities. The facts are derived from source documents. A synonymous name of an object is determined by identifying a source document from which one or more facts of the entity represented by the object were derived, identifying a plurality of linking documents that link to the source document through hyperlinks, each hyperlink having an anchor text, processing the anchor texts in the plurality of linking documents to generate a collection of synonym candidates for the entity represented by the object, and selecting a synonymous name for the entity represented by the object from the collection of synonym candidates.

    摘要翻译: 存储库包含表示实体的对象。 这些对象还包括有关被表示实体的事实。 事实来自源文件。 通过识别源文档来确定对象的同义名称,从源文档中导出由对象表示的实体的一个或多个事实,通过超链接识别链接到源文档的多个链接文档,每个超链接具有锚文本 处理所述多个链接文档中的所述锚定文本以生成由所述对象表示的所述实体的同义词候选的集合,以及从所述同义词候选的集合中选择由所述对象表示的所述实体的同义名称。

    Resource geotopicality measures
    2.
    发明授权
    Resource geotopicality measures 有权
    资源地理学措施

    公开(公告)号:US08332396B1

    公开(公告)日:2012-12-11

    申请号:US12896059

    申请日:2010-10-01

    IPC分类号: G06F7/00 G06F17/30

    摘要: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for measuring resource geotopicality. In one aspect, a method includes receiving geotokens for a resource, where each geotoken references a geographic location. An initial geotopicality score for a geographic location is computed using token attribute values of geotokens in the resource. A set of geotopical locations are selected for the resource based on the initial geotopicality score for the geographic location. Each geotopical location can be a geographic location for which the initial geotopicality score exceeds a geotopicality threshold. A final geotopicality score representing a measure of relevance for the resource relative to the geotopical location is computed for the geotopical location. Data specifying geotopical locations for the resource and geotopicality scores for the geotopical locations are provided.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于测量资源的地理位置。 在一个方面,一种方法包括接收资源的地理符号,其中每个地理标记引用地理位置。 使用资源中的地理标记的令牌属性值来计算地理位置的初始地理学评分。 基于地理位置的初始地理学评分,为资源选择一组地理位置。 每个地理位置可以是初始地理学评分超过地理学阈值的地理位置。 为地理位置计算代表资源相对于地理位置的相关度的最终地理学评分。 提供了指定地理位置的资源和地理位置得分的地理位置的数据。

    Corroborating facts in electronic documents
    3.
    发明授权
    Corroborating facts in electronic documents 有权
    在电子文件中证实事实

    公开(公告)号:US08954412B1

    公开(公告)日:2015-02-10

    申请号:US11536504

    申请日:2006-09-28

    IPC分类号: G06F17/30

    摘要: A query is defined that has an answer formed of terms from electronic documents. A repository having facts is examined to identify attributes corresponding to terms in the query. The electronic documents are examined to find other terms that commonly appear near the query terms. Hypothetical facts representing possible answers to the query are created based on the information identified in the fact repository and the commonly-appearing terms. These hypothetical facts are corroborated using the electronic documents to determine how many documents support each fact. Additionally, contextual clues in the documents are examined to determine whether the hypothetical facts can be expanded to include additional terms. A hypothetical fact that is supported by at least a certain number of documents, and is not contained within another fact with at least the same level of support, is presented as likely correct.

    摘要翻译: 定义了一个查询,其具有由电子文档中的术语构成的答案。 检查具有事实的存储库以识别与查询中的术语相对应的属性。 检查电子文档以查找通常出现在查询条件附近的其他术语。 基于事实存储库中标识的信息和常见的术语,创建表示查询可能答案的假设事实。 使用电子文件确认这些假设事实,以确定有多少文件支持每个事实。 另外,检查文件中的语境线索以确定假设事实是否可以扩展以包括附加条款。 至少有一定数量的文件支持的假设事实并不包含在具有至少相同级别的支持的另一事实中,可能是正确的。