Selecting tags for a document by analyzing paragraphs of the document
    3.
    Type: granted patent
    Status: in force

    Publication No.: US08280892B2

    Publication date: 2012-10-02

    Application No.: US12242984

    Filing date: 2008-10-01

    IPC classification: G06F17/30

    Abstract: In one embodiment, assigning tags to a document includes accessing the document, where the document comprises text units that include words. The following is performed for each text unit: a subset of words of a text unit is selected as candidate tags, relatedness is established among the candidate tags, and certain candidate tags are selected according to the established relatedness to yield a candidate tag set for the text unit. Relatedness between the candidate tags of each candidate tag set and the candidate tags of other candidate tag sets is determined. At least one candidate tag is assigned to the document according to the determined relatedness.
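
    The steps in this abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how candidate words are chosen or how relatedness is measured, so the sketch assumes that candidates are the most frequent non-stopwords of each text unit and that relatedness comes from a hypothetical lookup table supplied by the caller. All function names, parameters, and table values are illustrative assumptions.

```python
import re
from collections import Counter

# Assumption: a tiny stopword list; the abstract does not define one.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "is", "for"}


def tokenize(text):
    """Lowercase the text and drop the stopwords above."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def relatedness(a, b, table):
    """Symmetric relatedness from a hypothetical, caller-supplied table."""
    if a == b:
        return 1.0
    return table.get(frozenset((a, b)), 0.0)


def candidate_tag_set(text_unit, table, per_unit=4, keep=2):
    """Select frequent words as candidates, then keep the most mutually related."""
    candidates = [w for w, _ in Counter(tokenize(text_unit)).most_common(per_unit)]

    def mean_relatedness(word):
        others = [c for c in candidates if c != word]
        if not others:
            return 0.0
        return sum(relatedness(word, c, table) for c in others) / len(others)

    ranked = sorted(candidates, key=lambda w: (-mean_relatedness(w), w))
    return set(ranked[:keep])


def assign_tags(text_units, table, num_tags=2):
    """Score each candidate against the tag sets of the other text units."""
    tag_sets = [candidate_tag_set(unit, table) for unit in text_units]
    scores = {}
    for i, tags in enumerate(tag_sets):
        for tag in tags:
            cross = sum(
                relatedness(tag, other, table)
                for j, other_set in enumerate(tag_sets) if j != i
                for other in other_set
            )
            scores[tag] = scores.get(tag, 0.0) + cross
    return sorted(scores, key=lambda t: (-scores[t], t))[:num_tags]


# Illustrative data only; the relatedness values are made up.
table = {
    frozenset(("solar", "panels")): 0.9,
    frozenset(("solar", "batteries")): 0.8,
    frozenset(("panels", "batteries")): 0.6,
    frozenset(("solar", "power")): 0.7,
    frozenset(("batteries", "power")): 0.5,
}
units = [
    "solar panels and solar batteries and a cat",
    "batteries store solar power for the grid",
]
print(assign_tags(units, table))  # → ['solar', 'panels']
```

    In this toy run, "cat" and "store" are dropped within their text units because they are unrelated to the other candidates, and "solar" scores highest across units because it appears in both candidate tag sets.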

    Selecting Tags For A Document By Analyzing Paragraphs Of The Document
    5.
    Type: patent application
    Status: in force

    Publication No.: US20090094231A1

    Publication date: 2009-04-09

    Application No.: US12242984

    Filing date: 2008-10-01

    IPC classification: G06F17/30, G06F17/27

    Abstract: In one embodiment, assigning tags to a document includes accessing the document, where the document comprises text units that include words. The following is performed for each text unit: a subset of words of a text unit is selected as candidate tags, relatedness is established among the candidate tags, and certain candidate tags are selected according to the established relatedness to yield a candidate tag set for the text unit. Relatedness between the candidate tags of each candidate tag set and the candidate tags of other candidate tag sets is determined. At least one candidate tag is assigned to the document according to the determined relatedness.

    Identifying clusters of words according to word affinities
    6.
    Type: granted patent
    Status: in force

    Publication No.: US08108392B2

    Publication date: 2012-01-31

    Application No.: US12242957

    Filing date: 2008-10-01

    IPC classification: G06F17/30

    CPC classification: G06F17/30734

    Abstract: In one embodiment, identifying clusters of words includes accessing a record that records affinities. An affinity between a first and second word describes a quantitative relationship between the first and second word. Clusters of words are identified according to the affinities. A cluster comprises words that are sufficiently affine with each other. A first word is sufficiently affine with a second word if the affinity between the first and second word satisfies one or more affinity criteria. A clustering analysis is performed using the clusters.
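
    As a rough illustration of "sufficiently affine", the sketch below assumes a single affinity criterion (a minimum threshold) and a greedy pass in which a word joins the first cluster whose every member it is sufficiently affine with. The abstract specifies neither the criterion nor the clustering procedure, so the threshold, the greedy strategy, the affinity record format, and all names here are assumptions.

```python
def sufficiently_affine(a, b, affinities, threshold=0.5):
    """Assumed criterion: the recorded affinity meets a minimum threshold."""
    return affinities.get(frozenset((a, b)), 0.0) >= threshold


def identify_clusters(words, affinities, threshold=0.5):
    """Greedily group words so that all members of a cluster are pairwise affine."""
    clusters = []
    for word in words:
        for cluster in clusters:
            if all(sufficiently_affine(word, m, affinities, threshold) for m in cluster):
                cluster.append(word)
                break
        else:
            # No existing cluster accepts the word, so it starts a new one.
            clusters.append([word])
    return clusters


# Illustrative affinity record; the values are made up.
affinities = {
    frozenset(("solar", "panel")): 0.8,
    frozenset(("wind", "turbine")): 0.7,
    frozenset(("solar", "wind")): 0.3,
}
words = ["solar", "panel", "wind", "turbine", "cat"]
print(identify_clusters(words, affinities))
# → [['solar', 'panel'], ['wind', 'turbine'], ['cat']]
```

    The greedy pass depends on word order; it is used here only because it keeps the pairwise condition easy to see.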

    Identifying Clusters Of Words According To Word Affinities
    7.
    Type: patent application
    Status: in force

    Publication No.: US20090094207A1

    Publication date: 2009-04-09

    Application No.: US12242957

    Filing date: 2008-10-01

    IPC classification: G06F17/30

    CPC classification: G06F17/30734

    Abstract: In one embodiment, identifying clusters of words includes accessing a record that records affinities. An affinity between a first and second word describes a quantitative relationship between the first and second word. Clusters of words are identified according to the affinities. A cluster comprises words that are sufficiently affine with each other. A first word is sufficiently affine with a second word if the affinity between the first and second word satisfies one or more affinity criteria. A clustering analysis is performed using the clusters.

    GEOTAGGING BASED ON SPECIFIED CRITERIA
    8.
    Type: patent application
    Status: under examination (published)

    Publication No.: US20140067801A1

    Publication date: 2014-03-06

    Application No.: US13601706

    Filing date: 2012-08-31

    IPC classification: G06F17/30

    CPC classification: G06F16/29

    Abstract: A method of geotagging based on specified criteria is described. The method may include analyzing a data stream indicating a variable parameter associated with an object to determine data within the data stream satisfying a specified criteria. The method may also include obtaining geospatial information for the object or another object corresponding to a time the data was generated. Relevant data collected at the time the data satisfies the specified criteria may be tagged with the geospatial information. Related systems are also described.
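
    One way to picture this abstract is the sketch below. It assumes the data stream is a list of (time, value) readings, the geospatial information is a time-sorted, non-empty list of (time, latitude, longitude) fixes, and "corresponding to a time" means the fix nearest in time. None of these choices come from the abstract; the data layout, the nearest-fix rule, and all names are hypothetical.

```python
from bisect import bisect_left


def nearest_fix(track, t):
    """Return the fix closest in time to t; track must be non-empty and time-sorted."""
    times = [fix[0] for fix in track]
    i = bisect_left(times, t)
    neighbours = track[max(i - 1, 0): i + 1]
    return min(neighbours, key=lambda fix: abs(fix[0] - t))


def geotag(readings, track, criterion):
    """Tag every reading that satisfies the criterion with the nearest position."""
    tagged = []
    for t, value in readings:
        if criterion(value):
            _, lat, lon = nearest_fix(track, t)
            tagged.append({"time": t, "value": value, "lat": lat, "lon": lon})
    return tagged


# Illustrative data only: temperature readings and a short position track.
readings = [(1, 70.0), (12, 95.5), (19, 99.0)]
track = [(0, 35.0, 139.0), (10, 35.1, 139.1), (20, 35.2, 139.2)]
for row in geotag(readings, track, lambda v: v > 90):
    print(row)
# → {'time': 12, 'value': 95.5, 'lat': 35.1, 'lon': 139.1}
# → {'time': 19, 'value': 99.0, 'lat': 35.2, 'lon': 139.2}
```

    Passing the criterion as a function keeps the sketch neutral about what the specified criteria are; a threshold is used here only as an example.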
