Processing of document metadata for use as query suggestions
    4.
    Type: Granted invention patent
    Status: In force

    Publication number: US09195706B1

    Publication date: 2015-11-24

    Application number: US13782770

    Filing date: 2013-03-01

    Applicant: Google Inc.

    CPC classification number: G06F17/30395 G06F17/3064

    Abstract: Methods, systems and apparatus are described herein that include obtaining metadata within a document, where the metadata comprises a sequence of terms. Tags are assigned to terms in the sequence of terms based at least in part on grammatical relationships between the terms, thereby forming a corresponding sequence of tags. A determination is made that the sequence of terms is grammatically correct based at least in part on tags within the corresponding sequence of tags. In response to the determination, the sequence of terms is stored as a query suggestion.
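    Illustrative sketch (not taken from the patent): the flow in the abstract above can be pictured as tagging each metadata term, checking the resulting tag sequence against patterns treated as grammatical, and storing the term sequence as a query suggestion when the check passes. The lexicon, the tag names, and the pattern set are hypothetical, and a plain dictionary lookup stands in for a real tagger that would use the grammatical relationships between terms.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy lexicon mapping terms to part-of-speech tags.
# A real tagger would also use the context of neighbouring terms.
LEXICON: Dict[str, str] = {
    "cheap": "ADJ", "best": "ADJ",
    "flights": "NOUN", "hotels": "NOUN", "paris": "NOUN",
    "to": "PREP", "in": "PREP",
    "find": "VERB",
}

# Hypothetical tag sequences that this sketch treats as grammatical.
ALLOWED_PATTERNS: Set[Tuple[str, ...]] = {
    ("NOUN",),
    ("ADJ", "NOUN"),
    ("NOUN", "PREP", "NOUN"),
    ("ADJ", "NOUN", "PREP", "NOUN"),
    ("VERB", "ADJ", "NOUN"),
}


def assign_tags(terms: List[str]) -> List[str]:
    """Form the tag sequence that corresponds to the term sequence."""
    return [LEXICON.get(term.lower(), "UNK") for term in terms]


def is_grammatical(tags: List[str]) -> bool:
    """Decide grammaticality from the tag sequence alone."""
    return tuple(tags) in ALLOWED_PATTERNS


def process_metadata(metadata: str, suggestions: List[str]) -> bool:
    """Store the metadata as a query suggestion if its tags pass the check."""
    terms = metadata.split()
    if is_grammatical(assign_tags(terms)):
        suggestions.append(metadata)
        return True
    return False


if __name__ == "__main__":
    store: List[str] = []
    for title in ["cheap flights to Paris", "to flights cheap"]:
        print(title, "->", process_metadata(title, store))
    print(store)
```

    In this sketch only the first title is stored, because its tag sequence matches one of the allowed patterns; the second is rejected.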


    Providing images of named resources in response to a search query
    7.
    Type: Granted invention patent
    Status: In force

    Publication number: US09189554B1

    Publication date: 2015-11-17

    Application number: US14702851

    Filing date: 2015-05-04

    Applicant: Google Inc.

    Abstract: Systems, computer program products, apparatus, and methods are described that perform operations including receiving a search query that includes a name, receiving multiple resources that have been identified by a search engine as best satisfying the search query, wherein the identified multiple resources include a resource including a plurality of images. The operations include identifying an image of the plurality of images displaying a face of the person. The image is identified based on a description associated with the image. The description is based at least in part on one or more resources included in the search results. The operations further include providing the identified image with the search results. The search results are provided as a plurality of links. Each link identifies a corresponding resource of the identified plurality of resources.
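    Illustrative sketch (not taken from the patent): one way to picture the operations in the abstract above is to scan the images in the returned resources, pick the first image whose associated description mentions the queried name, and return it alongside one link per resource. The `Image` and `Resource` classes, the function names, and the substring match are hypothetical; the sketch assumes descriptions are already available and does not model face detection or how a description is derived from other resources.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Image:
    url: str
    description: str  # assumed to be precomputed text about the image


@dataclass
class Resource:
    url: str
    images: List[Image] = field(default_factory=list)


def find_image_for_name(name: str, resources: List[Resource]) -> Optional[Image]:
    """Return the first image whose description contains every term of the name."""
    name_terms = name.lower().split()
    for resource in resources:
        for image in resource.images:
            description = image.description.lower()
            if all(term in description for term in name_terms):
                return image
    return None


def build_response(name: str, resources: List[Resource]) -> Tuple[List[str], Optional[Image]]:
    """Provide one link per resource, plus the identified image if any."""
    links = [resource.url for resource in resources]
    return links, find_image_for_name(name, resources)


if __name__ == "__main__":
    results = [
        Resource("https://example.com/a", [
            Image("https://example.com/a/1.jpg", "Company logo"),
            Image("https://example.com/a/2.jpg", "Portrait of Jane Doe at a conference"),
        ]),
        Resource("https://example.com/b"),
    ]
    links, image = build_response("Jane Doe", results)
    print(links)
    print(image.url if image else "no image found")
```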


    Detecting document text that is hard to read
    8.
    Type: Granted invention patent
    Status: In force

    Publication number: US08990224B1

    Publication date: 2015-03-24

    Application number: US13674320

    Filing date: 2012-11-12

    Applicant: Google Inc.

    CPC classification number: G06F7/00 G06F17/214 G06F17/30 G06Q10/10 G06Q30/0201

    Abstract: A computer system is configured to determine portions of text extracted from a corresponding group of documents; process a particular portion of text by a set of filters, where the particular portion of text may correspond to a particular document, and where each of the filters may generate a respective score based on processing the particular portion of text; calculate a readability score based on the respective scores generated by the filters; determine that the readability score satisfies a threshold score; and generate or select a new portion of text, for the particular document, based on determining that the readability score satisfies the threshold score.
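    Illustrative sketch (not taken from the patent): the pipeline in the abstract above can be pictured as a set of filters that each score a portion of text, an aggregate readability score, and a threshold test that triggers selection of a replacement. The two filters, the averaging step, and the convention that a higher score means harder to read (so the threshold is satisfied when the score is at or above it) are all assumptions made for this example.

```python
from typing import Callable, Sequence

Filter = Callable[[str], float]


def long_word_filter(text: str) -> float:
    """Hypothetical filter: fraction of words longer than 12 characters."""
    words = text.split()
    if not words:
        return 1.0
    return sum(len(word) > 12 for word in words) / len(words)


def symbol_filter(text: str) -> float:
    """Hypothetical filter: fraction of characters that are not letters, digits, or spaces."""
    if not text:
        return 1.0
    return sum(not (ch.isalnum() or ch.isspace()) for ch in text) / len(text)


def readability_score(text: str, filters: Sequence[Filter]) -> float:
    """Combine the per-filter scores; a plain mean is assumed here."""
    return sum(f(text) for f in filters) / len(filters)


def choose_text(candidates: Sequence[str], filters: Sequence[Filter], threshold: float) -> str:
    """Keep the first candidate unless its score satisfies the threshold,
    in which case select the candidate with the lowest score."""
    current = candidates[0]
    if readability_score(current, filters) < threshold:
        return current
    return min(candidates, key=lambda text: readability_score(text, filters))


if __name__ == "__main__":
    filters = [long_word_filter, symbol_filter]
    snippets = ["@@## $$%% ^^&& **((", "A short plain summary of the page"]
    print(choose_text(snippets, filters, threshold=0.3))
```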

