Generating content snippets using a tokenspace repository
    21.
    Type: Granted patent
    Status: In force

    Publication No.: US09098501B2

    Publication date: 2015-08-04

    Application No.: US13685581

    Filing date: 2012-11-26

    Applicant: Google Inc.

    Abstract: A search engine server system receives from a client system a search query and identifies a set of documents in accordance with the search query. A content snippet corresponding to content in a respective document of the identified set of documents is generated, the content snippet associated with at least one query term of the one or more query terms in the search query. A response to the search query is returned to the client system, the response including information identifying at least the respective document and including the content snippet. Generating the content snippet includes performing a first decompression operation on first token identifiers, from a compressed document repository, to provide a set of second token identifiers, and performing a second decompression operation on the set of second token identifiers to recover uncompressed content comprising a portion of the respective document.
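
The two decompression operations in this abstract can be pictured with a small sketch. It is only an illustration under stated assumptions: the abstract does not say how either stage is encoded, so the per-document table, the lexicon, and the names `first_decompress`, `second_decompress`, and `snippet` are all hypothetical.

```python
from typing import Dict, List

# Hypothetical lexicon: "second" token identifiers -> token text.
LEXICON: Dict[int, str] = {10: "search", 11: "engine", 12: "returns", 13: "snippets"}

# Hypothetical per-document table: compact "first" identifiers -> second identifiers.
DOC_TABLE: List[int] = [10, 11, 12, 13]


def first_decompress(first_ids: List[int], table: List[int]) -> List[int]:
    """Stage 1 (assumed form): map compact identifiers to second token identifiers."""
    return [table[i] for i in first_ids]


def second_decompress(second_ids: List[int], lexicon: Dict[int, str]) -> List[str]:
    """Stage 2 (assumed form): map second token identifiers to uncompressed tokens."""
    return [lexicon[t] for t in second_ids]


def snippet(first_ids: List[int], query_term: str, window: int = 1) -> str:
    """Recover tokens, then keep a small window around the first query-term hit."""
    tokens = second_decompress(first_decompress(first_ids, DOC_TABLE), LEXICON)
    if query_term not in tokens:
        return ""
    hit = tokens.index(query_term)
    start = max(0, hit - window)
    return " ".join(tokens[start:hit + window + 1])
```

For simplicity the sketch decompresses the whole toy document before trimming; a real system would presumably decompress only the portion it needs.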

    Systems and methods for replicating data
    22.
    Type: Granted patent
    Status: In force

    Publication No.: US09047307B1

    Publication date: 2015-06-02

    Application No.: US13959055

    Filing date: 2013-08-05

    Applicant: GOOGLE INC.

    CPC classification number: H04L67/1095 G06F17/30174 G06F17/30215

    Abstract: A system facilitates the distribution and redistribution of chunks of data among multiple servers. The system may identify servers to store a replica of the data based on at least one of utilization of the servers, prior data distribution involving the servers, and failure correlation properties associated with the servers, and place the replicas of the data at the identified servers. The system may also monitor total numbers of replicas of the chunks available in the system, identify chunks that have a total number of replicas below one or more chunk thresholds, assign priorities to the identified chunks, and re-replicate the identified chunks based substantially on the assigned priorities. The system may further monitor utilization of the servers, determine whether to redistribute any of the replicas, select one or more of the replicas to redistribute based on the utilization of the servers, select one or more of the servers to which to move the one or more replicas, and move the one or more replicas to the selected one or more servers.
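
The re-replication step (find chunks below a replica threshold, assign priorities, re-replicate in priority order) might look roughly like the following. The abstract does not say how priorities are assigned; "fewest remaining replicas first" is an assumption made for this sketch, and `rereplication_order` is a hypothetical name.

```python
from typing import Dict, List


def rereplication_order(replica_counts: Dict[str, int], threshold: int) -> List[str]:
    """Return under-replicated chunk ids, most urgent first.

    Assumed priority rule: a chunk with fewer live replicas is more urgent.
    """
    under = [(count, chunk) for chunk, count in replica_counts.items() if count < threshold]
    under.sort()  # fewest replicas first; ties broken by chunk id
    return [chunk for _, chunk in under]
```

With `{"a": 3, "b": 1, "c": 2, "d": 0}` and a threshold of 3, chunk `a` is left alone and the rest are queued as `d`, `b`, `c`.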

    Collecting Processor Usage Statistics
    23.
    Type: Patent application
    Status: In force

    Publication No.: US20140025810A1

    Publication date: 2014-01-23

    Application No.: US13910052

    Filing date: 2013-06-04

    Applicant: Google Inc.

    Abstract: In accordance with some implementations, a method of collecting statistics about processor usage is disclosed. The method is performed on a data storage server having one or more processors and memory storing one or more programs for execution by the one or more processors. The server system executes one or more processes, wherein each of the one or more first processes is associated with an entity from a group of one or more entities. The server system then receives an interrupt signal at a first predetermined interval. In response to receiving the interrupt signal and for each processor of the one or more processors, the server system interrupts the process currently being executed on the processor. The server system increments the counter associated with the interrupted process. The server system then resumes the interrupted process.
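
The sampling loop in this abstract can be simulated in a few lines. This is a sketch, not the claimed implementation: real interrupts are replaced by a precomputed schedule, and the roll-up from processes to entities is an assumption about how the per-process counters would be used. The names `sample_usage`, `schedule`, and `owner` are hypothetical.

```python
from collections import Counter
from typing import Dict, List, Tuple


def sample_usage(schedule: List[List[str]], owner: Dict[str, str]) -> Tuple[Counter, Counter]:
    """schedule[t][cpu] is the process running on that processor at tick t.

    Each tick stands in for one interrupt signal; each processor contributes
    one sample, credited to whichever process it was executing.
    """
    per_process: Counter = Counter()
    for running in schedule:          # one simulated interrupt per tick
        for process in running:       # one sample per processor
            per_process[process] += 1
    per_entity: Counter = Counter()
    for process, samples in per_process.items():
        per_entity[owner[process]] += samples  # assumed roll-up to the owning entity
    return per_process, per_entity
```

Because every sample costs the same fixed amount of work, the counters approximate each entity's share of processor time without tracing every context switch.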

    METHOD AND SYSTEM FOR DELETING OBSOLETE FILES FROM A FILE SYSTEM
    24.
    Type: Patent application
    Status: In force

    Publication No.: US20130339318A1

    Publication date: 2013-12-19

    Application No.: US13909021

    Filing date: 2013-06-03

    Applicant: Google Inc.

    Abstract: A method for deleting obsolete files from a file system is provided. The method includes: receiving a request to delete a reference to a target file in a file system from a file reference data structure, wherein the file reference data structure includes target file names and reference file names; identifying a reference file name in the file reference data structure, wherein the reference file name includes a file name of the target file; deleting a reference file from the file system, wherein the reference file has the identified reference file name; checking whether the file system includes at least one reference file whose file name matches the file name of the target file; if there is no such reference file in the file system: deleting the target file from the file system; and deleting the file name of the target file from the file reference data structure.
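
The deletion sequence described here resembles reference counting done with marker files. The sketch below models the file system as a set of names and assumes a naming convention (`<target>.ref.<n>`) so that a reference file name "includes" the target name; that convention and the names `REF_MARK` and `delete_reference` are hypothetical.

```python
from typing import Dict, Set

REF_MARK = ".ref."  # hypothetical convention: "<target>.ref.<n>"


def delete_reference(fs: Set[str], ref_table: Dict[str, Set[str]],
                     target: str, ref_name: str) -> bool:
    """Drop one reference; delete the target once no reference file remains.

    fs models the file system as a set of file names. ref_table models the
    file reference data structure (target name -> reference file names).
    Returns True if the target file itself was deleted.
    """
    fs.discard(ref_name)                       # delete the reference file
    ref_table[target].discard(ref_name)
    still_referenced = any(name.startswith(target + REF_MARK) for name in fs)
    if still_referenced:
        return False
    fs.discard(target)                         # no references left: delete the target
    del ref_table[target]                      # and forget its name
    return True
```

Checking the file system itself, rather than trusting only the table, follows the order of steps in the abstract.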

    CONTROLLING DISPLAY OF CONTENT
    25.
    Type: Patent application

    Publication No.: US20180253189A1

    Publication date: 2018-09-06

    Application No.: US13715232

    Filing date: 2012-12-14

    Applicant: Google Inc.

    CPC classification number: G06F3/0481 G06F3/0484 G06F3/04847

    Abstract: A method includes, in one or more processing devices, executing instructions to perform operations that include outputting data for a first graphical user interface comprising a control mechanism for specifying an amount of content from a first content set to include in a second content set. The method also includes receiving, through the control mechanism, information indicative of the amount of content from the first content set to include in the second content set. The method also includes identifying content from the first content set based on a score that is adjusted in accordance with the information, and based on an author of the content being a member of a predefined group of individuals. The method also includes outputting data for a second graphical user interface comprising the second content set, the second content set including the identified content.

    PROCESSING COMPUTATIONAL GRAPHS
    26.
    Type: Patent application

    Publication No.: US20170124452A1

    Publication date: 2017-05-04

    Application No.: US15337744

    Filing date: 2016-10-28

    Applicant: Google Inc.

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a request from a client to process a computational graph; obtaining data representing the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node that represents an operation that receives, as input, an output of an operation represented by the respective first node; identifying a plurality of available devices for performing the requested operation; partitioning the computational graph into a plurality of subgraphs, each subgraph comprising one or more nodes in the computational graph; and assigning, for each subgraph, the operations represented by the one or more nodes in the subgraph to a respective available device in the plurality of available devices for operation.
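
Partitioning a computational graph and assigning subgraphs to devices could be sketched as below. The abstract does not specify a partitioning strategy; cutting a topological order into contiguous, roughly equal slices is an assumption chosen for brevity, and `partition_and_assign` is a hypothetical name. The sketch uses the standard-library `graphlib` module (Python 3.9+).

```python
from graphlib import TopologicalSorter
from typing import Dict, List, Set


def partition_and_assign(preds: Dict[str, Set[str]], devices: List[str]) -> Dict[str, str]:
    """Assign each node (operation) to a device.

    preds maps a node to the nodes whose outputs it consumes. Nodes are put
    in topological order and cut into one contiguous slice per device, so
    each slice is one subgraph (assumed strategy, not taken from the source).
    """
    order = list(TopologicalSorter(preds).static_order())
    size = -(-len(order) // len(devices))  # ceiling division: nodes per subgraph
    return {node: devices[i // size] for i, node in enumerate(order)}
```

For a four-node chain `a -> b -> c -> d` and two devices, `a` and `b` land on the first device and `c` and `d` on the second.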

    System and Method For Analyzing Data Records
    27.
    Type: Patent application
    Status: In force

    Publication No.: US20160342657A1

    Publication date: 2016-11-24

    Application No.: US15226795

    Filing date: 2016-08-02

    Applicant: GOOGLE INC.

    Abstract: A method processes data records. The method partitions the data records into groups and assigns each group to a respective process of a first plurality of processes, which execute in parallel. For each group, the assigned process extracts information from the data records, applies a script with information processing commands applied sequentially to produce intermediate values, stores the intermediate values in a respective intermediate data structure, and updates the status of the group to indicate completion. When the predefined threshold percentage of the data records are completed, the process assigns each group to a respective second process as a backup. When each of the groups has been completed by at least one process (either the original or the backup), the method executes a second plurality of processes to aggregate intermediate values from the intermediate data structures to produce output data. The aggregation includes intermediate values only once for each group.
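
The final rule, that the aggregation includes intermediate values only once per group even when both an original and a backup process finish, can be illustrated with a short sketch. Summation as the aggregate and the name `aggregate_once` are assumptions made for the example.

```python
from typing import Dict, Iterable, Tuple


def aggregate_once(completions: Iterable[Tuple[str, str, int]]) -> int:
    """Sum intermediate values, counting each group exactly once.

    completions yields (group_id, worker, intermediate_value) in completion
    order; the first result seen for a group wins and later duplicates from
    the other worker are ignored.
    """
    first_result: Dict[str, int] = {}
    for group, _worker, value in completions:
        first_result.setdefault(group, value)
    return sum(first_result.values())
```

Without the per-group check, a group finished by both its original and its backup process would be counted twice.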

    Collecting processor usage statistics
    28.
    Type: Granted patent
    Status: In force

    Publication No.: US09298576B2

    Publication date: 2016-03-29

    Application No.: US13910052

    Filing date: 2013-06-04

    Applicant: Google Inc.

    Abstract: In accordance with some implementations, a method of collecting statistics about processor usage is disclosed. The method is performed on a data storage server having one or more processors and memory storing one or more programs for execution by the one or more processors. The server system executes one or more processes, wherein each of the one or more first processes is associated with an entity from a group of one or more entities. The server system then receives an interrupt signal at a first predetermined interval. In response to receiving the interrupt signal and for each processor of the one or more processors, the server system interrupts the process currently being executed on the processor. The server system increments the counter associated with the interrupted process. The server system then resumes the interrupted process.

    Representative Document Selection for a Set of Duplicate Documents
    29.
    Type: Patent application
    Status: Under examination (published)

    Publication No.: US20150026170A1

    Publication date: 2015-01-22

    Application No.: US14510775

    Filing date: 2014-10-09

    Applicant: GOOGLE INC.

    Abstract: Systems and methods are provided for obtaining a plurality of documents. A respective document in the plurality of documents is associated with a score and each document in the plurality of documents is from a different data structure in a plurality of data structures. Each data structure in the plurality of data structures represents a different portion of a document address space. A first document in the plurality of documents is selected in accordance with the score associated with the first document. The first document has a fingerprint that indicates that the first document has substantially identical content to every other document in the plurality of documents. In accordance with the score, the first document is indexed thereby producing an indexed first document. With respect to the plurality of documents, the indexed first document is included in a document index as representative of each document in the plurality of documents.
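
Choosing one representative per set of duplicates can be sketched as follows. Grouping by an exact fingerprint value and preferring the highest score are assumptions made for this illustration; the abstract says only that selection is "in accordance with the score". The name `pick_representatives` is hypothetical.

```python
from typing import Dict, Iterable, Tuple


def pick_representatives(docs: Iterable[Tuple[str, str, float]]) -> Dict[str, str]:
    """docs yields (url, fingerprint, score); returns fingerprint -> chosen url.

    Documents sharing a fingerprint are treated as duplicates, and the one
    with the highest score is kept as the representative to index.
    """
    best: Dict[str, Tuple[str, float]] = {}
    for url, fingerprint, score in docs:
        if fingerprint not in best or score > best[fingerprint][1]:
            best[fingerprint] = (url, score)
    return {fingerprint: url for fingerprint, (url, _score) in best.items()}
```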

    DETERMINING CORRESPONDING TERMS WRITTEN IN DIFFERENT FORMATS
    30.
    Type: Patent application
    Status: In force

    Publication No.: US20140188454A1

    Publication date: 2014-07-03

    Application No.: US14199249

    Filing date: 2014-03-06

    Applicant: Google Inc.

    Abstract: Methods and apparatus consistent with the invention allow a user to submit an ambiguous search query and to receive relevant search results. Queries can be expressed using character sets and/or languages that are different from the character set and/or language of at least some of the data that is to be searched. A translation between these character sets and/or languages can be performed by examining the use of terms in aligned text. Probabilities can be associated with each possible translation. Refinements can be made to these probabilities by examining user interactions with the search results.
