-
1.
公开(公告)号:US09268806B1
公开(公告)日:2016-02-23
申请号:US13952517
申请日:2013-07-26
Applicant: GOOGLE INC.
Inventor: Alex Kesselman
CPC classification number: G06F17/3033 , G06F17/30159 , G06F17/30303 , G06F17/30312 , G06F17/30339 , G06F17/30377
Abstract: According to some implementations, a method of managing database storage is performed for a database storage system having content storage, a content index, and a reference table. The content index identifies content chunks and the reference table stores references to each content chunk. An object is received for storage, which comprises one or more content chunks. The object is identified by a unique object ID. For each content chunk, the method constructs a content hash of the content chunk and looks up the content hash in the content index. When the content hash is not found in the content index, the method stores the content chunk in the content storage and inserts a record into the content index that identifies the location of the content chunk in the content storage. In addition, the method inserts a record into the reference table, which includes the content hash and the object ID.
Abstract translation: 根据一些实现,对具有内容存储,内容索引和参考表的数据库存储系统执行管理数据库存储的方法。 内容索引标识内容块,引用表存储对每个内容块的引用。 接收用于存储的对象,其包括一个或多个内容块。 该对象由唯一的对象ID标识。 对于每个内容块,该方法构造内容块的内容散列,并在内容索引中查找内容哈希。 当在内容索引中没有找到内容散列时,该方法将内容块存储在内容存储器中,并将记录插入到标识内容存储器中的内容块的位置的内容索引中。 此外,该方法将记录插入参考表,其中包括内容散列和对象ID。
-
公开(公告)号:US09819728B2
公开(公告)日:2017-11-14
申请号:US13874381
申请日:2013-04-30
Applicant: Google Inc.
Inventor: Alex Kesselman , Alexandre Drobychev
CPC classification number: H04L67/1097 , G06F3/06 , H04L67/1002
Abstract: A system, computer-readable storage medium storing at least one program, and a computer-implemented method for facilitating deduplication of operations to be performed is presented. An operation to be performed is received. A mapping function is applied to at least one parameter of the operation to produce a mapping value in a target mapping space, the target mapping space being partitioned between target servers in a set of target servers proportional to resource capacities of the target servers in the set of target servers. A target server in the set of target servers whose portion of the target mapping space includes the mapping value is identified. The operation is issued to the target server.
-
公开(公告)号:US20170272503A1
公开(公告)日:2017-09-21
申请号:US13874381
申请日:2013-04-30
Applicant: Google Inc.
Inventor: Alex Kesselman , Alexandre Drobychev
IPC: H04L29/08
CPC classification number: H04L67/1097 , G06F3/06 , H04L67/1002
Abstract: A system, computer-readable storage medium storing at least one program, and a computer-implemented method for facilitating deduplication of operations to be performed is presented. An operation to be performed is received. A mapping function is applied to at least one parameter of the operation to produce a mapping value in a target mapping space, the target mapping space being partitioned between target servers in a set of target servers proportional to resource capacities of the target servers in the set of target servers. A target server in the set of target servers whose portion of the target mapping space includes the mapping value is identified. The operation is issued to the target server.
-
4.
公开(公告)号:US20160170987A1
公开(公告)日:2016-06-16
申请号:US15051612
申请日:2016-02-23
Applicant: GOOGLE INC.
Inventor: Alex Kesselman
IPC: G06F17/30
CPC classification number: G06F17/3033 , G06F17/30159 , G06F17/30303 , G06F17/30312 , G06F17/30339 , G06F17/30377
Abstract: A process manages database storage. The process receives a first object comprising one or more content chunks. The first object is identified by a unique object ID and each content chunk has a unique offset within the first object. For each chunk, the process inserts a record into a reference table. The record includes a content hash and the object ID. The process stores each of the chunks in content storage. Later, the process obtains a request to delete a first chunk from storage. The first chunk has a corresponding first content hash. The process determines whether the reference table includes a reference record corresponding to the first content hash. When the reference table does not include any reference records corresponding to the first content hash, the process deletes the first chunk. When the reference table includes a corresponding reference record, the process does not delete the first chunk.
Abstract translation: 进程管理数据库存储。 该过程接收包括一个或多个内容块的第一对象。 第一个对象由唯一对象ID标识,每个内容块在第一个对象内具有唯一的偏移量。 对于每个块,进程将一个记录插入到一个引用表中。 该记录包括内容散列和对象ID。 该过程将每个块存储在内容存储中。 之后,该进程获得从存储中删除第一个块的请求。 第一个块具有相应的第一个内容散列。 该过程确定参考表是否包括与第一内容散列相对应的参考记录。 当参考表不包括与第一内容散列相对应的任何参考记录时,该过程将删除第一个块。 当参考表包含相应的参考记录时,该进程不会删除第一个块。
-
公开(公告)号:US09747320B2
公开(公告)日:2017-08-29
申请号:US15051612
申请日:2016-02-23
Applicant: GOOGLE INC.
Inventor: Alex Kesselman
IPC: G06F17/30
CPC classification number: G06F17/3033 , G06F17/30159 , G06F17/30303 , G06F17/30312 , G06F17/30339 , G06F17/30377
Abstract: A process manages database storage. The process receives a first object comprising one or more content chunks. The first object is identified by a unique object ID and each content chunk has a unique offset within the first object. For each chunk, the process inserts a record into a reference table. The record includes a content hash and the object ID. The process stores each of the chunks in content storage. Later, the process obtains a request to delete a first chunk from storage. The first chunk has a corresponding first content hash. The process determines whether the reference table includes a reference record corresponding to the first content hash. When the reference table does not include any reference records corresponding to the first content hash, the process deletes the first chunk. When the reference table includes a corresponding reference record, the process does not delete the first chunk.
-
-
-
-