Watermark-based techniques for change-data-capture

    公开(公告)号:US11860849B2

    公开(公告)日:2024-01-02

    申请号:US17745739

    申请日:2022-05-16

    Applicant: NETFLIX, INC.

    CPC classification number: G06F16/2358 G06F16/2308 G06F16/2365

    Abstract: Various embodiments set forth systems and techniques for concurrent log and dump processing. The techniques include selecting, from a datastore, a chunk comprising one or more rows of data; comparing the one or more rows of data in the chunk with a first set of log events in a change log associated with the datastore, wherein each log event included in the first set of log events occurs after a first log event in the change log and prior to a second log event in the change log; selecting, based on the comparison, one or more non-conflicting rows in the chunk; and transmitting, to an output, one or more log events associated with the one or more non-conflicting rows in the chunk prior to processing a second set of log events in the change log, wherein the second set of log events occur after the second log event.

    CACHE WARMING: AGILITY FOR A STATEFUL SERVICE

    公开(公告)号:US20200174940A1

    公开(公告)日:2020-06-04

    申请号:US16561668

    申请日:2019-09-05

    Applicant: Netflix, Inc.

    Abstract: The disclosed computer-implemented method includes receiving an indication that cache data is to be copied from an originating cluster having a specified number of replica nodes to a destination cluster having an arbitrary number of replica nodes. The method further includes copying the cache data to a cache dump and creating a log that identifies where the cache data is stored in the cache dump. The method further includes copying the cache data from the cache dump to the replica nodes of the destination cluster. The copying includes writing the copied data in a distributed manner, such that at least a portion of the copied data is distributed over each of the replica nodes in the destination cluster. Various other methods, systems, and computer-readable media are also disclosed.

    Techniques for dynamically benchmarking cloud data store systems

    公开(公告)号:US11297138B2

    公开(公告)日:2022-04-05

    申请号:US15394448

    申请日:2016-12-29

    Applicant: NETFLIX, INC.

    Abstract: In various embodiments, a benchmarking engine automatically tests a data store to assess functionality and/or performance of the data store. The benchmarking engine generates data store operations based on dynamically adjustable configuration data. As the benchmarking engine generates the data store operations, the data store operations execute on the data store. In a complementary fashion, as the data store operations execute on the data store, the benchmarking engine generates statistics based on the results of the executed data store operations. Advantageously, because the benchmarking engine adjusts the number and/or type of data store operations that the benchmarking engine generates based on any changes to the configuration data, the workload that executes on the data store may be fine-tuned as the benchmarking engine executes.

    CACHE WARMING: AGILITY FOR A STATEFUL SERVICE

    公开(公告)号:US20210124691A1

    公开(公告)日:2021-04-29

    申请号:US17141217

    申请日:2021-01-04

    Applicant: Netflix, Inc.

    Abstract: The disclosed computer-implemented method includes receiving an indication that cache data is to be copied from an originating cluster having a specified number of replica nodes to a destination cluster having an arbitrary number of replica nodes. The method further includes copying the cache data to a cache dump and creating a log that identifies where the cache data is stored in the cache dump. The method further includes copying the cache data from the cache dump to the replica nodes of the destination cluster. The copying includes writing the copied data in a distributed manner, such that at least a portion of the copied data is distributed over each of the replica nodes in the destination cluster. Various other methods, systems, and computer-readable media are also disclosed.

    Cache warming: agility for a stateful service

    公开(公告)号:US10915455B2

    公开(公告)日:2021-02-09

    申请号:US16561668

    申请日:2019-09-05

    Applicant: Netflix, Inc.

    Abstract: The disclosed computer-implemented method includes receiving an indication that cache data is to be copied from an originating cluster having a specified number of replica nodes to a destination cluster having an arbitrary number of replica nodes. The method further includes copying the cache data to a cache dump and creating a log that identifies where the cache data is stored in the cache dump. The method further includes copying the cache data from the cache dump to the replica nodes of the destination cluster. The copying includes writing the copied data in a distributed manner, such that at least a portion of the copied data is distributed over each of the replica nodes in the destination cluster. Various other methods, systems, and computer-readable media are also disclosed.

    Cache warming: agility for a stateful service

    公开(公告)号:US11347651B2

    公开(公告)日:2022-05-31

    申请号:US17141217

    申请日:2021-01-04

    Applicant: Netflix, Inc.

    Abstract: The disclosed computer-implemented method includes receiving an indication that cache data is to be copied from an originating cluster having a specified number of replica nodes to a destination cluster having an arbitrary number of replica nodes. The method further includes copying the cache data to a cache dump and creating a log that identifies where the cache data is stored in the cache dump. The method further includes copying the cache data from the cache dump to the replica nodes of the destination cluster. The copying includes writing the copied data in a distributed manner, such that at least a portion of the copied data is distributed over each of the replica nodes in the destination cluster. Various other methods, systems, and computer-readable media are also disclosed.

    Watermark-based techniques for change-data-capture

    公开(公告)号:US11334554B2

    公开(公告)日:2022-05-17

    申请号:US17105830

    申请日:2020-11-27

    Applicant: NETFLIX, INC.

    Abstract: Various embodiments set forth systems and techniques for concurrent log and dump processing. The techniques include selecting, from a datastore, a chunk comprising one or more rows of data; comparing the one or more rows of data in the chunk with a first set of log events in a change log associated with the datastore, wherein each log event included in the first set of log events occurs after a first log event in the change log and prior to a second log event in the change log; selecting, based on the comparison, one or more non-conflicting rows in the chunk; and transmitting, to an output, one or more log events associated with the one or more non-conflicting rows in the chunk prior to processing a second set of log events in the change log, wherein the second set of log events occur after the second log event.

    Techniques for warming up a node in a distributed data store

    公开(公告)号:US10749921B2

    公开(公告)日:2020-08-18

    申请号:US15379299

    申请日:2016-12-14

    Applicant: NETFLIX Inc.

    Abstract: In various embodiments, a node manager configures a “new” node as a replacement for an “unavailable” node that was previously included in a distributed data store. First, the node manager identifies a source node that stores client data that was also stored in the unavailable node. Subsequently, the node manager configures the new node to operate as a slave of the source node and streams the client data from the source node to the new node. Finally, the node manager configures the new node to operate as one of multiple masters nodes in the distributed data store. Advantageously, by configuring the node to implement a hybrid of a master-slave replication scheme and a master-master replication scheme, the node manager enables the distributed data store to process client requests without interruption while automatically restoring the previous level of redundancy provided by the distributed data store.

Patent Agency Ranking