REPARTITIONING DATA IN A DISTRIBUTED COMPUTING SYSTEM

    公开(公告)号:US20180196618A1

    公开(公告)日:2018-07-12

    申请号:US15914012

    申请日:2018-03-07

    申请人: Google Inc.

    IPC分类号: G06F3/06 G06F13/28

    摘要: Methods, systems, and apparatus for allocating, by a source of one or more sources, a segment of a data file of a transient memory for exclusive access by the source, the transient memory being a distributed in-memory file system that supports remote direct memory access; writing, by the source, data from an initial partition to one or more blocks within the allocated segment of the data file, wherein a portion of the initial partition is written to a first block of the one or more blocks; publishing, by the source, the segment of the data file of the transient memory to be accessible for reading by one or more sinks; and reading by a particular sink of the one or more sinks, a particular block of the published segment of the data file of the transient memory, wherein the particular block is associated with the particular sink.

    Accessing data on distributed storage systems

    公开(公告)号:US09977760B1

    公开(公告)日:2018-05-22

    申请号:US14138240

    申请日:2013-12-23

    申请人: Google Inc.

    IPC分类号: G06F15/173 G06F17/30

    CPC分类号: G06F15/17331 G06F17/30194

    摘要: A distributed storage system includes memory hosts, a curator, and a cache manager. Each memory host includes remote direct memory accessible non-transitory memory and a network interface controller that is in communication with the memory. The network interface controller services remote direct memory access requests. The curator communicates with the memory hosts, manages striping of a file amongst the memory hosts, and generates a file descriptor for the file that maps the striping of the file to corresponding storage locations on memory hosts. In addition, the cache manager stores copies of file descriptors generated by the curator in the memory of the memory hosts for remote direct memory access by a client in communication with the memory hosts.

    Shared resource management
    7.
    发明授权

    公开(公告)号:US09954958B2

    公开(公告)日:2018-04-24

    申请号:US15010793

    申请日:2016-01-29

    申请人: Red Hat, Inc.

    摘要: Various examples are directed to systems and methods for executing a computing job at a computing node of a distributed computing system. A node controller may receive computing job request data from a system controller. The node controller may send job start data to a shared resource provider. The shared resource provider may mount a shared resource for the computing job indicated by the computing job request data. The node controller may send a heartbeat message to the system controller and receive from the system controller an acknowledgement message referencing the heartbeat message. The system controller may send to the shared resource provider, a job stop command. In response to the job stop command, the shared resource provider may dismount the shared resource.

    System and method for distributed categorization

    公开(公告)号:US09928251B2

    公开(公告)日:2018-03-27

    申请号:US15271900

    申请日:2016-09-21

    申请人: NICE-SYSTEMS LTD.

    IPC分类号: G06F17/30

    摘要: A generic categorization method may include receiving interaction data on a distributed computing system operating on a plurality of computing nodes. The distributed computing system may distribute the received interaction data across the plurality of nodes. On each node, categorization rules may be applied to the interaction data via parallel processing. The results, which may include a category associated with each interaction, may be written to a distributed storage system. A user interface may allow a user to define the categorization rules and schemas of interaction data.