摘要:
In the field of distributed computing, the communication that needs to be performed between the computing nodes or helper nodes, poses a major performance bottleneck. In order to reduce the time associated with this so-called shuffling phase of inter-node communication, it was recently proposed that the computational redundancy across the nodes be exploited to allow - via a specific type of coded communications - that each transmission becomes useful to more than one node at a time, thus significantly reducing the overall communication time. But the authors in this patent application here, have identified as a main bottleneck of such coded' current approaches, the fact that the input dataset to be processed for a given job, has to be split into an exponentially large number of smaller subsets, which is often impossible, thus significantly constraining the speedup gains attributed to coded communication. The inventors here have invented a new method of assigning data to the computing nodes, which dramatically alleviates the aforementioned bottleneck, thus resulting in dramatic reduction to the overall execution time of the distributed computing job.
摘要:
Disclosed are various embodiments for a parallel priority queue implemented on one or more many-core processors and/or multi-core processors such as those in general-purpose graphics processing units (GPGPUs). According to various embodiments, a priority may be determined according to a timestamp of an item, such as an event or an entry, in a priority queue. A priority queue interface may comprise functions to insert and remove entries from the priority queue. Priority order of the entries may be maintained as the entries are inserted and removed from the queue.
摘要:
Systems and techniques are disclosed that include in one aspect a computer implemented method storing a received stream of data elements in a buffer, applying a boundary condition to the data elements stored in the buffer after receiving each individual data element of the stream of data elements, and producing one or more data elements from the buffer based on the boundary condition as an output stream of data elements sorted according to a predetermined order.
摘要:
There is provided a method of querying a data structure (100), the data structure comprising a plurality of records (T1 - T7) and a plurality of data items (R, G, B, Y, O), each record comprising an index identifying the record and a collection of paths (P1 - P5) pointing to a selection of the data items. Each path of a record corresponds to a respective data variable, the value of the data variable being the data item pointed to by the path. The method comprises receiving (200) a query to sort a sub-set of the records according to a data variable; creating (210) a first array; defining (220) start points in the first array for respective data items of the first data variable according to the order in which the data items are to be sorted; for each record of the sub-set, storing (230) the index of the record after the start point corresponding to the data item of the first data variable of the record; determining (240) an end point for each start point; determining (250) which locations in each sequence of locations within the first array where indexes to the records have been stored based on the start points and the end points; and returning (250) the determined locations to designate where the record indexes corresponding to the respective data items may be found.
摘要:
There is shown and described a definition for defining searching, filtering and sorting requests which seeks to provide a more succinct and efficient language in order to minimize transmission size and parsing/processing requirements while facilitating common and useful search/filter/sort contingencies. The language allows for filtering a data set by comparing a single field of all instances of the data against a static or dynamic value, then, optionally, to order the result set in ascending or descending order. All instances in the data set may be obtained without searching/filtering.
摘要:
A multiple key self-sorting table. The table includes a plurality of intelligent self-sorting modules. Each module of the plurality of modules having an entry with at least one value. Each module making a decision to perform one of a plurality of acts, and preferably four acts, when there is an addition or deletion of an entry to the table; it holds its current value, stores a new entry, takes an entry from its immediately adjacent module with a higher value or takes an entry from its immediately adjacent module with a lower value; and performing the decision. A method for sorting a table. The method includes the steps of making a decision by each module, having an entry with at least one value, of a plurality of modules intelligent self-sorting modules to perform one of a plurality of acts, and preferably four acts, when there is an addition or deletion of an entry to the table; holding its current value, storing a new entry, taking an entry from its immediately adjacent module with a higher value or taking an entry from its immediately adjacent module with a lower value. There is the step of performing the decision.