摘要:
A generalized approach to particle interaction can confer advantages over previously described method in terms of one or more of communications bandwidth and latency and memory access characteristics. These generalizations can involve one or more of at least spatial decomposition, import region rounding, and multiple zone communication scheduling. An architecture for computation of particle interactions makes use various forms of parallelism. In one implementation, the parallelism involves using multiple computation nodes arranged according to a geometric partitioning of a simulation volume.
摘要:
A computer-implemented method for determining computational units for computing interactions among sets of bodies located in a computation region includes, for each computation associated with one of the sets of bodies, determining, according to an assignment rule that provides a mapping from a location of each of the bodies to a determined computation unit from the plurality of computation units, a computation unit from a plurality of computation units for performing the computation.
摘要:
A generalized approach to particle interaction can confer advantages over previously described method in terms of one or more of communications bandwidth and latency and memory access characteristics. These generalizations can involve one or more of at least spatial decomposition, import region rounding, and multiple zone communication scheduling. An architecture for computation of particle interactions makes use various forms of parallelism. In one implementation, the parallelism involves using multiple computation nodes arranged according to a geometric partitioning of a simulation volume.
摘要:
A method for performing computations associated with bodies located in a computation region includes, for each subset of multiple subsets of the computations, performing the computations in that subset of computations, including accepting data of bodies located in each of a plurality of import regions associated with the subset of the computations, the import regions being parts of the computation region; for each combination of a predetermined plurality of combinations of multiple of the import regions, performing computations associated with sets of bodies, wherein for each of the sets of bodies, at least one body of the set is located in each import region of the combination.
摘要:
Distributed computation of multiple body interactions in a region uses multiple processing modules, where each of the processing modules is associated with a respective corresponding portion of the region. In some examples, the approach includes establishing multiple coordinate frames of reference, each processing module corresponding to one the coordinate frames of reference. In some examples, efficient techniques are used for selecting elements for computation of interactions according at least in part to a separation-based criterion.
摘要:
Distributed computation of multiple body interactions in a region uses multiple processing modules, where each of the processing modules is associated with a respective corresponding portion of the region. In some examples, the approach includes establishing multiple coordinate frames of reference, each processing module corresponding to one the coordinate frames of reference. In some examples, efficient techniques are used for selecting elements for computation of interactions according at least in part to a separation-based criterion.
摘要:
Distributed computation of multiple body interactions in a region uses multiple processing modules, where each of the processing modules is associated with a respective corresponding portion of the region. In some examples, the approach includes establishing multiple coordinate frames of reference, each processing module corresponding to one the coordinate frames of reference. In some examples, efficient techniques are used for selecting elements for computation of interactions according at least in part to a separation-based criterion.
摘要:
Distributed computation of multiple body interactions in a region uses multiple processing modules, where each of the processing modules is associated with a respective corresponding portion of the region. In some examples, the approach includes establishing multiple coordinate frames of reference, each processing module corresponding to one the coordinate frames of reference. In some examples, efficient techniques are used for selecting elements for computation of interactions according at least in part to a separation-based criterion.
摘要:
A method for dynamics simulation involves maintaining quantities according to a floating point binary format quantized to a first precision lower than the precision supported by the floating point format. For example, although an IEEE floating point number can represent numbers with a precision of one part in 2^24, the quantities are quantized to a lower precision, such as one part in 2^22. Operations are applied to sets of the quantities by quantizing the intermediate results of the operations to the lower precision than the precision supported by the floating point format.
摘要:
An improved constraint approach reduces the energy drift rate to acceptable levels. In an embodiment of this approach, massively parallel constrained velocity Verlet NVE (constant particle number, constant volume, constant energy) MD simulations can be run using single precision arithmetic with very low energy drift (e.g., ˜1 Kelvin per microsecond simulated time) using large timesteps (e.g., 2.5 fs) for typical systems and MD force fields.