摘要:
Embodiments for providing critical path analysis of active trace files in a cloud computing environment. A critical path may be identified using a trace of time spans and activities of a plurality of applications, wherein the critical path is a set of activities having time spans free of overlap with other activities.
摘要:
A temperature-aware task scheduling method, system, and computer program product, include determining a change in an operation intensity factor of the GPU from a previous state and modifying the operation intensity factor, in response to the determining the change in the operation intensity factor from the previous state.
摘要:
A temperature-aware task scheduling method, system, and computer program product, includes obtaining a temperature of the GPU and accepting and executing the task to the GPU, in response to the obtaining a temperature of the GPU.
摘要:
A profiling tool identifies a code region with a false sharing potential. A static analysis tool classifies variables and arrays in the identified code region. A mapping detection library correlates memory access instructions in the identified code region with variables and arrays in the identified code region while a processor is running the identified code region. The mapping detection library identifies one or more instructions at risk, in the identified code region, which are subject to an analysis by a false sharing detection library. A false sharing detection library performs a run-time analysis of the one or more instructions at risk while the processor is re-running the identified code region. The false sharing detection library determines, based on the performed run-time analysis, whether two different portions of the cache memory line are accessed by the generated binary code.
摘要:
Transfer learning in machine learning can include receiving a machine learning model. Target domain training data for reprogramming the machine learning model using transfer learning can be received. The target domain training data can be transformed by performing a transformation function on the target domain training data. Output labels of the machine learning model can be mapped to target labels associated with the target domain training data. The transformation function can be trained by optimizing a parameter of the transformation function. The machine learning model can be reprogrammed based on input data transformed by the transformation function and a mapping of the output labels to target labels.
摘要:
A temperature-aware task scheduling method, system, and computer program product for facilitating a task in a multi-graphical processing unit (GPU) environment, includes executing the task to a GPU in the multi-GPU environment based on a thermal characteristic of the GPU as compared to the other GPUs in the multi-GPU environment.
摘要:
A data-centric reduction method, system, and computer program product include configuring a similarity threshold and a correlation threshold for an entire data set from at least two back-end nodes, reducing the entire data set to a reduced data set from the at least two back-end nodes sent to a front-end node by removing data based on the similarity threshold and the correlation threshold, and after the front-end receives the reduced data set, reconstructing the entire data set from the reduced data set using the similarity threshold and correlation threshold.
摘要:
Embodiments for providing direct access to non-volatile memory by a processor. One or more accelerators may be provided, via an application programming interface (“API”), direct access to non-volatile storage independent of a host central processing unit (“CPU”) on a control path or data path to perform a read operation and write operation of data.
摘要:
A temperature-aware task scheduling method, system, and computer program product, includes obtaining a temperature of the GPU and accepting and executing the task to the GPU, in response to the obtaining a temperature of the GPU.
摘要:
A data-centric reduction method, system, and computer program product include configuring a similarity threshold and a correlation threshold for an entire data set from at least two back-end nodes, reducing the entire data set to a reduced data set from the at least two back-end nodes sent to a front-end node by removing data based on the similarity threshold and the correlation threshold, and after the front-end receives the reduced data set, reconstructing the entire data set from the reduced data set using the similarity threshold and correlation threshold.