SYSTEM-LEVEL TUNABLE PARAMETER IDENTIFICATION

    公开(公告)号:US20210081760A1

    公开(公告)日:2021-03-18

    申请号:US16572752

    申请日:2019-09-17

    Abstract: Disclosed are a computer-implemented method, a system, and a computer program product for system-level tunable parameter identification. Performance characteristic data for an application to be tuned can be obtained by one or more processing units. At least one system-level tunable parameter for the application to be tuned can be identified by one or more processing units based on the obtained performance characteristic data for the application to be tuned and a pattern between training performance characteristic data and a set of training system-level parameter-related correlation coefficients. The set of training system-level parameter-related correlation coefficients can be respective correlation coefficients of system-level tunable parameters with respect to at least one performance metric.

    Determining influence of applications on system performance

    公开(公告)号:US11620205B2

    公开(公告)日:2023-04-04

    申请号:US17073504

    申请日:2020-10-19

    Abstract: A computer-implemented method for determining influence of applications on system performance includes collecting, by a processor, for several applications that are executing on a computing system, respective instrumentation data during multiple time-segments. The method further includes determining, for each of the applications, a performance value and a robustness value for each of the time-segments based on the respective instrumentation data. Further, using the performance value and robustness value for each time-segment, multiple health-waveforms are generated, where a health-waveform is generated for each respective application. The method further includes determining, by the processor, an influence-factor of a first application on a second application, the first application and the second application are executing on the computing system. The method further includes adjusting, by the processor, allocation of a computer resource by releasing the computer resource from the first application and allocating the computer resource to the second application based on the influence-factor.

    MULTI-AGENT INFERENCE
    36.
    发明申请

    公开(公告)号:US20220383149A1

    公开(公告)日:2022-12-01

    申请号:US17330099

    申请日:2021-05-25

    Abstract: A computer-implemented method includes determining, by a master node, model update information at least based on a workload related to a task and a resource capacity of a computing environment. The model update information indicates respective model update suggestions for a plurality of inference models configured to perform the task. The method further includes distributing, by the master node, the model update information to a plurality of inference agents in the computing environment. The plurality of inference agents has a plurality of instances of the plurality of inference models executed thereon.

    LOCATE NEURAL NETWORK PERFORMANCE HOT SPOTS

    公开(公告)号:US20220350619A1

    公开(公告)日:2022-11-03

    申请号:US17245042

    申请日:2021-04-30

    Abstract: Embodiments for locating performance hot spots include collecting sample data having instruction addresses, the sample data being for a neural network model and determining instructions in the instruction addresses that are performance hot spots. A listing file is used to map the instructions of the sample data that are performance hot spots to locations in a lower-level intermediate representation. A mapping file is used to map the locations of the lower-level intermediate representation that are performance hot spots to operations in one or more higher-level representations, one or more of the operations corresponding to the performance hot spots, the mapping file being generated from compiling the neural network model.

    UNIFORM ARTIFICIAL INTELLIGENCE MODEL CONVERSION

    公开(公告)号:US20220292390A1

    公开(公告)日:2022-09-15

    申请号:US17197361

    申请日:2021-03-10

    Abstract: Aspects of the invention include converting an artificial intelligence (AI) model generated in a first framework to a uniform exchange formatted model by engaging a master table to retrieve instructions for converting from the AI model to the uniform exchange formatted model in accordance with the first framework. The uniform exchange formatted model in compiled by engaging the master table to retrieve instructions for compiling the uniform exchange formatted model in accordance with the first framework. Data is received as an input to the compiled uniform exchange formatted model and an output is generated by engaging the master table to retrieve instructions for generating the output in accordance with the first framework.

Patent Agency Ranking