DEMONSTRATION-DRIVEN REINFORCEMENT LEARNING

    公开(公告)号:US20240412063A1

    公开(公告)日:2024-12-12

    申请号:US18698218

    申请日:2022-10-05

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a reinforcement learning system to select actions to be performed by an agent interacting with an environment to perform a particular task. In one aspect, one of the methods includes obtaining a training sequence comprising a respective training observations at each of a plurality of time steps; obtaining demonstration data comprising one or more demonstration sequences; generating a new training sequence from the training sequence and the demonstration data; and training the goal-conditioned policy neural network on the new training sequence through reinforcement learning.

    Optimizing data center controls using neural networks

    公开(公告)号:US11836599B2

    公开(公告)日:2023-12-05

    申请号:US17331614

    申请日:2021-05-26

    CPC classification number: G06N3/045

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for improving operational efficiency within a data center by modeling data center performance and predicting power usage efficiency. An example method receives a state input characterizing a current state of a data center. For each data center setting slate, the state input and the data center setting slate are processed through an ensemble of machine learning models. Each machine learning model is configured to receive and process the state input and the data center setting slate to generate an efficiency score that characterizes a predicted resource efficiency of the data center if the data center settings defined by the data center setting slate are adopted t. The method selects, based on the efficiency scores for the data center setting slates, new values for the data center settings.

    OPTIMIZING DATA CENTER CONTROLS USING NEURAL NETWORKS

    公开(公告)号:US20200272889A1

    公开(公告)日:2020-08-27

    申请号:US16863357

    申请日:2020-04-30

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for improving operational efficiency within a data center by modeling data center performance and predicting power usage efficiency. An example method receives a state input characterizing a current state of a data center. For each data center setting slate, the state input and the data center setting slate are processed through an ensemble of machine learning models. Each machine learning model is configured to receive and process the state input and the data center setting slate to generate an efficiency score that characterizes a predicted resource efficiency of the data center if the data center settings defined by the data center setting slate are adopted t. The method selects, based on the efficiency scores for the data center setting slates, new values for the data center settings.

Patent Agency Ranking