Abstract:
In an embodiment, an electronic device may include a processor that may iteratively simulate candidate control trajectories using upper confidence bound for trees (UCT) to control an environmental control system (e.g., an HVAC system). Each candidate control trajectory may be simulated by selecting a control action at each of a plurality of time steps over a period of time that has the highest upper bound on possible performance using values from previous simulations and predicting a temperature for a next time step of the plurality of time steps that results from applying the selected control action using a thermal model. The processor may determine a value of each candidate control trajectory using a cost function, update the value of each control action selected in each candidate control trajectory, and select a candidate control trajectory with the highest value using UCT to apply to control the environmental control system.
Abstract:
In an embodiment, an electronic device may include a processor that may iteratively simulate candidate control trajectories using upper confidence bound for trees (UCT) to control an environmental control system (e.g., an HVAC system). Each candidate control trajectory may be simulated by selecting a control action at each of a plurality of time steps over a period of time that has the highest upper bound on possible performance using values from previous simulations and predicting a temperature for a next time step of the plurality of time steps that results from applying the selected control action using a thermal model. The processor may determine a value of each candidate control trajectory using a cost function, update the value of each control action selected in each candidate control trajectory, and select a candidate control trajectory with the highest value using UCT to apply to control the environmental control system.
Abstract:
In an embodiment, an electronic device may include a power source configured to provide operational power to the electronic device and a processor coupled to the power source. The processor may be configured to generate temperature predictions using a model of a structure and possible control scenarios, determine a value of the temperature predictions and the respective possible control scenarios using a cost function, the cost function comprising weighted factors related to an error between a setpoint temperature and the temperature predictions, a length of runtime for an environmental control system (e.g., an HVAC system), and a length of environmental control system cycles. The processor may also be configured to select the control scenario with the highest value to apply to control the environmental control system. The control scenarios may be generated using upper confidence bound for trees (UCT).