APPARATUS, SYSTEM, METHOD AND COMPUTER-IMPLEMENTED STORAGE MEDIA TO IMPLEMENT RADIO RESOURCE MANAGEMENT POLICIES USING MACHINE LEARNING

    公开(公告)号:US20220377614A1

    公开(公告)日:2022-11-24

    申请号:US17712050

    申请日:2022-04-01

    申请人: Intel Corporation

    IPC分类号: H04W28/08 H04W28/02

    摘要: An apparatus of a transmitter computing node n (TX node n) of a wireless network, one or more computer readable media, a system, and a method. The apparatus includes one or more processors to: implement machine learning (ML) based training rounds, each training round including: determining a local action value function Qn(hn, an; θn) corresponding to a value of performing a radio resource management (RRM) action an at a receiving computing node n (RX node n) associated with TX node n using policy parameter θn and based on hn, hn including channel state information at RX node n; and determining, based on an overall action value function Qtot at time t, an estimated gradient of an overall loss at time t for overall policy parameter θt(∇Lt(θt)), wherein Qtot corresponds to a mixing of local action value functions Qi(hi, ai; θi) for all TX nodes i in the network at time t including TX node n; and determine, in response to a determination that ∇Lt(θt) is close to zero for various values of t during training, a trained local action value function Qn,trained to generate a trained action value relating to data communication between TX node n and RX node n.