METHOD AND APPARATUS FOR PORTING ENTITY ON REINFORCEMENT LEARNING SYSTEM

    公开(公告)号:US20220164706A1

    公开(公告)日:2022-05-26

    申请号:US17474849

    申请日:2021-09-14

    Abstract: The present invention relates to an apparatus and method for porting any hardware or software entity to a reinforcement learning system. The present invention includes receiving, by a proxy, a message including episode initiation information from an agent interface and delivering the message to an entity interface based on first synchronization; receiving, by the proxy, a message including first observation information from the entity interface and delivering the message to the agent interface based on second synchronization; receiving, by the proxy, a message including action information from the agent interface and delivering the message to the entity interface based on first synchronization; and receiving, by the proxy, a message including second observation information and reward information from the entity interface and delivering the message to the agent interface based on second synchronization.

    APPARATUS AND METHOD FOR ALTRUISTIC SCHEDULING BASED ON REINFORCEMENT LEARNING

    公开(公告)号:US20210168827A1

    公开(公告)日:2021-06-03

    申请号:US17104377

    申请日:2020-11-25

    Inventor: Seung Jae SHIN

    Abstract: The present disclosure relates to an apparatus and method of altruistic scheduling based on reinforcement learning. An altruistic scheduling apparatus according to an embodiment of the present disclosure includes: an external scheduling agent for determining a basic resource share for each process based on information of a resource management system; an internal scheduling agent for determining a basic resource allocation schedule for each process based on information including the basic resource share and a resource leftover based on the basic resource allocation schedule; and a leftover scheduling agent for determining a leftover resource allocation schedule based on information including the resource leftover. According to an embodiment of the present disclosure, it may be expected that reinforcement learning will not only mitigate the diminution of fairness of an altruistic scheduler but also further improve other performance indicators such as completion time and efficiency.

Patent Agency Ranking