- 专利标题: REINFORCEMENT LEARNING TECHNIQUES FOR SELECTING A SOFTWARE POLICY NETWORK AND AUTONOMOUSLY CONTROLLING A CORRESPONDING SOFTWARE CLIENT BASED ON SELECTED POLICY NETWORK
-
申请号: US18128092申请日: 2023-03-29
-
公开(公告)号: US20230237312A1公开(公告)日: 2023-07-27
- 发明人: Victor Carbune , Thomas Deselaers
- 申请人: GOOGLE LLC
- 申请人地址: US CA Mountain View
- 专利权人: GOOGLE LLC
- 当前专利权人: GOOGLE LLC
- 当前专利权人地址: US CA Mountain View
- 主分类号: G06N3/045
- IPC分类号: G06N3/045 ; G06N3/00 ; G06N3/02 ; G10L15/00 ; G10L15/22 ; G10L15/20
摘要:
Techniques are disclosed that enable automating user interface input by generating a sequence of actions to perform a task utilizing a multi-agent reinforcement learning framework. Various implementations process an intent associated with received user interface input using a holistic reinforcement policy network to select a software reinforcement learning policy network. The sequence of actions can be generated by processing the intent, as well as a sequence of software client state data, using the selected software reinforcement learning policy network. The sequence of actions are utilized to control the software client corresponding to the selected software reinforcement learning policy network.
信息查询