-
公开(公告)号:US20220398283A1
公开(公告)日:2022-12-15
申请号:US17824680
申请日:2022-05-25
Applicant: NVIDIA Corporation
Inventor: Shie Mannor , Assaf Joseph Hallak , Gal Dalal , Steven Tarence Dalton , Iuri Frosio , Gal Chechik
IPC: G06F16/903 , G06F16/901
Abstract: A method for performing a Tree-Search (TS) on an environment is provided. The method comprises generating a tree for a current state of the environment based on a TS policy, determining a corrected TS policy, and determining an action to apply to the environment based on the corrected TS policy. The tree comprises a plurality of nodes including a root node among the plurality of nodes corresponding to the current state of the environment. Each node other than the root node among the plurality of nodes corresponding to an estimated future state of the environment. The plurality of nodes in the tree are connected by a plurality of edges. Each edge among the plurality of edges is associated with an action causing a transition from a first state to a different sate of the environment.