-
公开(公告)号:US11891088B1
公开(公告)日:2024-02-06
申请号:US17347088
申请日:2021-06-14
Applicant: Zoox, Inc.
Inventor: Marin Kobilarov , Jefferson Bradfield Packer , Gowtham Garimella , Andreas Pasternak , Yiteng Zhang , Ruikun Yu
CPC classification number: B60W60/0015 , G06N20/00 , G07C5/008 , G07C5/0808 , B60W2050/0075 , B60W2554/404 , B60W2556/45
Abstract: A reward determined as part of a machine learning technique, such as reinforcement learning, may be used to control an adversarial agent in a simulation such that a component for controlling motion of the adversarial agent is trained to reduce the reward. Training the adversarial agent component may be subject to one or more constraints and/or may be balanced against one or more additional goals. Additionally or alternatively, the reward may be used to alter scenario data so that the scenario data reduces the reward, allowing the discovery of difficult scenarios and/or prospective events.