Systems and methods for practical autonomy decision controller

    公开(公告)号:US11107001B1

    公开(公告)日:2021-08-31

    申请号:US16143126

    申请日:2018-09-26

    摘要: A system includes a machine learning engine configured to receive training data including a plurality of input conditions associated with a state space and a plurality of response maneuvers associated with the state space and train a learning system using the training data and a reward function including a plurality of terms associated with a plurality of end state spaces, each term in the plurality of terms defines an end reward value for each end state space. A value function and policy are generated. The value function comprising a plurality of values, wherein each response maneuvers in the plurality of response maneuvers is associated with a value in the plurality of values related to transitioning from the state space to each end state space, the policy indicative of connections between the state spaces, plurality of values, and the respective end reward value for the plurality of end state spaces.