-
公开(公告)号:US11131992B2
公开(公告)日:2021-09-28
申请号:US16206506
申请日:2018-11-30
申请人: DENSO International America, Inc. , Sriram Subramanian , Sushrut Bhalla , Jaspreet Sambee , Mark Crowley , Sebastian Fischmeister , Donghyun Shin , William Melek , Baris Fidan , Ami Woo , Bismay Sahoo
发明人: Zhiyuan Du , Joseph Lull , Rajesh Malhan , Sriram Subramanian , Sushrut Bhalla , Jaspreet Sambee , Mark Crowley , Sebastian Fischmeister , Donghyun Shin , William Melek , Baris Fidan , Ami Woo , Bismaya Sahoo
摘要: A RLP system for a host vehicle includes a memory and levels. The memory stores a RLP algorithm, which is a multi-agent collaborative DQN with PER algorithm. A first level includes a data processing module that provides sensor data, object location data, and state information of the host vehicle and other vehicles. A second level includes a coordinate location module that, based on the sensor data, the object location data, the state information, and a refined policy provided by the third level, generates an updated policy and a set of future coordinate locations implemented via the first level. A third level includes evaluation and target neural networks and a processor that executes instructions of the RLP algorithm for collaborative action planning between the host and other vehicles based on outputs of the evaluation and target networks and to generate the refined policy based on reward values associated with events.