- 专利标题: Multi-level collaborative control system with dual neural network planning for autonomous vehicle control in a noisy environment
-
申请号: US16206506申请日: 2018-11-30
-
公开(公告)号: US11131992B2公开(公告)日: 2021-09-28
- 发明人: Zhiyuan Du , Joseph Lull , Rajesh Malhan , Sriram Subramanian , Sushrut Bhalla , Jaspreet Sambee , Mark Crowley , Sebastian Fischmeister , Donghyun Shin , William Melek , Baris Fidan , Ami Woo , Bismaya Sahoo
- 申请人: DENSO International America, Inc. , Sriram Subramanian , Sushrut Bhalla , Jaspreet Sambee , Mark Crowley , Sebastian Fischmeister , Donghyun Shin , William Melek , Baris Fidan , Ami Woo , Bismay Sahoo
- 申请人地址: US MI Southfield; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo
- 专利权人: DENSO International America, Inc.,Sriram Subramanian,Sushrut Bhalla,Jaspreet Sambee,Mark Crowley,Sebastian Fischmeister,Donghyun Shin,William Melek,Baris Fidan,Ami Woo,Bismay Sahoo
- 当前专利权人: DENSO International America, Inc.,Sriram Subramanian,Sushrut Bhalla,Jaspreet Sambee,Mark Crowley,Sebastian Fischmeister,Donghyun Shin,William Melek,Baris Fidan,Ami Woo,Bismay Sahoo
- 当前专利权人地址: US MI Southfield; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo; CA Waterloo
- 代理机构: Harness, Dickey & Pierce, P.L.C.
- 主分类号: G05D1/00
- IPC分类号: G05D1/00 ; G05D1/02 ; G06N3/04 ; G06N3/08
摘要:
A RLP system for a host vehicle includes a memory and levels. The memory stores a RLP algorithm, which is a multi-agent collaborative DQN with PER algorithm. A first level includes a data processing module that provides sensor data, object location data, and state information of the host vehicle and other vehicles. A second level includes a coordinate location module that, based on the sensor data, the object location data, the state information, and a refined policy provided by the third level, generates an updated policy and a set of future coordinate locations implemented via the first level. A third level includes evaluation and target neural networks and a processor that executes instructions of the RLP algorithm for collaborative action planning between the host and other vehicles based on outputs of the evaluation and target networks and to generate the refined policy based on reward values associated with events.
公开/授权文献
信息查询