-
公开(公告)号:US20250095487A1
公开(公告)日:2025-03-20
申请号:US18404077
申请日:2024-01-04
Inventor: Min Hae KWON , Chan In EOM , Dong Su LEE
IPC: G08G1/0967 , B60W30/14 , B60W60/00 , G08G1/01
Abstract: A traffic control method and apparatus with an autonomous vehicle based on adaptive reward. A traffic control apparatus for an autonomous vehicle based on adaptive reward comprises an information observation unit that collects observation information from a sensing module of an autonomous vehicle or a roadside unit (RSU); a policy execution unit that decides on an action including adjusting acceleration and changing lanes of the autonomous vehicle based on the observation information and policy; and a reward determination unit that determines reward according to observation information at a next timestep according to the decision made, wherein reward in the reward determination unit includes penalty in an event of an accident and reward when driving, wherein the reward when driving includes an adaptive target speed reward term, a successful lane change reward term, and a safety distance compliance reward term that are adaptively determined according to road traffic.
-
2.
公开(公告)号:US20240242596A1
公开(公告)日:2024-07-18
申请号:US18219628
申请日:2023-07-07
Inventor: Min Hae KWON , Chan In EOM , Dong Su LEE
IPC: G08G1/01 , G08G1/0967
CPC classification number: G08G1/0116 , G08G1/096725
Abstract: Provided is a method and an apparatus for determining a vehicle behavior, and more specifically, to a method and an apparatus for determining a vehicle behavior for bottleneck congestion control in a bottleneck section. Tn apparatus for determining a vehicle behavior may include an information collection unit collecting surrounding information of a target driving vehicle from a road side unit (RSU), a vehicle observation unit obtaining observation information based on the target driving vehicle from a sensing module mounted on the target driving vehicle, a reward determination unit determining a reward for the target driving vehicle through a reward function which uses the surrounding information and the observation information, a model training unit updating and training a decision making model through the reward, and a behavior determination unit determining a behavior of the target driving vehicle by inputting the observation information into the decision making model.
-