Aiming at the low computational efficiency and absence of safety guarantees in existing centralized cooperative control schemes, a swarm-coordination algorithm based on reinforcement learning is first proposed. Single-agent proximal policy optimization is extended to multi-agent interactive environments so that complex cooperation in multi-agent systems can be addressed. Secondly, the cooperative control of vehicles at unsignalized intersections is formulated as a multi-agent reinforcement-learning problem, and a safety-augmented centralized cooperative control method—behavior-restricted proximal policy optimization—is developed. Formal safety verification and behavior restrictions are integrated into the swarm-coordination algorithm, whereby the policy is guided to be iteratively optimized in a safe manner and unsafe driving behaviors are avoided, so that traffic safety in unknown scenarios can be further guaranteed. Finally, simulation experiments are conducted with the Carla platform. It is shown that the incorporation of behavior restrictions causes an 8.06 % loss in traffic efficiency, yet a 100 % safety improvement is achieved. Compared with the representative model predictive control approach, the proposed method reduces the computation time to 1/326 of the original, increases traffic efficiency by 67.0 %, lowers the collision rate from 63.5 % to 0, and improves ride comfort by 26.5 %.
智能网联汽车(Connected and autonomous vehicle,CAV)和高效通信技术(如V2X、5G)的出现,推动了自主交叉口管理(Autonomous intersection management,AIM)的广泛发展[5,6]。AIM旨在有效管理CAVs在无信号交叉口的安全高效通行。当前AIM相关研究可分为基于规则和基于优化的方法。
基于规则的方法通常基于预定义的规则来为CAVs分配优先级或进入交叉口的时间[7-11]。先到先得服务(First come first service,FCFS)[7-9]是一种经典的基于规则的算法,根据车辆到达交叉口的顺序对车辆的通行优先级进行排序。其他方法则基于交通法规或预约系统的固定规则。Lu等[10]提出了一套基于交通安全法规的交叉口车辆引导策略,允许某些车辆抢先或礼让其他车辆,来建立无信号交叉口的通行序列。Zhang等[11]提出了一套基于预约请求的车辆调度规则,以确保高级别的预约请求得到优先响应。在稀疏交通流场景中,相较传统的信号控制,基于规则的方法实现了更高的交叉口吞吐量、通行效率、通行公平性。然而,这种方法也有明显的缺点,随着进入交叉口的车辆增多,通行效率迅速降低。在密集交通流量场景下,如果预约请求失败,车辆必须以更低的速度重新提出请求,导致交通堵塞问题加剧[12]。
DuY, ShangG W, ChaiL G. A coupled vehicle-signal control method at signalized intersections in mixed traffic environment[J]. IEEE Transactions on Vehicular Technology, 2021, 70(3): 2089-2100.
ZhuangWei-chao, DingHao-nan, DongHao-xuan, et al. Learning based eco⁃driving strategy of connected electric vehicle at signalized intersection[J]. Journal of Jilin University(Engineering and Technology Edition), 2023, 53(1): 82-93.
[4]
KhouryJ, KhouryJ, ZoueinG, et al. A practical decentralized access protocol for autonomous vehicles at isolated under-saturated intersections[J]. Journal of Intelligent Transportation Systems, 2019, 23(5): 427-440.
[5]
KarthikeyanP, ChenW, HsiungP. Autonomous intersection management by using reinforcement learning[J]. Algorithms, 2022, 15(9): No.326.
[6]
ChamidethS, TarnebergW, KihlM. A safe and robust autonomous intersection management system using a hierarchical control strategy and V2I communication[J]. IEEE Systems Journal, 2023, 17(1): 50-61.
[7]
AntonioG, Maria-DoloresC. Multi-agent deep reinforcement learning to manage connected autonomous vehicles at tomorrow's intersections[J]. IEEE Transactions on Vehicular Technology, 2022, 71(7): 7033-7043.
[8]
FajardoD, AuT, WallerS T, et al. Automated intersection control[J]. Transportation Research Record: Journal of the Transportation Research Board, 2011, 2259(1): 223-232.
[9]
DresnerK, StoneP. A multiagent approach to autonomous intersection management[J]. Journal of Artificial Intelligent Research, 2008, 31(1): 591-656.
[10]
GregoireJ, BonnabelS, ArnaudD. Optimal cooperative motion planning for vehicles at intersections[J/OL].[2023-11-23].
[11]
LuG, LiL, WangY, et al. A rule based control algorithm of connected vehicles in uncontrolled intersection[C]∥The 17th International IEEE Conference on Intelligent Transportation Systems, Qingdao, China,2014: 115-120.
[12]
ZhangK, ArnaudD, ZhangD, et al. Analysis and modeled design of one state-driven autonomous passing-through algorithm for driverless vehicles at intersections[C]∥The 16th International Conference on Computational Science and Engineering,Sydney, Australia,2013: 751-757.
[13]
ArnaudD. Analysis of reservation algorithms for cooperative planning at intersections[C]∥The 13th International IEEE Conference on Intelligent Transportation Systems,Funchal,Portugal, 2010: 445-449.
[14]
LiN, KolmanovskyI, GirardA, et al. Game theoretic modeling of vehicle interactions at unsignalized intersections and application to autonomous vehicle control[C]∥Annual American Control Conference,Milwaukee, USA,2018: 3215-3220.
[15]
WangH, MengQ, ChenS. Competitive and cooperative behaviour analysis of connected and autonomous vehicles across unsignalised intersections: a game-theoretic approach[J]. Transportation Research Part B: Methodological, 2021, 149: 322-346.
[16]
ElhenawyM, ElberyA A, HassanA A, et al. An intersection game-theory-based traffic control algorithm in a connected vehicle environment[C]∥IEEE 18th International Conference on Intelligent Transportation Systems,Gran Canaria,Spain,2015: 343-347.
[17]
ZhaoW, LiuR, NgoduyD. A bilevel programming model for autonomous intersection control and trajectory planning[J]. Transportmetrica A: Transport Science, 2021, 17(1): 34-58.
[18]
NairS H, GovindarajanV, LinT, et al. Stochastic MPC with multi-modal predictions for traffic intersections[C]∥IEEE 25th International Conference on Intelligent Transportation Systems,Macau,China, 2022: 635-640.
[19]
KamalM A S, ImuraJ, HayakawaT, et al. A vehicle-intersection coordination scheme for smooth flows of traffic without using traffic lights[J]. IEEE Transactions on Intelligent Transportation Systems, 2014, 16(3): 1136-1147.
[20]
FinkM. Implementation of linear model predictive control-tutorial[J/OL].[2023-11-06].
[21]
ZhouM, YuU, QuX. Development of an efficient driving strategy for connected and automated vehicles at signalized intersections: a reinforcement learning approach[J]. IEEE Transactions on Intelligent Transportation Systems, 2020, 20(1): 433-443.
[22]
GuoM, WangP, ChanC Y, et al. A reinforcement learning approach for intelligent traffic signal control at urban intersections[C]∥IEEE Intelligent Transportation Systems Conference, Auckland, New Zealand,2019: 4242-4247.
[23]
GuanY, RenY, LiS, et al. Centralized cooperation for connected and automated vehicles at intersections by proximal policy optimization[J]. IEEE Transactions on Vehicular Technology, 2020, 69(11): 12597-12608.
[24]
WangS, WanQ. Right-turn driving decisions of autonomous vehicles at signal-free intersections [J]. Application Research of Computers, 2023, 40(5): 1468-1472.
[25]
NordfjarnT, Simseloglu, O, RundmoT. Culture related to road traffic safety: a comparison of eight countries using two conceptualizations of culture[J]. Accident Analysis and Prevention, 2014, 62: 319-328.
[26]
ZhengJ, ZhuK, WangR. Deep reinforcement learning for autonomous vehicles collaboration at unsignalized intersections[C]∥IEEE Global Communications Conference, Rio de Janeiro,Brazil, 2022: 1115-1120.
[27]
TehY W, BapstV, CzarneckiW M, et al. Distral: robust multitask reinforcement learning[C]∥Proceedings of the 31st International Conference on Neural Information Processing Systems,Long Beach,USA,2017: 4499-4509.