Survey
Survey
Survey
Abstract
Efficient traffic signal control is very crucial for managing vehicular flow and
alleviating congestion in urban areas. This survey paper presents a thorough
examination of diverse techniques employed in traffic signal control. It begins
by exploring traditional methods like fixed-time control and actuated control,
progressing to advanced intelligent systems incorporating reinforcement learning,
support vector machine and neural network algorithms. The survey encompasses
a broad spectrum of approaches, spanning classical optimization techniques to
the most advanced machine learning and deep learning methods. Each method’s
principles, advantages, and limitations are discussed, alongside their practical
applications in real-world traffic management scenarios. Furthermore, the paper
investigates recent advancements and emerging trends in traffic signal control,
such as connected and autonomous vehicles, cooperative control systems, and
adaptive signal optimization strategies. By synthesizing insights from existing lit-
erature and highlighting current research directions, this survey paper will help in
transportation engineering and urban planning. It offers comprehensive insights
into the complexities and opportunities in traffic signal control, facilitating the
development of innovative and sustainable solutions to tackle urban mobility
challenges.
1 Introduction
Urbanization has led to a proliferation of vehicular traffic on roads, encompassing
diverse categories such as two-wheelers, three-wheelers, and four-wheelers. This surge
1
in traffic has given rise to a pervasive issue: congestion[1][2]. It is a ubiquitous problem
in urban areas, impeding the smooth flow of daily activities for countless individ-
uals. The congestion caused by the multitude of vehicles on the road often brings
daily life to a standstill. Commuters find themselves trapped in long queues of vehi-
cles, their progress slowed to a crawl. This impediment not only wastes time but
also adds stress and frustration to people’s lives. Many techniques have been used to
detect traffic[3][4][5][6] and manage it but a lot of shortcomings have been noticed.[7]
Furthermore, the gridlock negatively impacts the environment, as engines idle and
emissions increase. This leads to air pollution[8], further exacerbating the health
concerns[9] already present in densely populated urban areas. Efforts to mitigate traf-
fic congestion[10] are ongoing, with various strategies being explored. These include
improvements in public transportation, the development of alternative transportation
modes such as cycling lanes[11][12],pedestrian lights[13] and the implementation of
smart traffic management systems. An effective traffic signal control system seeks to
minimize vehicle congestion at intersections, thereby reducing fuel consumption and
optimizing time utilization. By efficiently managing the flow of traffic, these systems
aim to mitigate delays and waiting times, enhancing overall road efficiency and safety.
Through the strategic coordination of traffic signals, the system endeavors to main-
tain a steady and balanced movement of vehicles, preventing gridlock and bottlenecks.
This not only alleviates frustration among drivers but also contributes to a reduc-
tion in carbon emissions and environmental pollution associated with idling vehicles.
Moreover, a well-designed traffic signal control system can enhance the overall func-
tionality of urban transportation networks. By synchronizing signals and adapting to
real-time traffic conditions, it enables smoother and more predictable travel routes for
commuters, businesses, and emergency services alike.
To achieve these objectives, modern traffic control systems often incorporate
advanced technologies such as real-time data analytics, adaptive signal control algo-
rithms, and communication networks. These innovations enable dynamic adjustments
to signal timings based on traffic volume, flow patterns, and priority demands, opti-
mizing the utilization of road infrastructure and enhancing the quality of urban life.
In essence, the implementation of an efficient traffic signal control system plays a vital
role in promoting sustainable and resilient urban mobility, fostering economic pro-
ductivity, and improving the overall well-being of communities. A prevalent approach
in modern traffic management is the implementation of static traffic light control
systems. These systems rely on historical traffic congestion data to regulate signal
timings. However, this method is inherently limited as it does not allow for real-time
adjustments. Consequently, the fixed signal timings may not effectively address the
dynamic nature of traffic congestion prevalent in contemporary urban environments.
As a result, there is a growing recognition of the need for more adaptive and respon-
sive traffic control solutions to better manage congestion and optimize traffic flow in
today’s bustling cities.
Numerous promising traffic signal control and traffic management techniques have
been proposed, showing potential for significant improvements upon implementation.
In the forthcoming sections of the paper, we will delve into some of the latest tech-
nologies that hold promise in revolutionizing traffic management. These innovations
2
Table 1 Inclusion Exclusion Criterion
offer the possibility of addressing the complexities of modern traffic challenges more
effectively, paving the way for more efficient and sustainable urban mobility solutions.
3
Fig. 1 Inclusion Exclusion Criteria
4
Fig. 2 Techniques used in Traffic Signal Controls
the day or in response to incidents such as accidents or road construction. One of the
main drawbacks of fixed-timer systems is their inability to optimize traffic flow effi-
ciently, especially during peak hours or in areas with fluctuating traffic patterns. This
can lead to congestion, delays, and increased fuel consumption for vehicles waiting
at red lights. Despite their limitations, fixed-timer traffic signal control systems are
still widely used in many regions, particularly in areas with relatively stable traffic
conditions or limited resources for more advanced signal management systems. Due
to these shortcomings the new techniques for more adaptive traffic signal controlling
were introduced.
5
[17][18]Key to this learning process are value functions, which estimate the poten-
tial long-term rewards of states or state-action pairs. This estimation helps the agent
make informed decisions by evaluating the future benefits of its current actions. RL
involves a crucial balance between exploration and exploitation: exploration involves
trying new actions to discover their effects, while exploitation leverages known actions
that have previously yielded high rewards.
Common RL algorithms include Q-learning, which focuses on learning the value of
action-reward pairs, and policy gradient methods, which directly optimize the policy.
More advanced techniques, like Deep Reinforcement Learning, combine neural net-
works with RL to handle more complex and high-dimensional environments. Overall,
RL is widely used in various applications, from game playing to autonomous driving,
where decision-making is critical.
Reinforcement Learning algorithms often rely on exploration-exploitation strate-
gies to balance between exploiting known information and exploring new actions.
Techniques like epsilon-greedy exploration and Boltzmann exploration are commonly
employed to address this trade-off.
6
Table 2 Reinforcement Learning Based work summary
privacy of individual travelers. Their approach aims to shield sensitive traveler infor-
mation, such as habitual destinations and travel patterns, from potential exposure. To
accomplish this, they devised a comprehensive defense mechanism to counter various
types of attacks, including collusion among CVs, database breaches, and interference
attempts. Furthermore, they developed a refined traffic signal optimization method-
ology, complemented by an arrival rate estimator leveraging aggregated CV data.
This multifaceted approach exhibits adaptability to diverse traffic scenarios, ensur-
ing effective traffic management across different conditions. By amalgamating privacy
preservation measures with signal control optimization, Tan and Yang’s framework not
7
only enhances traffic flow efficiency but also underscores the significance of safeguard-
ing CV data confidentiality, thereby fostering a safer and more secure transportation
ecosystem.
[22] Zhao et al. present an innovative model named Traffic Signal Control based on Het-
erogeneous Graph Attention Mechanism combined with Deep Reinforcement Learning
(TSC-HGAM-DRL). This model fuses a DRL with GNN to improve traffic manage-
ment and decision-making. It features a dual attention-based GNN combined with
deep reinforcement learning methods, including DDQN and DuelingDQN, to address
and reduce overfitting problems during training. Simulation experiments were con-
ducted using the SUMO platform in conjunction with a deep learning development
framework. The algorithm’s performance was rigorously evaluated using these metrics:
Average Reward and Queue Length. During peak traffic flow scenarios, the algorithm
demonstrated significant improvements over existing methods. Specifically, it achieved
an average latency reduction of 8.76, representing a 50% enhancement. The average
queue length decreased by 31.01 meters, marking a notable 48% improvement. More-
over, the average waiting time decreased to 8.71 seconds, showcasing a remarkable
120% enhancement. Additionally, the algorithm yielded a cumulative reward of 59.34,
reflecting a substantial 35% improvement compared to alternative algorithms.
These findings underscore the effectiveness of the TSC-HGAM-DRL model in opti-
mizing traffic signal control and minimizing congestion during peak traffic periods. By
integrating advanced techniques from deep reinforcement learning and graph neural
networks, the model offers a robust framework for addressing the complex dynamics
of urban traffic management. The significant improvements observed across multiple
evaluation metrics highlight the potential of this approach to enhance overall traffic
efficiency and reduce travel delays, contributing to more sustainable and resilient urban
transportation systems. When compared to four other traffic control models, this
model shows performance advantages of over 13% across multiple performance indica-
tors. However, further evaluation and validation are necessary, particularly for traffic
scenarios with varying complexities and characteristics. Additionally, the study did
not account for various potential influencing factors such as environmental conditions,
system failures, and uncertainties.
[21] Zhu et al. introduced an Intelligent Traffic Light Control (ITLC) system
aimed at mitigating traffic congestion. The system leveraged Multi-Agent Deep Rein-
forcement Learning [31](DRL) algorithms, which utilize deep neural networks for
decision-making. To enhance versatility, the study incorporated three distinct datasets.
Experimentation was conducted using the Cityflow simulator, with the primary goal
of alleviating traffic congestion by reducing both average travel time and total training
duration.
Throughout the study, comparisons were drawn among various traffic control
systems and reinforcement learning approaches. These included the fixed traffic
light control system, self-organizing traffic control system, Single-Agent Deep Rein-
forcement Learning, and Single-Agent Broad Reinforcement Learning methods. The
findings revealed notable advantages of the Multi-Agent Deep Reinforcement Learning
algorithm over the others.
8
Specifically, the training time for the ITLC system was found to be 35.03%, 27.51%,
and 18.22% faster compared to the fixed, self-organizing, and single-agent approaches,
respectively. Moreover, the stability of the Multi-Agent DRL algorithm significantly
surpassed that of its counterparts.
These results underscore the efficacy of the ITLC system in enhancing traffic man-
agement efficiency. By harnessing the capabilities of Multi-Agent DRL, the system
demonstrates promising potential for optimizing traffic flow and reducing congestion
in urban environments. Additionally, the utilization of diverse datasets enhances the
adaptability and robustness of the model, further contributing to its effectiveness in
addressing real-world traffic challenges.
[20] Guo and et al. proposed the CoTV (Cooperative Control for Traffic Light
Signals and Connected Autonomous Vehicles) system to effectively manage both traf-
fic light signals and various Connected Autonomous Vehicles (CAVs). This system
employs Deep Reinforcement Learning (DRL) to optimize the balance between reduc-
ing travel time, fuel consumption, and emissions. As a multi-agent DRL system, CoTV
demonstrates notable improvements over the Baseline approach. In comparative anal-
ysis, CoTV achieved remarkable savings of up to 28% in fuel consumption and reduced
travel time by up to 30%. These findings suggest that CoTV represents a significant
advancement in traffic control strategies for both traffic signals and CAVs. Its abil-
ity to efficiently coordinate between these elements highlights its potential to enhance
overall traffic management efficiency while promoting sustainability and reducing envi-
ronmental impact. The CoTV system stands as a noteworthy contribution to the
field, offering insights into the optimization of traffic flow in the context of emerging
autonomous vehicle technologies.
[23] Kwok et al. introduced an innovative intelligent traffic light system designed
to address traffic congestion issues in Hong Kong. This system combines object detec-
tion technology with an evolutionary algorithm to optimize traffic flow. The study
utilized live video footage from Hong Kong to develop traffic models, which were then
employed to simulate traffic scenarios. Both the existing Fixed Cycle Traffic Light
(FCTL) system and the proposed intelligent system were implemented at intersections.
The simulation, based on estimates of pedestrian and vehicle numbers obtained
through object detection, allowed for dynamic switching of traffic signals in response
to varying traffic conditions. Results showed significant improvements in traffic effi-
ciency with the proposed system. During non-peak hours, the throughput of the roads
increased from 44% to approximately 60%. Moreover, during peak hours, the overall
efficiency saw an increase from 54% to 57%, as measured by the reduction in total
waiting time.
Notably, the implementation of the evolutionary algorithm led to a substantial
average reduction of 65% in total waiting time. These findings highlight the effec-
tiveness of the intelligent traffic light system in enhancing traffic flow and reducing
congestion in urban areas. By dynamically adjusting signal timings based on real-time
traffic data, the system demonstrates potential for improving overall transportation
efficiency and reducing travel delays.
9
[26] Liu et al. implemented traffic signal control using a teacher-student framework,
a reinforcement learning paradigm where a teacher guides a student in decision-
making. This framework is enhanced by incorporating a teaching policy, where the
teacher learns how to effectively guide the student, and a max update approach,
enabling the student to estimate the value function accurately. Initially, the teacher
is pre-trained in a specific traffic scenario and then applied to other traffic scenar-
ios. Meanwhile, the student, initialized with random parameters, is guided by the
pre-trained teacher. Notably, the policies of the student and teacher differ. The pro-
posed approach is compared with Double Deep Q-Learning, a traditional reinforcement
learning method, and demonstrates promising results. Across all four configurations
tested, the proposed approach achieved significant reductions in travel time ranging
from 15.73% to 27.91%. Moreover, the approach substantially decreased average wait-
ing time by 39.83% to 60.72% overall. By leveraging the teacher-student framework,
this approach offers a novel perspective on traffic signal control, showcasing improve-
ments in both travel time and waiting time. These findings underscore the potential
of reinforcement learning techniques in optimizing traffic flow and enhancing urban
mobility.
[27] Song et al. gave us an approach called the Multi-Agent Counterfactual
Actor-Critic with Scheduler framework, representing a modified reinforcement learn-
ing algorithm that facilitates information exchange among agents. This framework
surpassed traditional model-based methods and other existing reinforcement learn-
ing (RL) approaches. A central critic was employed to coordinate global information
using the global state, enhancing the model’s performance. The simulation study was
conducted on the actual road network of Shenzhen city, China, featuring nine intersec-
tions. The study focused on real-time traffic demand, categorized into low, medium,
and high demand periods based on actual traffic flow timing. Benchmark comparisons
were conducted against several algorithms, including Adaptive Signal Timer, MaxPres-
sure algorithm, classic RL-based Deep Q-Network (DQN) method, Multi-Agent Policy
Gradient method, Multi-Agent Actor-Critic method, and COMA-based method. This
innovative framework demonstrates significant advancements in traffic management
strategies, particularly in urban environments. By leveraging a multi-agent approach
and incorporating real-time traffic demand data, the model effectively optimizes traffic
signal control in complex urban road networks. The comparison against various bench-
marks showcases the superiority of the proposed framework, highlighting its potential
to enhance traffic flow efficiency and reduce congestion in urban areas. Additionally,
the utilization of real road network data from Shenzhen city adds credibility and rele-
vance to the findings, emphasizing the practical applicability of the proposed approach
in real-world scenarios.
[28]Wang et al. proposed a cooperative RL (MARL) algorithm designed to opti-
mize traffic signal control. Their approach leverages cooperative vehicle-infrastructure
systems to efficiently manage large-scale road networks. Through extensive simulation,
the researchers explored multiple reinforcement learning methods and found that Deep
Q-Network (DQN) in combination with MARL yielded promising results. The algo-
rithm was trained and evaluated on a 6x6 network using SUMO simulation software
to replicate real-world scenarios. In their study, real-time traffic control adjustments
10
were made every 20 seconds, with congestion levels determined by the number of
waiting vehicles. Three congestion thresholds were considered: less than 5 vehicles,
5-15 vehicles, and above 15 vehicles. The proposed algorithm outperformed alterna-
tive strategies significantly, achieving a 78.84% improvement over a fixed approach, a
95.69% improvement over the CGL-ILQ strategy, and an 86.64% improvement over
the CGB-MA2C strategy. This cooperative MARL approach demonstrates consider-
able potential in enhancing traffic signal control systems, particularly in large-scale
urban environments. By leveraging cooperative interactions between vehicles and
infrastructure, the algorithm effectively reduces congestion levels and improves traffic
flow efficiency. Moreover, the rigorous evaluation against various benchmarks under-
scores the robustness and effectiveness of the proposed approach. Overall, Wang et
al.’s research contributes valuable insights into the development of advanced traffic
management strategies aimed at addressing the challenges of modern urban mobility.
[19]Essa and Sayed introduced a reinforcement learning algorithm implemented
on the VISSIM simulation platform, which was subsequently tested using real-world
traffic data from Surrey, British Columbia. The VISSIM model, calibrated for each
intersection, underwent extensive testing over a 9-hour period from 9:00 am to 6:00 pm.
Through this testing, it was observed that the maximum safety benefit was achieved
when the Minimum Predicted Risk (MPR) of connected vehicles (CV) reached 50%,
resulting in a remarkable 90% safety benefit. This suggests that the summarized algo-
rithm could prove highly effective, particularly when the MPR of CVs is equal to or
greater than 30%. Additionally, the study highlights that higher MPR values corre-
spond to increased effectiveness of the safety algorithm. This research underscores the
potential of reinforcement learning techniques in optimizing traffic safety, particularly
in urban environments, and provides valuable insights into the importance of MPR in
enhancing the efficacy of such algorithms.
11
and image segmentation since these models have a good capability of representing
spatial hierarchies.
Recurrent Neural Networks (RNN): RNNs[33][34] are specifically designed
to handle sequential inputs because it stores the earlier inputs as part of its memory.
They have edges that form directed cycles that help in getting temporal dependencies.
RNNs are commonly used in language modeling, in speech recognition, and in the
times series prediction domain.
Long Short-Term Memory (LSTM): LSTM Networks are a category of RNNs
that have been scientifically designed to overcome what is commonly called the van-
ishing gradient problem, when training simple RNNs for lengthy sequences. LSTMs
are built out of many cells with gates that enable the model to retain information for
a longer pass. This makes them particularly suitable for those tasks, which require the
long term management of dependencies.
These neural network algorithms form the backbone of many modern machine
learning and artificial intelligence applications and have been instrumental in advanc-
ing various fields, including computer vision, natural language processing, and
reinforcement learning.These are also the most used algorithms in the field of smart
city and traffic signal controlling.[35][36][37]
[38] Espinoza et al. proposed a novel method for traffic flow prediction, combining
both Machine Learning and Deep Learning techniques. It utilized two publicly avail-
able datasets for training, validation, and testing purposes. Recurrent Neural Networks
(RNNs), includingLong Short-Term Memory (LSTM) networks, were developed, with
75% of the dataset allocated for training and the remaining 25% for testing.
The research explored the potential application of these models in smart traf-
fic light control systems. Among the models investigated, the Multilayer Perceptron
(MLP) demonstrated superior performance, achieving the highest accuracy while also
requiring the least processing time. This indicates its suitability for real-time traffic
flow prediction and integration into intelligent transportation systems.
By leveraging ML and DL technologies, this study contributes to the advance-
ment of traffic management strategies. The predictive capabilities of neural networks,
such as GRU, LSTM, and MLP, offer valuable insights for optimizing traffic con-
trol mechanisms and enhancing overall transportation efficiency. These findings drive
progress towards more adaptive and responsive traffic management solutions, essential
for mitigating congestion and improving urban mobility.
[39] Abdullah et al. presented a novel way of predicting traffic congestion using an
RNN model to enhance the traffic of smart cities. Their model employs deep learning
(DL), a technology reputed to improve the traffic flow control primarily through traffic
signals and the traffic flow prediction. The study firstly introduces an approach in
which the SpotLock application observes the network flow based on Ryu controller in
the context of SDN. The main objective is to use the method of network flow analysis
for the enhancement of the process of network traffic and the immediate information
on the usage of the network. The proposed model is based on three main elements:
In order to predict the road traffic, an edge-based vehicular environment is
employed where edge servers also maintain history and current data of social media,
weather conditions, and others including the number of vehicles on the road and the
condition of the road.
12
Table 3 Neural Network Based work summary
13
So after covering all the part about Neural Networks we move on to the next
category.
14
Table 4 Support Vector Machine Summary
3 Conclusion
In summary, this survey paper offers a comprehensive exploration of diverse traf-
fic signal management techniques prevalent in contemporary transportation systems.
Throughout our analysis, we’ve delved into a spectrum of methodologies, encom-
passing traditional practices, heuristic algorithms, and cutting-edge technologies like
15
machine learning and deep reinforcement learning. Our findings underscore the con-
tinued relevance of conventional methods such as static-time control and dynamic
traffic signal control, alongside the burgeoning adoption of adaptive control and rein-
forcement learning-based approaches. These newer techniques exhibit promise in their
capacity to dynamically adjust signal timings, responding adeptly to fluctuating traffic
conditions in real-time.
Furthermore, our study also considers multifarious factors like traffic volume,
vehicular composition, pedestrian patterns, and environmental variables in devising
and implementing traffic signal control strategies. The integration of emerging tech-
nologies such as IoT and machine learning emerges as pivotal for enhancing system
efficiency and safety. In essence, this survey paper serves as an invaluable resource for
researchers, transportation planners, and policymakers alike, offering nuanced insights
into the evolving terrain of traffic signal management methodologies. By discerning
the strengths and limitations of diverse approaches, stakeholders are empowered to
make informed decisions aimed at optimizing traffic flow, alleviating congestion, and
augmenting overall transportation system efficacy in urban landscapes.
References
[1] Smith, J., Johnson, E.: Traffic congestion: Causes, effects, and mitigation
strategies. Transportation Research Part A: Policy and Practice 134, 123–145
(2020)
[2] Chen, W., Li, X., Zhang, W.: Traffic congestion management: Recent advances
and future directions. Transportation Research Part C: Emerging Technologies
129, 102–123 (2021)
[3] Tan, M., Pang, R., Le, Q.V.: Efficientdet: Scalable and efficient object detec-
tion. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern
Recognition (2020)
[4] Zhou, X., Wang, D., Krähenbühl, P.: Objects as points. In: Proceedings of the
IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019)
[5] Cai, Z., Vasconcelos, N.: Cascade r-cnn: Delving into high quality object detec-
tion. IEEE Transactions on Pattern Analysis and Machine Intelligence 43(2),
315–328 (2021)
[6] Bochkovskiy, A., Wang, C.-Y., Liao, H.-Y., Možeiko, V., Fidler, S.: Yolov4: Opti-
mal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934
(2020)
16
[7] Liu, H., Zhang, Z., Wang, C.: Intelligent traffic signal management: Challenges
and opportunities. IEEE Transactions on Intelligent Transportation Systems
22(4), 2394–2406 (2021)
[8] Jin, J., Jin, J.: Traffic congestion and air pollution: Empirical evidence
before/after covid-19 in seoul, korea. International Journal of Sustainable
Transportation 17(12), 1356–1369 (2023) https://doi.org/10.1080/15568318.
2023.2192688
[9] Kwon, D., Paul, K.C., Yu, Y., Zhang, K., Folle, A.D., Wu, J., Bronstein, J.M.,
Ritz, B.: Traffic-related air pollution and parkinson’s disease in central california.
Environmental Research 240, 117434 (2024) https://doi.org/10.1016/j.envres.
2023.117434
[10] Singh, S., Singh, J., Goyal, S.B., Singh Sehra, S., Ali, F., Ayad Alkhafaji, M.,
Singh, R.: A novel framework to avoid traffic congestion and air pollution for
sustainable development of smart cities. Sustainable Energy Technologies and
Assessments 56, 103125 (2023) https://doi.org/10.1016/j.seta.2023.103125
[11] Kehlbacher, A., Stark, K., Gebhardt, L., Jarass, J., Schuppan, J.: Comparing
municipal progress in implementing temporary cycle lanes during the covid-19
pandemic. Transportation Research Part A: Policy and Practice 174, 103752
(2023) https://doi.org/10.1016/j.tra.2023.103752
[12] Almallah, M., Alhajyaseen, W.K.M., Dias, C.: Safety assessment of on-road
cycling lanes: A comparative study of different layouts using driving simulator.
Accident Analysis and Prevention 196, 107431 (2024) https://doi.org/10.1016/j.
aap.2023.107431
[13] Yazdani, M., Sarvi, M., Asadi Bagloee, S., Nassir, N., Price, J., Parineh, H.:
Intelligent vehicle pedestrian light (ivpl): A deep reinforcement learning approach
for traffic signal control. Transportation Research Part C: Emerging Technologies
149, 103991 (2023) https://doi.org/10.1016/j.trc.2022.103991
[14] Wei, S., Zeng, A., Qiu, M., Guan, H.: Adaptive traffic signal control with deep
reinforcement learning. IEEE Transactions on Intelligent Transportation Systems
21(9), 3866–3879 (2020)
[15] Shao, J., Chen, G.: Reinforcement learning based adaptive traffic signal control
with congestion and emergency handling. IEEE Access 8, 11449–11461 (2020)
[16] Yang, J., Wu, J., Lv, Z., Zhang, K.: Deep reinforcement learning for adaptive
traffic signal control: Methodology and performance comparison. IEEE Access 8,
89572–89584 (2020)
[17] Dai, X., Lu, Z., Sun, L.: Traffic signal control based on multi-agent deep
reinforcement learning. Journal of Advanced Transportation 2020, 1–9 (2020)
17
[18] Genders, W., Clercq, S.D., Brussel, H.V., Vanden Berghe, G.: Optimizing traffic
signal control through deep reinforcement learning. In: 2020 23rd International
Conference on Intelligent Transportation Systems (ITSC), pp. 1–6 (2020). IEEE
[19] Essa, M., Sayed, T.: Self-learning adaptive traffic signal control for real-time safety
optimization. Accident Analysis Prevention 146, 105713 (2020) https://doi.org/
10.1016/j.aap.2020.105713
[20] Guo, J., Cheng, L., Wang, S.: Cotv: Cooperative control for traffic light sig-
nals and connected autonomous vehicles using deep reinforcement learning. IEEE
Transactions on Intelligent Transportation Systems 24(10), 10501–10512 (2023)
https://doi.org/10.1109/TITS.2023.3276416
[21] Zhu, R., Li, L., Wu, S., Lv, P., Li, Y., Xu, M.: Multi-agent broad reinforcement
learning for intelligent traffic light control. Information Sciences 619, 509–525
(2023) https://doi.org/10.1016/j.ins.2022.11.062
[22] Zhao, Z., Wang, K., Wang, Y., Liang, X.: Enhancing traffic signal control with
composite deep intelligence. Expert Systems with Applications 244, 123020
(2024) https://doi.org/10.1016/j.eswa.2023.123020
[23] Ng, S.-C., Kwok, C.-P.: An intelligent traffic light system using object detec-
tion and evolutionary algorithm for alleviating traffic congestion in hong kong.
International Journal of Computational Intelligence Systems 13, 802–809 (2020)
https://doi.org/10.2991/ijcis.d.200522.001
[25] Li, C., Ma, X., Xia, L., Zhao, Q., Yang, J.: Fairness control of traffic light via
deep reinforcement learning. In: 2020 IEEE 16th International Conference on
Automation Science and Engineering (CASE), pp. 652–658 (2020). https://doi.
org/10.1109/CASE48305.2020.9216899
[26] Liu, J., Qin, S., Su, M., Luo, Y., Zhang, S., Wang, Y., Yang, S.: Traffic signal
control using reinforcement learning based on the teacher-student framework.
Expert Systems with Applications 228, 120458 (2023) https://doi.org/10.1016/
j.eswa.2023.120458
[27] Song, X.B., Zhou, B., Ma, D.: Cooperative traffic signal control through a coun-
terfactual multi-agent deep actor critic approach. Transportation Research Part
C: Emerging Technologies 160, 104528 (2024) https://doi.org/10.1016/j.trc.2024.
104528
[28] Wang, T., Cao, J., Hussain, A.: Adaptive traffic signal control for large-scale
scenario with cooperative group-based multi-agent reinforcement learning. Trans-
portation Research Part C: Emerging Technologies 125, 103046 (2021) https:
//doi.org/10.1016/j.trc.2021.103046
18
[29] Krajzewicz, D., Erdmann, J., Behrisch, M., Bieker, L.: Sumo - simulation of urban
mobility: An overview. SIMULATION 88(9), 855–867 (2012)
[30] Tan, C., Yang, K.: Privacy-preserving adaptive traffic signal control in a connected
vehicle environment. Transportation Research Part C: Emerging Technologies
158, 104453 (2024) https://doi.org/10.1016/j.trc.2023.104453
[31] Zhang, Z., Li, X., Wu, J., Gao, J.: Cooperative multi-agent deep reinforcement
learning with dynamic information exchange for traffic signal control. IEEE
Transactions on Intelligent Transportation Systems (2022)
[32] Haykin, S.: Neural Networks and Learning Machines. Pearson Education, ???
(2009)
[33] Li, X., Wu, J., Gao, J., Wang, J., Wu, Y.: Adaptive traffic signal control based
on recurrent neural networks with time-varying coefficients. IEEE Transactions
on Intelligent Transportation Systems (2021)
[34] Chen, Y., Zhang, Z., Li, X., Wu, J.: Traffic signal control using recurrent
neural networks with reinforcement learning. IEEE Transactions on Intelligent
Transportation Systems 23(4), 2036–2045 (2022)
[35] Wang, C., Zegras, C.: Traffic signal control using convolutional neural networks.
Transportation Research Part C: Emerging Technologies 115, 102615 (2020)
[36] Vural, S., Li, X.: Real-time traffic signal control using recurrent neural networks.
Transportation Research Part C: Emerging Technologies 103, 152–167 (2019)
[37] Bessa, W.: Traffic signal control using echo state neural networks. Transportation
Research Part C: Emerging Technologies 119, 102810 (2020)
[39] Abdullah, S.M., Periyasamy, M., Kamaludeen, N.A., Towfek, S.K., Marappan,
R., Kidambi Raju, S., Alharbi, A.H., Khafaga, D.S.: Optimizing traffic flow in
smart cities: Soft gru-based recurrent neural networks for enhanced congestion
prediction using deep learning. Sustainability 15(7) (2023) https://doi.org/10.
3390/su15075949
[40] Hadi, F., Siddiqui, S., Rashid, S., Memon, A.G., Raza, M.A.: Vehicular ad-
hoc network based intelligent traffic signal system for smart cities. International
Journal of Distributed Sensor Networks 16(2), 1550147720909240 (2020)
19
[41] Tong, X., Gu, X., Xu, F., Han, D.: Vehicular ad-hoc networks (vanets): State-of-
the-art and future perspectives. Journal of Computer and Communications 8(8),
1–18 (2020)
[42] Li, Z., Guan, H., Zhang, Z., Lv, Z., Yang, J.: Vehicular ad-hoc networks (vanets):
Current challenges and future directions. IEEE Access 9, 30610–30622 (2021)
[43] Majumdar, S., Subhani, M.M., Roullier, B., Anjum, A., Zhu, R.: Congestion
prediction for smart sustainable cities using iot and machine learning approaches.
Sustainable Cities and Society 64, 102500 (2021) https://doi.org/10.1016/j.scs.
2020.102500
[44] Li, X., Zhang, Z., Wu, J., Gao, J.: Adaptive traffic signal control using long
short-term memory networks with dynamic time warping. IEEE Transactions on
Intelligent Transportation Systems (2022)
[45] Wang, C., Zhang, Z., Li, X.: Real-time traffic signal control using long short-term
memory networks and deep reinforcement learning. Transportation Research Part
C: Emerging Technologies 135, 103430 (2022)
[46] Li, L., Song, M.: Support vector machine-based traffic signal optimization under
stochastic demand. Transportation Research Part C: Emerging Technologies 112,
102642 (2020)
[47] Devi, T., Alice, K., Deepa, N.: Traffic management in smart cities using support
vector machine for predicting the accuracy during peak traffic conditions. Mate-
rials Today: Proceedings 62, 4980–4984 (2022) https://doi.org/10.1016/j.matpr.
2022.03.722 . International Conference on Innovative Technology for Sustainable
Development
[48] Tan, L., Zhu, Y.: Support vector machine for traffic signal control with variable
speed limits. Transportation Research Part C: Emerging Technologies 111, 142–
157 (2020)
[49] Shen, B., Gu, X., Xu, F., Wang, S.: Adaptive traffic signal control based on
support vector machine with statistical pattern recognition. Journal of Advanced
Transportation 2020, 1–12 (2020)
[50] Yao, Y., He, Z., Xie, X.: Traffic signal control based on support vector machine
optimized by bat algorithm. Computational Intelligence and Neuroscience 2019,
1–10 (2019)
[51] Anjaneyulu, M., Kubendiran, M.: Short-term traffic congestion prediction using
hybrid deep learning technique. Sustainability 15(1) (2023) https://doi.org/10.
3390/su15010074
20