Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,814)

Search Parameters:
Keywords = multiagent

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
25 pages, 5210 KiB  
Article
Application of SHAP and Multi-Agent Approach for Short-Term Forecast of Power Consumption of Gas Industry Enterprises
by Alina I. Stepanova, Alexandra I. Khalyasmaa, Pavel V. Matrenin and Stanislav A. Eroshenko
Algorithms 2024, 17(10), 447; https://doi.org/10.3390/a17100447 - 8 Oct 2024
Abstract
Currently, machine learning methods are widely applied in the power industry to solve various tasks, including short-term power consumption forecasting. However, the lack of interpretability of machine learning methods can lead to their incorrect use, potentially resulting in electrical system instability or equipment [...] Read more.
Currently, machine learning methods are widely applied in the power industry to solve various tasks, including short-term power consumption forecasting. However, the lack of interpretability of machine learning methods can lead to their incorrect use, potentially resulting in electrical system instability or equipment failures. This article addresses the task of short-term power consumption forecasting, one of the tasks of enhancing the energy efficiency of gas industry enterprises. In order to reduce the risks of making incorrect decisions based on the results of short-term power consumption forecasts made by machine learning methods, the SHapley Additive exPlanations method was proposed. Additionally, the application of a multi-agent approach for the decomposition of production processes using self-generation agents, energy storage agents, and consumption agents was demonstrated. It can enable the safe operation of critical infrastructure, for instance, adjusting the operation modes of self-generation units and energy-storage systems, optimizing the power consumption schedule, and reducing electricity and power costs. A comparative analysis of various algorithms for constructing decision tree ensembles was conducted to forecast power consumption by gas industry enterprises with different numbers of categorical features. The experiments demonstrated that using the developed method and production process factors reduced the MAE from 105.00 kWh (MAPE of 16.81%), obtained through expert forecasting, to 15.52 kWh (3.44%). Examples were provided of how the use of SHapley Additive exPlanation can increase the safety of the electrical system management of gas industry enterprises by improving experts’ confidence in the results of the information system. Full article
Show Figures

Figure 1

32 pages, 10874 KiB  
Article
Advanced Cooperative Formation Control in Variable-Sweep Wing UAVs via the MADDPG–VSC Algorithm
by Zhengyang Cao and Gang Chen
Appl. Sci. 2024, 14(19), 9048; https://doi.org/10.3390/app14199048 - 7 Oct 2024
Abstract
UAV technology is advancing rapidly, and variable-sweep wing UAVs are increasingly valuable because they can adapt to different flight conditions. However, conventional control methods often struggle with managing continuous action spaces and responding to dynamic environments, making them inadequate for complex multi-UAV cooperative [...] Read more.
UAV technology is advancing rapidly, and variable-sweep wing UAVs are increasingly valuable because they can adapt to different flight conditions. However, conventional control methods often struggle with managing continuous action spaces and responding to dynamic environments, making them inadequate for complex multi-UAV cooperative formation control tasks. To address these challenges, this study presents an innovative framework that integrates dynamic modeling with morphing control, optimized by the multi-agent deep deterministic policy gradient for two-sweep control (MADDPG–VSC) algorithm. This approach enables real-time sweep angle adjustments based on current flight states, significantly enhancing aerodynamic efficiency and overall UAV performance. The precise motion state model for wing morphing developed in this study underpins the MADDPG–VSC algorithm’s implementation. The algorithm not only optimizes multi-UAV formation control efficiency but also improves obstacle avoidance, attitude stability, and decision-making speed. Extensive simulations and real-world experiments consistently demonstrate that the proposed algorithm outperforms contemporary methods in multiple aspects, underscoring its practical applicability in complex aerial systems. This study advances control technologies for morphing-wing UAV formation and offers new insights into multi-agent cooperative control, with substantial potential for real-world applications. Full article
(This article belongs to the Special Issue Collaborative Learning and Optimization Theory and Its Applications)
Show Figures

Figure 1

19 pages, 3297 KiB  
Article
Consensus Control for Stochastic Multi-Agent Systems with Markovian Switching via Periodic Dynamic Event-Triggered Strategy
by Xue Luo, Chengbo Yi, Jianwen Feng, Jingyi Wang and Yi Zhao
Axioms 2024, 13(10), 694; https://doi.org/10.3390/axioms13100694 - 7 Oct 2024
Abstract
The consensus problem in stochastic multi-agent systems (MASs) with Markovian switching is addressed by proposing a novel distributed dynamic event-triggered (DDET) technique based on periodic sampling to reduce information transmission. Unlike traditional event-triggered control, the proposed periodic sampling-based DDET method is characterized by [...] Read more.
The consensus problem in stochastic multi-agent systems (MASs) with Markovian switching is addressed by proposing a novel distributed dynamic event-triggered (DDET) technique based on periodic sampling to reduce information transmission. Unlike traditional event-triggered control, the proposed periodic sampling-based DDET method is characterized by the following three advantages: (1) The need for continuous monitoring of the event trigger is eliminated. (2) Zeno behavior in stochastic MASs is effectively prevented. (3) Communication costs are significantly reduced. Based on this, sufficient conditions for achieving consensus in the mean-square sense are derived using Lyapunov–Krasovskii functions, providing a solid theoretical foundation for the proposed strategy. The effectiveness of the proposed DDET control is validated through two numerical examples. Full article
Show Figures

Figure 1

17 pages, 4996 KiB  
Article
Safeguarding Personal Identifiable Information (PII) after Smartphone Pairing with a Connected Vehicle
by Jason Carlton and Hafiz Malik
J. Sens. Actuator Netw. 2024, 13(5), 63; https://doi.org/10.3390/jsan13050063 - 6 Oct 2024
Abstract
The integration of connected autonomous vehicles (CAVs) has significantly enhanced driving convenience, but it has also raised serious privacy concerns, particularly regarding the personal identifiable information (PII) stored on infotainment systems. Recent advances in connected and autonomous vehicle control, such as multi-agent system [...] Read more.
The integration of connected autonomous vehicles (CAVs) has significantly enhanced driving convenience, but it has also raised serious privacy concerns, particularly regarding the personal identifiable information (PII) stored on infotainment systems. Recent advances in connected and autonomous vehicle control, such as multi-agent system (MAS)-based hierarchical architectures and privacy-preserving strategies for mixed-autonomy platoon control, underscore the increasing complexity of privacy management within these environments. Rental cars with infotainment systems pose substantial challenges, as renters often fail to delete their data, leaving it accessible to subsequent renters. This study investigates the risks associated with PII in connected vehicles and emphasizes the necessity of automated solutions to ensure data privacy. We introduce the Vehicle Inactive Profile Remover (VIPR), an innovative automated solution designed to identify and delete PII left on infotainment systems. The efficacy of VIPR is evaluated through surveys, hands-on experiments with rental vehicles, and a controlled laboratory environment. VIPR achieved a 99.5% success rate in removing user profiles, with an average deletion time of 4.8 s or less, demonstrating its effectiveness in mitigating privacy risks. This solution highlights VIPR as a critical tool for enhancing privacy in connected vehicle environments, promoting a safer, more responsible use of connected vehicle technology in society. Full article
(This article belongs to the Special Issue Feature Papers in the Section of Network Security and Privacy)
Show Figures

Figure 1

33 pages, 10372 KiB  
Article
Adaptive Multi-Agent Reinforcement Learning for Optimizing Dynamic Electric Vehicle Charging Networks in Thailand
by Pitchaya Jamjuntr, Chanchai Techawatcharapaikul and Pannee Suanpang
World Electr. Veh. J. 2024, 15(10), 453; https://doi.org/10.3390/wevj15100453 - 6 Oct 2024
Abstract
The rapid growth of electric vehicles (EVs) necessitates efficient management of dynamic EV charging networks to optimize resource utilization and enhance service reliability. This paper explores the application of adaptive multi-agent reinforcement learning (MARL) to address the complexities of EV charging infrastructure in [...] Read more.
The rapid growth of electric vehicles (EVs) necessitates efficient management of dynamic EV charging networks to optimize resource utilization and enhance service reliability. This paper explores the application of adaptive multi-agent reinforcement learning (MARL) to address the complexities of EV charging infrastructure in Thailand. By employing MARL, multiple autonomous agents learn to optimize charging strategies based on real-time data by adapting to fluctuating demand and varying electricity prices. Building upon previous research that applied MARL to static network configurations, this study extends the application to dynamic and real-world scenarios, integrating real-time data to refine agent learning processes and also evaluating the effectiveness of adaptive MARL in maximizing rewards and improving operational efficiency compared to traditional methods. Experimental results indicate that MARL-based strategies increased efficiency by 20% and reduced energy costs by 15% relative to conventional algorithms. Key findings demonstrate the potential of extending MARL in transforming EV charging network management, highlighting its benefits for stakeholders, including EV owners, operators, and utility providers. This research contributes insights into advancing electric mobility and energy management in Thailand through innovative AI-driven approaches. The implications of this study include significant improvements in the reliability and cost-effectiveness of EV charging networks, fostering greater adoption of electric vehicles and supporting sustainable energy initiatives. Future research directions include enhancing MARL adaptability and scalability as well as integrating predictive analytics for proactive network optimization and sustainability. These advancements promise to further refine the efficacy of EV charging networks, ensuring that they meet the growing demands of Thailand’s evolving electric mobility landscape. Full article
Show Figures

Figure 1

13 pages, 2958 KiB  
Article
Research on Decision-Making and Control Technology for Hydraulic Supports Based on Digital Twins
by Xiusong You and Shirong Ge
Symmetry 2024, 16(10), 1316; https://doi.org/10.3390/sym16101316 - 5 Oct 2024
Abstract
To further enhance the intelligent construction of coal mines and improve the control accuracy of hydraulic support displacement straightness, a digital twin control method for hydraulic support displacement has been proposed. First, a dataset related to hydraulic support is established, and a ridge [...] Read more.
To further enhance the intelligent construction of coal mines and improve the control accuracy of hydraulic support displacement straightness, a digital twin control method for hydraulic support displacement has been proposed. First, a dataset related to hydraulic support is established, and a ridge regression prediction model is developed to achieve digital twin-based displacement decision analysis. Next, by analyzing the mechanical structure and displacement principles of the hydraulic supports, a hydraulic cylinder mathematics model is established, leading to the state-space representation of the controlled object. This study focuses on error control during the multi-agent operation of the hydraulic supports, designing a corresponding controller and using discretization methods to verify the consistency of output displacement between followers and leaders. Finally, simulation experiments based on the digital twin model of hydraulic supports are conducted, validating that the hydraulic supports can be controlled in formation according to actual production requirements. The digital twin control method enables the precise adaptive displacement control of hydraulic supports and provides valuable insights for the intelligent construction of mining faces. Full article
Show Figures

Figure 1

14 pages, 1035 KiB  
Article
Distributed Disturbance Observer-Based Containment Control of Multi-Agent Systems with Event-Triggered Communications
by Lin Hu and Long Jian
Mathematics 2024, 12(19), 3117; https://doi.org/10.3390/math12193117 - 5 Oct 2024
Abstract
This article investigates a class of multi-agent systems (MASs) with known dynamics external disturbances, where the communication graph is directed, and the followers have undirected connections. To eliminate the impacts of external disturbance, the technologies of disturbance observer-based control are introduced into the [...] Read more.
This article investigates a class of multi-agent systems (MASs) with known dynamics external disturbances, where the communication graph is directed, and the followers have undirected connections. To eliminate the impacts of external disturbance, the technologies of disturbance observer-based control are introduced into the containment control problems. Additionally, to save communication costs and energy consumption, a distributed disturbance observer-based event-triggered controller is employed to achieve containment control and reject disturbance. Furthermore, designing the event-triggered function using an exponential function is beneficial for a time-dependent term while ensuring the exclusion of Zeno behavior. Finally, a numerical simulation is provided to validate the effectiveness of the theoretical analysis. Full article
Show Figures

Figure 1

20 pages, 1379 KiB  
Article
Energy Efficiency Maximization for Multi-UAV-IRS-Assisted Marine Vehicle Systems
by Chaoyue Zhang, Bin Lin, Chao Li and Shuang Qi
J. Mar. Sci. Eng. 2024, 12(10), 1761; https://doi.org/10.3390/jmse12101761 - 4 Oct 2024
Abstract
Mobile edge computing is envisioned as a prospective technology for supporting time-sensitive and computation-intensive applications in marine vehicle systems. However, the offloading performance is highly impacted by the poor wireless channel. Recently, an Unmanned Aerial Vehicle (UAV) equipped with an Intelligent Reflecting Surface [...] Read more.
Mobile edge computing is envisioned as a prospective technology for supporting time-sensitive and computation-intensive applications in marine vehicle systems. However, the offloading performance is highly impacted by the poor wireless channel. Recently, an Unmanned Aerial Vehicle (UAV) equipped with an Intelligent Reflecting Surface (IRS), i.e., UIRS, has drawn attention due to its capability to control wireless signals so as to improve the data rate. In this paper, we consider a multi-UIRS-assisted marine vehicle system where UIRSs are deployed to assist in the computation offloading of Unmanned Surface Vehicles (USVs). To improve energy efficiency, the optimization problem of the association relationships, computation resources of USVs, multi-UIRS phase shifts, and multi-UIRS trajectories is formulated. To solve the mixed-integer nonlinear programming problem, we decompose it into two layers and propose an integrated convex optimization and deep reinforcement learning algorithm to attain the near-optimal solution. Specifically, the inner layer solves the discrete variables by using the convex optimization based on Dinkelbach and relaxation methods, and the outer layer optimizes the continuous variables based on the Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3). The numerical results demonstrate that the proposed algorithm can effectively improve the energy efficiency of the multi-UIRS-assisted marine vehicle system in comparison with the benchmarks. Full article
(This article belongs to the Special Issue Unmanned Marine Vehicles: Navigation, Control and Sensing)
Show Figures

Figure 1

26 pages, 689 KiB  
Article
Simulation of Epidemic Dynamics Using a Multi-Agent Model: Analysis of Social Distancing Strategies and Their Impacts on Public Health and Economy
by Cloves Alberto Chaves de Lima, Luis Augusto Silva and Patricia Cabral de Azevedo Restelli Tedesco
Appl. Sci. 2024, 14(19), 8931; https://doi.org/10.3390/app14198931 - 3 Oct 2024
Abstract
Infectious disease epidemics have played a crucial role in shaping public health responses, particularly in global health crises. This study emerges as part of the efforts to prepare effective responses to potential future pandemics, leveraging lessons learned during the COVID-19 crisis. The research [...] Read more.
Infectious disease epidemics have played a crucial role in shaping public health responses, particularly in global health crises. This study emerges as part of the efforts to prepare effective responses to potential future pandemics, leveraging lessons learned during the COVID-19 crisis. The research uses an adapted compartmental epidemiological model and a synthetic multi-agent community to investigate how social variables influence epidemic forecasts in socioeconomically vulnerable regions. Focusing on the simulation of epidemic dynamics in the socio-economically disadvantaged neighbourhood of Ilha Joana Bezerra in Recife, this study examines the impacts of social distancing strategies and other control measures, such as face masks and moderate social isolation. Through the adapted SEPAI3R3O model, which includes compartments for pre-symptomatic and asymptomatic states, this study provides a detailed analysis of disease dynamics in contexts characterised by high social vulnerability. The results underscore the importance of public health policies adapted to socio-economic factors, emphasising the need for continuous preparedness to manage future epidemic threats in vulnerable communities effectively. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

20 pages, 3171 KiB  
Article
Consensus Control of Leader–Follower Multi-Agent Systems with Unknown Parameters and Its Circuit Implementation
by Yinfang Ye and Jianbin He
Appl. Sci. 2024, 14(19), 8894; https://doi.org/10.3390/app14198894 - 2 Oct 2024
Abstract
With the development and progress of Internet and data technology, the consensus control of multi-agent systems has been an important topic in nonlinear science. How to effectively achieve the consensus of leader–follower multi-agent systems at a low cost is a difficult problem. This [...] Read more.
With the development and progress of Internet and data technology, the consensus control of multi-agent systems has been an important topic in nonlinear science. How to effectively achieve the consensus of leader–follower multi-agent systems at a low cost is a difficult problem. This paper analyzes the consensus control of complex financial systems. Firstly, the dynamic characteristics of the financial system are analyzed by the equilibrium points, bifurcation diagrams, and Lyapunov exponent spectra. The behavior of the financial system is discussed by different parameter values. Secondly, according to the Lyapunov stability theorem, the consensus of master–slave systems is proposed by linear feedback control, wherein the controllers are simple and low cost. And an adaptive control method for the consensus of master–slave systems is investigated based on financial systems with unknown parameters. In theory, the consensus of the leader–follower multi-agent systems is proved by the parameter identification laws and linear feedback control method. Finally, the effectiveness and reliability of the consensus of leader–follower multi-agent systems are verified through the experimental simulation results and circuit implementation. Full article
Show Figures

Figure 1

23 pages, 670 KiB  
Article
Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise
by Hui Yang, Qing Sun and Jiaxin Yuan
Entropy 2024, 26(10), 834; https://doi.org/10.3390/e26100834 - 30 Sep 2024
Abstract
An adaptive neural network output-feedback control strategy is proposed in this paper for the distributed optimization problem (DOP) of high-order nonlinear stochastic multi-agent systems (MASs) driven by Lévy noise. On the basis of the penalty-function method, the consensus constraint is removed and the [...] Read more.
An adaptive neural network output-feedback control strategy is proposed in this paper for the distributed optimization problem (DOP) of high-order nonlinear stochastic multi-agent systems (MASs) driven by Lévy noise. On the basis of the penalty-function method, the consensus constraint is removed and the global objective function (GOF) is reconstructed. The stability of the system is analyzed by combining the generalized Itô’s formula with the Lyapunov function method. Moreover, the command filtering mechanism is introduced to solve the “complexity explosion” problem in the process of designing virtual controller, and the filter errors are compensated by introducing compensating signals. The proposed algorithm has been proved that the outputs of all agents converge to the optimal solution of the DOP with bounded errors. The simulation results demonstrate the effectiveness of the proposed approach. Full article
(This article belongs to the Special Issue Information Theory in Control Systems, 2nd Edition)
Show Figures

Figure 1

32 pages, 6060 KiB  
Article
A Fault-Tolerant Multi-Agent Reinforcement Learning Framework for Unmanned Aerial Vehicles–Unmanned Ground Vehicle Coverage Path Planning
by Mahya Ramezani, M. A. Amiri Atashgah and Alireza Rezaee
Drones 2024, 8(10), 537; https://doi.org/10.3390/drones8100537 - 30 Sep 2024
Abstract
In this paper, we introduce a fault-tolerant multi-agent reinforcement learning framework called SERT-DQN to optimize the operations of UAVs with UGV central control in coverage path planning missions. Our approach leverages dual learning systems that combine individual agent autonomy with centralized strategic planning, [...] Read more.
In this paper, we introduce a fault-tolerant multi-agent reinforcement learning framework called SERT-DQN to optimize the operations of UAVs with UGV central control in coverage path planning missions. Our approach leverages dual learning systems that combine individual agent autonomy with centralized strategic planning, thus enhancing the efficiency of cooperative path planning missions. This framework is designed for high performance in environments with fault uncertainty detected and operational challenges such as interruptions in connectivity and compromised sensor reliability. With the integration of an innovative communication system between agents, our system appropriately handles both static and dynamic environments. Also, we introduce similarity-based shared experience replay to attain faster convergence and sample efficiency in the multi-agent system. The architecture is specially designed to respond adaptively to such irregularities by effectively showing enhanced resilience in scenarios where data integrity is impaired due to faults or the UAV faces disruptions. Simulation results indicate that our fault tolerance algorithms are very resilient and do indeed improve mission outcomes, especially under dynamic and highly uncertain operating conditions. This approach becomes critical for the most recent sensor-based research in autonomous systems. Full article
(This article belongs to the Special Issue Flight Control and Collision Avoidance of UAVs)
Show Figures

Figure 1

18 pages, 6211 KiB  
Article
A Long-Term Target Search Method for Unmanned Aerial Vehicles Based on Reinforcement Learning
by Dexing Wei, Lun Zhang, Mei Yang, Hanqiang Deng and Jian Huang
Drones 2024, 8(10), 536; https://doi.org/10.3390/drones8100536 - 30 Sep 2024
Abstract
Unmanned aerial vehicles (UAVs) are increasingly being employed in search operations. Deep reinforcement learning (DRL), owing to its robust self-learning and adaptive capabilities, has been extensively applied to drone search tasks. However, traditional DRL approaches often suffer from long training times, especially in [...] Read more.
Unmanned aerial vehicles (UAVs) are increasingly being employed in search operations. Deep reinforcement learning (DRL), owing to its robust self-learning and adaptive capabilities, has been extensively applied to drone search tasks. However, traditional DRL approaches often suffer from long training times, especially in long-term search missions for UAVs, where the interaction cycles between the agent and the environment are extended. This paper addresses this critical issue by introducing a novel method—temporally asynchronous grouped environment reinforcement learning (TAGRL). Our key innovation lies in recognizing that as the number of training environments increases, agents can learn knowledge from discontinuous trajectories. This insight leads to the design of grouped environments, allowing agents to explore only a limited number of steps within each interaction cycle rather than completing full sequences. Consequently, TAGRL demonstrates faster learning speeds and lower memory consumption compared to existing parallel environment learning methods. The results indicate that this framework enhances the efficiency of UAV search tasks, paving the way for more scalable and effective applications of RL in complex scenarios. Full article
Show Figures

Figure 1

17 pages, 3103 KiB  
Article
Distributed Consensus Tracking of Incommensurate Heterogeneous Fractional-Order Multi-Agent Systems Based on Vector Lyapunov Function Method
by Conggui Huang and Fei Wang
Fractal Fract. 2024, 8(10), 575; https://doi.org/10.3390/fractalfract8100575 - 30 Sep 2024
Abstract
This paper investigates the tracking problem of fractional-order multi-agent systems. Both the order and parameters of the leader are unknown. Firstly, based on the positive system approach, the asymptotically stable criteria for incommensurate linear fractional-order systems are derived. Secondly, the models of incommensurate [...] Read more.
This paper investigates the tracking problem of fractional-order multi-agent systems. Both the order and parameters of the leader are unknown. Firstly, based on the positive system approach, the asymptotically stable criteria for incommensurate linear fractional-order systems are derived. Secondly, the models of incommensurate heterogeneous multi-agent systems are introduced. To cope with incommensurate and heterogeneous situations among followers and the leader, radial basis function neural networks (RBFNNs) and a discontinuous control method are used. Thirdly, the consensus criteria are derived by using the Vector Lyapunov Function method. Finally, a numerical example is presented to illustrate the effectiveness of the proposed theoretical method. Full article
(This article belongs to the Special Issue Analysis and Modeling of Fractional-Order Dynamical Networks)
Show Figures

Figure 1

18 pages, 1077 KiB  
Article
Reinforcement Learning with Value Function Decomposition for Hierarchical Multi-Agent Consensus Control
by Xiaoxia Zhu
Mathematics 2024, 12(19), 3062; https://doi.org/10.3390/math12193062 - 30 Sep 2024
Abstract
A hierarchical consensus control algorithm based on value function decomposition is proposed for hierarchical multi-agent systems. To implement the consensus control algorithm, the reward function of the multi-agent systems can be decomposed, and two value functions can be obtained by analyzing the communication [...] Read more.
A hierarchical consensus control algorithm based on value function decomposition is proposed for hierarchical multi-agent systems. To implement the consensus control algorithm, the reward function of the multi-agent systems can be decomposed, and two value functions can be obtained by analyzing the communication content and the corresponding control objective of each layer in the hierarchical multi-agent systems. Therefore, for each agent in the systems, a dual-critic network and a single-actor network structure are applied to realize the objective of each layer. In addition, the target network is introduced to prevent overfitting in the critic network and improve the stability of the online learning process. During the updating of network parameters, a soft updating mechanism and experience replay buffer are introduced to slow down the update rate of the network and improve the utilization rate of training data. The convergence and stability of the consensus control algorithm with the soft updating mechanism are analyzed theoretically. Finally, the correctness of the theoretical analysis and the effectiveness of the algorithm were verified by two experiments. Full article
(This article belongs to the Special Issue Advance in Control Theory and Optimization)
Show Figures

Figure 1

Back to TopTop