Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System

Cheng, Chi; Chen, Bingshen; Xiao, Ziting; Lee, Raymond S. T.

doi:10.1007/s40815-024-01731-1

Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System

Published: 25 May 2024

Volume 26, pages 2224–2245, (2024)
Cite this article

International Journal of Fuzzy Systems Aims and scope Submit manuscript

390 Accesses
Explore all metrics

Abstract

In a volatile stock market, an investor’s long-term goal involves determining the most effective buying, selling strategies, and money management techniques in order to maximize profits. This paper introduces a multi-agent trading system to achieve this goal, termed QF-FRL, based on quantum finance and fuzzy reinforcement learning (QF-FRL). The system comprises two agents: (1) The trading agent, constructed using the Deep Deterministic Policy Gradient (DDPG) and Twin Delayed Deep Deterministic Policy Gradient (TD3). This agent employs a Denoising Auto Encoder (DAE) to extract stock representations from historical time series data. The trading agent initially employed the DDPG model, which was subsequently supplanted by the TD3 model. It integrates traditional financial technology indicators, like moving averages, with modern deep reinforcement learning technology to generate buying and selling signals for determining the optimal strategy. (2) The risk control agent, founded on quantum finance and an adaptive network-based fuzzy inference system. This agent merges the QPL indicator with a fuzzy risk control method to ascertain transaction amounts. Furthermore, a genetic algorithm is utilized to optimize the parameters of the fuzzy system, aiming to enhance profits and ensure accuracy in transactions at specific amounts. The experiments in this study involved selecting nine stocks and testing them against seven competing quantitative trading models. Upon comparing the profit rate, trading frequency, Sharpe ratio, and average return of each stock, eight stocks within the QF-FRL system achieved the highest returns and a greater number of transactions. Additionally, the QF-FRL system has also attained the highest average return and the second highest average Sharpe ratio. The results indicate that QF-FRL outperforms competing models, yielding higher profits and being particularly suitable for long-term investment. Moreover, it exhibits more favorable risk-adjusted returns and a notable degree of robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

To learn or not to learn? Evaluating autonomous, adaptive, automated traders in cryptocurrencies financial bubbles

Article Open access 27 July 2022

Applying Deep Reinforcement Learning in Automated Stock Trading

Human Centered AI for Financial Decisions

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data Availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Azhikodan, A.R., Bhat, A.G.K., Jadhav, M.V.: Stock trading bot using deep reinforcement learning. In: Lecture Notes in Networks and Systems, pp. 41–49. Springer, Singapore (2019)
Baostock: http://baostock.com. Accessed 26 Feb 2024
Bekiros, S.D.: Heterogeneous trading strategies with adaptive fuzzy actor-critic reinforcement learning: a behavioral approach. J. Econ. Dyn. Control 34(6), 1153–1170 (2010)
Article MathSciNet MATH Google Scholar
Bengio, Y., Vincent, P., Larochelle, H., Manzagol, P.A.: Extracting and composing robust features with denoising autoencoders. In: Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), Helsinki, Finland, 5–9 June 2008
Carta, S., Ferreira, A., Podda, A.S., Reforgiato Recupero, D., Sanna, A.: Multi-DQN: An ensemble of deep Q-learning agents for stock market forecasting. Expert Syst. Appl. 164, 113820 (2021)
Article Google Scholar
Chang, P.C., Liao, T.W., Lin, J.J., Fan, C.Y.: A dynamic threshold decision system for stock trading signal detection. Appl. Soft Comput. 11(5), 3998–4010 (2011)
Article MATH Google Scholar
Cheng, Y., Xu, B., Lian, Z., Shi, Z., Shi, P.: Adaptive learning control of switched strict-feedback nonlinear systems with dead zone using NN and DOB. IEEE Trans. Neural Netw. Learn. Syst. 34(5), 2503–2512 (2014)
Article MathSciNet MATH Google Scholar
Chourmouziadis, K., Chourmouziadou, D.K., Chatzoglou, P.D.: Embedding four medium-term technical indicators to an intelligent stock trading fuzzy system for predicting: a portfolio management approach. Comput. Econ. 57(4), 1183–1216 (2020)
Article Google Scholar
Dempster, M.A.H., Leemans, V.: An automated FX trading system using adaptive reinforcement learning. Expert Syst. Appl. 30, 543–552 (2006)
Article MATH Google Scholar
Fujimoto, S., Hoof, H.V., Meger, D.: Addressing function approximation error in actor-critic methods. ICML 2018, 1582–1591 (2018)
MATH Google Scholar
Gong, X., Yu, C., Min, L., Ge, Z.: Regret theory-based fuzzy multi-objective portfolio selection model involving DEA cross-efficiency and higher moments. Appl. Soft Comput. 100, Article 106958 (2021)
Gupta, P., Mehlawat, M.K., Khan, A.Z.: Multi-period portfolio optimization using coherent fuzzy numbers in a credibilistic environment. Expert Syst. Appl. 167, Article 114135 (2021)
Huang, S., Miao, Y., Hsiao, Y.: Novel deep reinforcement algorithm with adaptive sampling strategy for continuous portfolio optimization. IEEE Access 9, 77371–77385 (2021)
Article Google Scholar
Huang, Q., Yang, J., Feng, X., Liew, A.W.C., Li, X.: Automated trading point forecasting based on bicluster mining and fuzzy inference. IEEE Trans. Fuzzy Syst. 28(2), 259–272 (2020)
Article MATH Google Scholar
Kirkpatrick, C.D., Dahlquist, J.: Technical Analysis: The Complete Resource for Financial Market Technicians, 2nd edn. Pearson, New Jersey (2011)
MATH Google Scholar
Lee, R.S.T.: Quantum Finance: Intelligent Forecast and Trading Systems. Springer, Singapore (2020)
Book MATH Google Scholar
Li, N., Li, X., Peng, J., Xu, Z.Q.: Stochastic linear quadratic optimal control problem: a reinforcement learning method. IEEE Trans. Autom. Control 67, 5009–5016 (2020)
Article MathSciNet MATH Google Scholar
Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., et al.: Continuous control with deep reinforcement learning. arXiV Preprint (2016). arXiv:1509.02971
Lo, A.W., Mamaysky, H., Wang, J.: Foundations of technical analysis: computational algorithms, statistical inference, and empirical implementation. J. Financ. 55(4), 1705–1770 (2000)
Article MATH Google Scholar
Luo, L., Chen, X.: Integrating piecewise linear representation and weighted support vector machine for stock trading signal prediction. Appl. Soft Comput. 13(2), 806–816 (2013)
Article MATH Google Scholar
Mashayekhi, Z., Omrani, H.: An integrated multi-objective Markowitz–DEA cross-efficiency model with fuzzy returns for portfolio selection problem. Appl. Soft Comput. 38, 1–9 (2016)
Article MATH Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Moody, J.E., Saffell, M.: Learning to trade via direct reinforcement. IEEE Trans. Neural Netw. 12(4), 875–889 (2001)
Article MATH Google Scholar
Neuneier, R.: Enhancing Q-learning for optimal asset allocation. Adv. Neural. Inf. Process. Syst. 10(1), 936–942 (1998)
MATH Google Scholar
de Oliveira, F.A., Nobre, C.N., Zárate, L.E.: Applying Artificial Neural Networks to prediction of stock price and improvement of the directional prediction index—case study of PETR4, Petrobras, Brazil. Expert Syst. Appl. 40(18), 7596–7606 (2013)
Article MATH Google Scholar
Pendharkar, P., Cusatis, P.: Trading financial indices with reinforcement learning agents. Expert Syst. Appl. 103, 1–13 (2018)
Article MATH Google Scholar
Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning representations by back propagating errors. Nature 323(6088), 533–536 (1986)
Article MATH Google Scholar
Silver, D., Lever, G., Heess, N., Degris, T., Riedmiller, M.: Deterministic policy gradient algorithms. In: International Conference on Machine Learning. PMLR (2014)
Skeepers, T., van Zyl, T.L., Paskaramoorthy, A.: MA-FDRNN: Multi-asset fuzzy deep recurrent neural network reinforcement learning for portfolio management. In: 2021 8th International Conference on Soft Computing & Machine Intelligence (ISCMI), pp. 32–37 (2021)
Tsaur, R.C., Chiu, C.L., Huang, Y.Y.: Guaranteed rate of return for excess investment in a fuzzy portfolio analysis. Int. J. Fuzzy Syst. 23, 94–106 (2021)
Article MATH Google Scholar
Wang, C., Sandas, P., Beling, P.: Improving pairs trading strategies via reinforcement learning. In: 2021 International Conference on Applied Artificial Intelligence (ICAPAI) (2021)
Wang, J., Zhang, Y., Tang, K., Wu, J., Xiong, Z.: AlphaStock: a buying-winners-and-selling-losers investment strategy using interpretable deep reinforcement attention networks. CoRR (2019). arXiv:1908.02646
Yang, X.Y., Liu, W.L., Chen, S.D., Zhang, Y.: A multi-period fuzzy mean minimax risk portfolio model with investor’s risk attitude. Soft. Comput. 25, 2949–2963 (2021)
Article MATH Google Scholar
Zadeh, L.A.: Fuzzy sets. Inf. Control 8(3), 338–353 (1965)
Article MATH Google Scholar
Zhang, Y.Y., Li, X., Guo, S.N.: Portfolio selection problems with Markowitz’s mean-variance framework: a review of literature. Fuzzy Optim. Decis. Making 17(2), 125–158 (2018)
Article MathSciNet MATH Google Scholar
Zhang, Y., Liu, W., Yang, X.: An automatic trading system for fuzzy portfolio optimization problem with sell orders. Expert Syst. Appl. 187, 115822 (2022)
Article MATH Google Scholar

Download references

Acknowledgements

This paper was supported in part by the Guangdong Provincial Key Laboratory IRADS (2022B1212010006, R0400001-22), Key Laboratory for Artificial Intelligence and Multi-Model Data Processing of Department of Education of Guangdong Province and Guangdong Province F1 project grant on Curriculum Development and Teaching Enhancement on Quantum Finance course UICR0400050-21CTL.

Author information

Authors and Affiliations

Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, Beijing Normal University-Hong Kong Baptist University United International College, Zhuhai, Guangdong Province, China
Chi Cheng, Bingshen Chen, Ziting Xiao & Raymond S. T. Lee

Authors

Chi Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Bingshen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ziting Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Raymond S. T. Lee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Dr Raymond Lee: Supervision, project administration, funding acquisition, reviewing and editing. Chi Cheng: Conceptualization, methodology, reviewing and editing. Bingshen Chen: Conceptualization, methodology, writing—original draft preparation, formal analysis, data visualization, validation. Ziting Xiao: Investigation, literature review, software design, implementation.

Corresponding author

Correspondence to Raymond S. T. Lee.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Ethical Approval

This article does not contain any studies conducted by any of the authors on human participants or animals.

Informed Consent

Informed consent was obtained from all individual participants included in the study.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Cheng, C., Chen, B., Xiao, Z. et al. Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System. Int. J. Fuzzy Syst. 26, 2224–2245 (2024). https://doi.org/10.1007/s40815-024-01731-1

Download citation

Received: 08 January 2023
Revised: 25 February 2024
Accepted: 13 March 2024
Published: 25 May 2024
Issue Date: October 2024
DOI: https://doi.org/10.1007/s40815-024-01731-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

To learn or not to learn? Evaluating autonomous, adaptive, automated traders in cryptocurrencies financial bubbles

Applying Deep Reinforcement Learning in Automated Stock Trading

Human Centered AI for Financial Decisions

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical Approval

Informed Consent

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

Quantum Finance and Fuzzy Reinforcement Learning-Based Multi-agent Trading System

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

To learn or not to learn? Evaluating autonomous, adaptive, automated traders in cryptocurrencies financial bubbles

Applying Deep Reinforcement Learning in Automated Stock Trading

Human Centered AI for Financial Decisions

Explore related subjects

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Ethical Approval

Informed Consent

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation