Abstract
In the prediction of turning points (TPs) of time series, the improved model of integrating piecewise linear representation and weighted support vector machine (IPLR-WSVM) has achieved good performance. However, due to the single data source and the limitation of algorithm, IPLR-WSVM has encountered challenges in profitability. In this paper, a model fusion method based on multi-source heterogeneous data and different learning algorithms is proposed for the prediction of TPs (MF-MSHD). Multi-source heterogeneous data include weighted unstructured and structured information with different granularities. RF, WSVM, BPNN, GBDT, and LSTM are selected to be the learning algorithms. The differences among meta-models are constructed by different inputs and algorithms as much as possible, and a model fusion rule is designed to determine the final TPs. Moreover, the TPs are generated based on the characteristics of individual stock. For sentiment analysis, a more accurate sentiment dictionary of stock market comments is established. Specifically, the fine-grained data is introduced to jointly determine the accurate trading moment. The prediction level of the proposal improves the accuracy and profitability, and also outperforms the composite indexes. Experimental results show that the profit rate of randomly selected stocks in MF-MSHD reaches 0.5172, while the highest value is 0.2841 in single meta-model and 0.0992 in buy and hold strategy, respectively. The other indicators including the accuracy are also modified. Compared with the increases of 0.1648, 0.4051, and 0.3397 in Shanghai Composite Index, Shenzhen Composite Index, and CSI 300 Index, MF-MSHD shows higher profitability in stock trading signal prediction.














Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
All the datasets used are publicly available.
References
Abu MYS, Atiya AF (1996) Introduction to financial forecasting. Appl Intell 6(3):205–213
Baidu (2020) Baidu nlp. https://aip.baidubce.com/rpc/2.0/nlp/v1/sentiment_classify
Ballings M, Poel DVD, Hespeels N et al (2015) Evaluating multiple classifiers for stock price direction prediction. Expert Syst Appl 42(20):7046–7056
Barra S, Carta SM, Corriga A et al (2020) Deep learning and time series-to-image encoding for financial forecasting. IEEE-CAA J Automatica Sinica 7(3):683–692
Bollen J, Mao H, Zeng X (2011) Twitter mood predicts the stock market. J Comput Sci 2(1):1–8
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Cao J, Wang J (2019) Exploration of stock index change prediction model based on the combination of principal component analysis and artificial neural network. Soft Comput
Chandar SK (2020) Grey wolf optimization-Elman neural network model for stock price prediction. Soft Comput
Chang C, Lin C (2011) Libsvm: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):1–27
Chang P, Fan C, Liu C (2009) Integrating a piecewise linear representation method and a neural network model for stock trading points prediction. IEEE Trans Syst Man Cybern Part C 39(1):80–92
Dang LM, Sadeghi-Niaraki A, Huynh HD et al (2018) Deep learning approach for short-term stock trends prediction based on two-stream gated recurrent unit network. IEEE Access 6:392–404
Dong Z, Dong Q (2003) Hownet - a hybrid language and knowledge resource. In: 2003 international conference on natural language processing and knowledge engineering proceedings, pp 820–824
Edwards RD, Magee J, Bassetti W (2018) Technical analysis of stock trends, 11th edn. CRC Press, London
Friedman JH (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29(5):1189–1232
Gers FA, Schmidhuber J, Cummins F (2000) Learning to forget: continual prediction with LSTM. Neural Comput 12(10):2451–71
Gu Q, Chang Y, Xiong N et al (2021) Forecasting nickel futures price based on the empirical wavelet transform and gradient boosting decision trees. Appl Soft Comput 109(107):472
Hao PY, Kung CF, Chang CY et al (2021) Predicting stock price trends based on financial news articles and using a novel twin support vector machine with fuzzy hyperplane. Appl Soft Comput 98(106):806
Hao Y, Gao Q (2020) Predicting the trend of stock market index using the hybrid neural network based on multiple time scale feature learning. Appl Sci Basel 10(11):3961
Ismail MS, Md Noorani MS, Ismail M et al (2020) Predicting next day direction of stock price movement using machine learning methods with persistent homology: evidence from Kuala Lumpur stock exchange. Appl Soft Comput 93(106):422
Keogh E, Chu S, Hart D, et al (2001) An online algorithm for segmenting time series. In: proceedings 2001 IEEE international conference on data mining, pp 289–296
Liu G, Wang X (2019) A new metric for individual stock trend prediction. Eng Appl Artif Intell 82:1–12
Long J, Chen Z, He W et al (2020) An integrated framework of deep learning and knowledge graph for prediction of stock price trend: an application in Chinese stock exchange market. Appl Soft Comput 91(106):205
Luo L, Chen X (2013) Integrating piecewise linear representation and weighted support vector machine for stock trading signal prediction. Appl Soft Comput 13(2):806–816
Luo L, You S, Xu Y et al (2017) Improving the integration of piece wise linear representation and weighted support vector machine for stock trading signal prediction. Appl Soft Comput 56:199–216
Mallqui DCA, Fernandes RAS (2019) Predicting the direction, maximum, minimum and closing prices of daily bitcoin exchange rate using machine learning techniques. Appl Soft Comput 75:596–606
Mikolov T, Chen K, Corrado G, et al (2013a) Efficient estimation of word representations in vector space. In: international conference on learning representations, Scottsdale, pp 1–12
Mikolov T, Sutskever I, Chen K et al (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst 26:3111–3119
Patel J, Shah S, Thakkar P et al (2015) Predicting stock and stock price index movement using trend deterministic data preparation and machine learning techniques. Expert Syst Appl 42(1):259–268
Qi F, Yang C, Liu Z, et al (2019) Openhownet: an open sememe-based lexical knowledge base. arXiv: 1901.09957
Renault T (2017) Intraday online investor sentiment and return patterns in the U.S. stock market. J Bank Financ 84:25–40
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536
Song Y, Shi S, Li J, et al (2018) Directional skip-gram: explicitly distinguishing left and right context for word embeddings. In: proceedings of the 2018 conference of the North American chapter of the association for computational Linguistics: human language technologies, Vol. 2 (Short Papers). New Orleans, Louisiana, pp 175–180
Tang H, Dong P, Shi Y (2019) A new approach of integrating piecewise linear representation and weighted support vector machine for forecasting stock turning points. Appl Soft Comput 78:685–696
Thakkar A, Patel D, Shah P (2021) Pearson correlation coefficient-based performance enhancement of vanilla neural network for stock trend prediction. Neural Comput Appl 32(2):1–16
Wu JL, Huang MT, Yang CS, et al (2021a) Sentiment analysis of stock markets using a novel dimensional valence-arousal approach. Soft Comput
Wu JMT, Li Z, Herencsar N et al (2021) A graph-based CNN-ISTM stock price prediction algorithm with leading indicators. Multimed Syst. https://doi.org/10.1007/s00530-021-00758-w
Wu JMT, Li Z, Srivastava G et al (2021) A graph-based convolutional neural network stock price prediction with leading indicators. Softw Pract Exp 51(3):628–644. https://doi.org/10.1002/spe.2915
Wu ME, Syu JH, Lin JCW et al (2022) Effective fuzzy system for qualifying the characteristics of stocks by random trading. IEEE Trans Fuzzy Syst 30(8):3152–3165. https://doi.org/10.1109/TFUZZ.2021.3105192
Xu Y, Yang C, Peng S et al (2020) A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning. Appl Intell 50(11):3852–3867
Yadav K, Yadav M, Saini S (2022) Stock values predictions using deep learning based hybrid models. CAAI Trans Intell Technol 7(1):107–116. https://doi.org/10.1049/cit2.12052
Yang Y, Yang Y, Zhou W (2021) Research on a hybrid prediction model for stock price based on long short-term memory and variational mode decomposition. Soft Comput pp 1–19
Yao S (2019) A prediction of stock trading signal based on multi-model fusion and emotion of financial text. Master’s thesis, Xiamen University
Yao S, Luo L, Peng H (2018) High-frequency stock trend forecast using LSTM model. In: The 13th international conference on computer science & education, Colombo, pp 293–296
You S (2017) A prediction of stock trading signals based on interval turning points. Master’s thesis, Xiamen University
Yu P, Yan X (2020) Stock price prediction based on deep neural networks. Neural Comput Appl 32(6):1609–1628
Funding
This work is supported by the National Talents Foundation [grant number WQ20141100]; and the Beijing Municipal Natural Science Foundation [grant numbers L191020, 3192028].
Author information
Authors and Affiliations
Contributions
Chen conceptualized and designed the experiments; Chen and Hirota analyzed and interpreted the data; Chen drafted the manuscript; Hirota, Dai, and Jia revised the manuscript critically for important intellectual content; Chen, Hirota, Dai, and Jia gave approval of the version of the manuscript to be published.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed consent
All authors checked the final draft and agreed on the submission.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, X., Hirota, K., Dai, Y. et al. A model fusion method based on multi-source heterogeneous data for stock trading signal prediction. Soft Comput 27, 6587–6611 (2023). https://doi.org/10.1007/s00500-022-07714-4
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-022-07714-4