Importance of Event Binary Features in Stock Price Prediction
Abstract
:1. Introduction
- We developed three stock price prediction models to determine the importance of input feature selection using input features with different characteristics. We organized models by selecting and calculating input features that are expected to be able to describe the reason for fluctuation.
- The first model uses 315 input features based on technical analysis. The second model uses 250 event binary features to represent the moment of stock price fluctuation. The third model is based on 13 existing well-known technical analysis indicators.
- The 315 input features used in Model 1 were developed as follows. After analyzing the data distribution of the 715 novel input features presented in a previous study [7], we have proposed 315 meaningful input features by eliminating all noise-prone values that affect prediction performance. This model ultimately yields profits over KOSDAQ.
- The 250 event binary input features used in Model 2 were developed as follows. We defined the moment of stock price change as an event. An event implies a turning point between the lines. If you look closely at the stock chart, you can see that the price changes depend on the relationships among the lines. Based on this, we presented 250 new event input features and developed a model that yields a higher return than the KOSPI/KODAQ. This model shows the highest performance among the three models proposed in this paper and yields the most profit.
- Finally, our stock price prediction model uses a very simple neural network structure. However, our model yielded higher profits than the Korean stock index KOSPI/KOSDAQ when using binary event features and meaningful input features. We found that event binary features and noise-prone values play a very important role in predicting stock prices.
2. Related Works
2.1. Recent Stock Price Prediction Studies Using Various Prediction Models
2.2. Related Studies with Approaches Similar to Ours
2.3. Our Previous Studies on Stock Price Prediction
3. Design and Development of the Stock Price Prediction Model
3.1. Deep Neural Network (DNN)
3.2. Model 1: 315 Novel Input Features Model
3.3. Model 2: Event Binary Features Model
3.4. Model 3: Existing Well-Known Technical Analysis Indicator Model
3.5. Target Vector
3.6. Feature Normalization
4. Experimental Results
5. Performance Evaluation through Fund Simulation
6. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Phoon, K.; Koh, F. Robo-advisors and wealth management. J. Altern. Investig. 2017, 20, 79–94. [Google Scholar] [CrossRef]
- Kaya, O.; Schildbach, J.; Schneider, S. Robo-Advice–A True Innovation in Asset Management. Deutsche Bank Research. Available online: https://www.dbresearch.com/PROD/DBR_INTERNET_EN-PROD/PROD0000000000449010/Robo-advice_-_a_true_innovation_in_asset_managemen.pdf (accessed on 21 February 2020).
- Lim, H.; Ryu, D.; Yang, H. Economic analysis of robo-advisor industries: A case study. Korean Acad. Soc. Bus. Admin. 2018, 47, 725–749. [Google Scholar]
- Trading Economics. Available online: https://tradingeconomics.com/united-states/interest-rate (accessed on 13 June 2019).
- Blenman, L.P. Market liberalization and trading in Korea. Int. J. Bank. Financ. 2020, 7, 37–58. [Google Scholar]
- Bollerslev, T.; Wright, J.H. High-frequency data, frequency domain inference, and volatility forecasting. Rev. Econ. Stat. 2001, 83, 596–602. [Google Scholar] [CrossRef]
- Song, Y.; Lee, J.W.; Lee, J.W. A study on novel filtering and relationship between input-features and target-vectors in a deep learning model for stock price prediction. Appl. Intell. 2019, 49, 897–911. [Google Scholar] [CrossRef]
- Arora, N. Financial Analysis: Stock Market Prediction Using Deep Learning Algorithms. In Proceedings of the International Conference on Sustainable Computing in Science, Technology and Management (SUSCOM), Jaipur, India, 26–28 February 2019. [Google Scholar]
- Sim, H.S.; Kim, H.I.; Ahn, J.J. Is deep learning for image recognition applicable to stock market prediction? Complexity 2019, 2019, 1–10. [Google Scholar] [CrossRef]
- Bollen, J.; Mao, H.; Zeng, X. Twitter mood predicts the stock market. J. Comput. Sci. 2011, 2, 1–8. [Google Scholar] [CrossRef] [Green Version]
- Schumaker, R.P.; Chen, H. Textual analysis of stock market prediction using breaking financial news: The azfin text system. ACM Trans. Inf. Syst. 2009, 27. [Google Scholar] [CrossRef]
- Naeini, M.P.; Taremian, H.; Hashemi, H.B. Stock Market Value Prediction Using Neural Networks. In Proceedings of the 2010 International Conference on Computer Information Systems and Industrial Management Applications (CISIM), Krackow, Poland, 8–10 October 2010; pp. 132–136. [Google Scholar]
- JuHyok, U.; Lu, P.; Kim, C.; Ryu, U.; Pak, K. A new LSTM based reversal point prediction method using upward/downward reversal point feature sets. Chaos Solitons Fractals 2020, 132, 109559. [Google Scholar]
- Rajput, V.; Bobde, S. Stock market forecasting techniques: Literature survey. Int. J. Comput. Sci. Mob. Comput. 2016, 5, 500–506. [Google Scholar]
- Song, Y.; Lee, J.W. Implementation of Chart Type Filtering for Stock Price Prediction. In Proceedings of the KCC 2018, Seoul, Korea, 20–22 June 2018; pp. 680–682. [Google Scholar]
- Rosenblatt, F.X. Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms; Spartan Books: Washington, DC, USA, 1961. [Google Scholar]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Ahmar, A. Sutte Indicator: A Technical Indicator in Stock Market. Int. J. Econ. Financ. Issues 2017, 7, 223–226. [Google Scholar]
- Tkacz, G. Neural network forecasting of Canadian GDP growth. Int. J. Forecast. 2001, 17, 57–69. [Google Scholar] [CrossRef]
- Lee, J.W.; Kim, S.Y.; Kim, S.D.; Lee, J.W.; Chae, J.S. A two-phase stock trading system based on pattern matching and automatic rule induction. Korean Inf. Process. Soc. 2003, 10, 257–264. [Google Scholar]
- Wong, W.K.; Manzur, M.; Chew, B.K. How rewarding is technical analysis? Evidence from Singapore stock market. Appl. Financ. Econ. 2003, 13, 543–551. [Google Scholar] [CrossRef]
- Rosillo, R.; De la Fuente, D.; Brugos, J.A.L. Technical analysis and the Spanish stock exchange: Testing the RSI, MACD, momentum and stochastic rules using Spanish market companies. Appl. Econ. 2013, 45, 1541–1550. [Google Scholar] [CrossRef]
- Lawrance, A.J.; Lewis, P.A.W. An exponential moving-average sequence and point process (EMA1). J. Appl. Probab. 1977, 14, 98–113. [Google Scholar] [CrossRef]
- Yu, L.; Wang, S.; Lai, K.K. Mining stock market tendency using GA-based support vector machines. In International Workshop on Internet and Network Economics; Springer: Berlin/Heidelberg, Germany, 2015; pp. 336–345. [Google Scholar]
- Bollinger Band Definition. Available online: https://www.investopedia.com/terms/b/bollingerbands.asp (accessed on 20 February 2020).
- Bollinger, J. Using Bollinger bands. Stock. Commod. 1992, 10, 47–51. [Google Scholar]
- Ferri, C.; Hernández-Orallo, J.; Salido, M.A. Volume under the ROC surface for multi-class problems. In European Conference on Machine Learning; Springer: Berlin/Heidelberg, Germany, 2003; pp. 108–120. [Google Scholar]
- Auer, P.; Jaksch, T.; Ortner, R. Near-optimal regret bounds for reinforcement learning. J. Mach. Learn. Res. 2010, 11, 1563–1600. [Google Scholar]
- Watkins, C.J.; Dayan, P. Q-learning. Mach. Learn. 1992, 8, 279–292. [Google Scholar] [CrossRef]
- Iglewicz, B. Robust Scale Estimators and Confidence Intervals for Location; Wiley: New York, NY, USA, 1983; p. 417. [Google Scholar]
- Jin, J.; Li, M.; Jin, L. Data normalization to accelerate training for linear neural net to predict tropical cyclone tracks. Math. Probl. Eng. 2015, 931629. [Google Scholar] [CrossRef] [Green Version]
- Aryal, D.R.; Wang, Y.W. Neural network forecasting of the production level of Chinese construction industry. J. Comp. Int. Manag. 2003, 6, 45–64. [Google Scholar]
- Dilli, R.; Wang, Y.W. An application of the ARIMA model for forecasting the production level of construction industry. J. Harbin Inst. Technol. 2002, 9, 39–45. [Google Scholar]
- Lee, J.W. Integrated multiple simulation for optimizing performance of stock trading systems based on neural networks. KIPS Trans. B 2007, 14, 127–134. [Google Scholar] [CrossRef] [Green Version]
Description of 715 Features | Number of Features |
---|---|
1. Gradient of moving average line | 10 |
2. Sum of long-term moving average gradient | 2 |
3. Gradient of 5-days/20-days volume moving average | 3 |
4. Change in volume moving average | 60 |
5. Difference between yesterday’s and today’s moving average | 40 |
6. The change of long and short-term moving average and the point of golden/dead cross | 220 |
7. The change of long and short-term volume moving average and the point of golden/dead cross | 140 |
8. Disparity of moving average | 10 |
9. Disparity of 60-days/120-days volume moving average | 30 |
10. Disparity of 20-days/60-days volume moving average | 30 |
11. Disparity of 5-days/20-days volume moving average | 30 |
12. Change in closing price on a specific day | 110 |
13. Other simple price indicators | 30 |
Histogram of Data Distribution | ||||
---|---|---|---|---|
Valid Distribution of Feature Values | ||||
Invalid Distribution of Feature Values |
Feature Descriptions | Formula | Number of Features |
---|---|---|
Gradient of the moving average/volume moving average of k trading day | 13 | |
Sum of gradient of 60 and 120 days during past 40 days | 2 | |
Rate of change of volume moving average of the k days | 60 | |
Rate of change of closing price | 110 | |
Disparity from moving average of k days | 10 | |
Disparity of k days and n days’ volume moving average | 120 |
Price Feature Description | Formula | Number of Features |
---|---|---|
Gradient of the moving average of k trading days | , | 5 |
Sum of gradient of 10 trading days during past i days | , | 2 |
Sum of gradient of 20 trading days during past i days | , | 3 |
Sum of gradient of 60 trading days during past i days | , | 4 |
Sum of gradient of 120 trading days during past i days | , | 3 |
Rate of change of closing price in a specific j days. | 1 | |
Gradient of the volume moving average of 5 trading days | 1 | |
Disparity of the volume moving average of k trading days | 4 | |
Rate of 120 day volume moving average and j days ago volume moving average | 3 | |
Rate of 120 day volume moving average and j days ago volume moving average | 4 | |
Disparity from MA of k days of stock price | 5 | |
Rate of change of volume in today and yesterday | 2 |
Binary Features Description | Formula | Number of Features |
---|---|---|
Turn up/down point between moving average lines | 30 | |
Golden cross point between moving average lines | 45 | |
Dead cross point between moving average lines | 45 | |
, | ||
Support and resist point for moving average lines | 12 | |
Upward/downward penetration point between moving average lines | 12 | |
Golden cross point between volume moving average lines | 25 | |
Dead cross point between volume moving average lines | 25 | |
Relationship of volume moving average lines arrangement | 6 | |
Relationship between volume moving average of 5 trading days and volume moving average of k trading days | 4 |
Model 1 | Model 2 | Model 3 | |
---|---|---|---|
Number of Input Features | 315 | 250 | 13 |
Configuration of Input Features | Novel input feature using price information | Simple price feature + Binary chart feature | Only technical indicator |
Activation Function | Tahn | Tahn | Tahn |
Dropout | 0.5 | 0.5 | 0.5 |
Optimizer | RMSprop | RMSprop | RMSprop |
MSE | 0.1105 | 0.0795 | 0.1171 |
MAE | 0.2447 | 0.2162 | 0.2498 |
Fund Simulation | O | O | X |
Forecasting Model | MSE | MAE | Comparison of Prediction Accuracy with Model 1% (MSE) | Comparison of Prediction Accuracy with Model 2% (MSE) | Comparison of Prediction Accuracy with Model 3% (MSE) |
---|---|---|---|---|---|
Model 1 | 0.1105 | 0.2447 | X | 28.05 | −5.97 |
Model 2 | 0.0795 | 0.2162 | −38.99 | X | −47.29 |
Model 3 | 0.1171 | 0.2498 | 5.63 | 32.10 | X |
Optimal Trading Policy for Each Model | ||||||||
---|---|---|---|---|---|---|---|---|
Buy Discount Rate (%) | Target Profit Rate (%) | Stop-Loss Rate (%) | Profit Rate (%) | Maximum Holding Period (Day) | Profit Rate per Trade (%) | Profit Rate per Daily Trade (%) | Hit Ratio (%) | |
Model 1 | +0 | +24 | −12 | 52.6 | 22 | 1.86 | 0.11 | 27.3 |
Model 2 | −6 | +22 | −12 | 68.5 | 21 | 7.61 | 0.46 | 38.2 |
Model 3 | X | X | X | X | X | X | X | X |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Song, Y.; Lee, J. Importance of Event Binary Features in Stock Price Prediction. Appl. Sci. 2020, 10, 1597. https://doi.org/10.3390/app10051597
Song Y, Lee J. Importance of Event Binary Features in Stock Price Prediction. Applied Sciences. 2020; 10(5):1597. https://doi.org/10.3390/app10051597
Chicago/Turabian StyleSong, Yoojeong, and Jongwoo Lee. 2020. "Importance of Event Binary Features in Stock Price Prediction" Applied Sciences 10, no. 5: 1597. https://doi.org/10.3390/app10051597