Abstract
To extract effective information in massive financial news, this paper proposes a method to calculate the correlation between text and text set by extracting structured events in the stock market news text, and provides more detailed and interpretable information. First, the structured event triplet was extracted from the text set, and the trained word vector was used to represent the event triplet as an event vector. Event vectors were clustered, the cosine distances were calculated for the cluster centers, and the correlation between the text sets was determined by matching. Finally, the event triplets with the highest correlation between the text sets were selected to provide explanation information for the calculation results. Experimental results show that this method effectively measures the correlation between text and text set.
J. Wu and Y. Wang—Contributed equally to this work. This work is supported by: Engineering Research Center of State Financial Security, Ministry of Education, Central University of Finance and Economics, Beijing, 102206, China; Program for Innovation Research in Central University of Finance and Economics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Schumaker, R.P., Chen, H.: Textual analysis of stock market prediction using breaking financial news. ACM Trans. Inf. Syst. (TOIS) 27(2), 12 (2009)
Hao, P.Y., Kung, C.F., Chang, C.Y., et al.: Predicting stock price trends based on financial news articles and using a novel twin support vector machine with fuzzy hyperplane. Appl. Soft Comput. 98, 106806 (2021)
Ding, X., Zhang, Y., Liu, T., et al.: Using structured events to predict stock price movement: an empirical investigation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1415–1425 (2014)
Warner, B.: Predicting the DJIA with news headlines and historic data using hybrid genetic algorithm/support vector regression and BERT. In: Proceedings of the International Conference on Big Data – BigData 2020, pp. 23–37 (2020)
Kilimci, Z.H., Duvar, R.: An efficient word embedding and deep learning based model to forecast the direction of stock exchange market using Twitter and financial news sites: a case of Istanbul stock exchange (BIST 100). IEEE Access 8, 188186–188198 (2020)
Zhang, B., Hu, L.M., Hou, L., et al.: Chinese event detection and representation based on word vector. Pattern Recognit. Artif. Intell. 3, 275–282 (2018)
Ding, X., Zhang, Y., Liu, T., et al.: Deep learning for event-driven stock prediction. In: Proceedings of the International Joint Conferences on Artificial Intelligence Organization (IJCAI), pp. 2327–2333 (2015)
Lei, Y.X., Zhou, K.Y., Liu, Y.C.: Multi-category events driven stock price trends prediction. In: Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS), pp. 497–501 (2018)
Wang, X.L., Xiao, H., Tai, W.P.: Research on text similarity detection system based on hadoop platform. Comput. Technol. Dev. 25(8), 90–93 (2015)
Atoum, I., Otoom, A.: Efficient hybrid semantic text similarity using Wordnet and a corpus. Int. J. Adv. Comput. Sci. Appl. 7(9), 124–130 (2016)
Rao, J., Liu, L., Tay, Y., et al.: Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5369–5380 (2019)
Yang, J., Li, Y., Gao, C., et al.: Measuring the short text similarity based on semantic and syntactic information. Future Gener. Comput. Syst. 114, 169–180 (2021)
Farouk, M.: Measuring text similarity based on structure and word embedding. Cogn. Syst. Res. 63, 1–10 (2020)
Li, Z.: Research on Automatic Extraction of Key Words and Text Summaries in Text Mining. Qingdao University of Technology (2018)
Yang, X.: Research on Answer Optimization Method for Question Answering System. Harbin Institute of Technology (2017)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wu, J., Wang, Y. (2021). A Text Correlation Algorithm for Stock Market News Event Extraction. In: Zeng, J., Qin, P., Jing, W., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2021. Communications in Computer and Information Science, vol 1452. Springer, Singapore. https://doi.org/10.1007/978-981-16-5943-0_5
Download citation
DOI: https://doi.org/10.1007/978-981-16-5943-0_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-5942-3
Online ISBN: 978-981-16-5943-0
eBook Packages: Computer ScienceComputer Science (R0)