Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

A Text Correlation Algorithm for Stock Market News Event Extraction

  • Conference paper
  • First Online:
Data Science (ICPCSEE 2021)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1452))

Abstract

To extract effective information in massive financial news, this paper proposes a method to calculate the correlation between text and text set by extracting structured events in the stock market news text, and provides more detailed and interpretable information. First, the structured event triplet was extracted from the text set, and the trained word vector was used to represent the event triplet as an event vector. Event vectors were clustered, the cosine distances were calculated for the cluster centers, and the correlation between the text sets was determined by matching. Finally, the event triplets with the highest correlation between the text sets were selected to provide explanation information for the calculation results. Experimental results show that this method effectively measures the correlation between text and text set.

J. Wu and Y. Wang—Contributed equally to this work. This work is supported by: Engineering Research Center of State Financial Security, Ministry of Education, Central University of Finance and Economics, Beijing, 102206, China; Program for Innovation Research in Central University of Finance and Economics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Schumaker, R.P., Chen, H.: Textual analysis of stock market prediction using breaking financial news. ACM Trans. Inf. Syst. (TOIS) 27(2), 12 (2009)

    Article  Google Scholar 

  2. Hao, P.Y., Kung, C.F., Chang, C.Y., et al.: Predicting stock price trends based on financial news articles and using a novel twin support vector machine with fuzzy hyperplane. Appl. Soft Comput. 98, 106806 (2021)

    Google Scholar 

  3. Ding, X., Zhang, Y., Liu, T., et al.: Using structured events to predict stock price movement: an empirical investigation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1415–1425 (2014)

    Google Scholar 

  4. Warner, B.: Predicting the DJIA with news headlines and historic data using hybrid genetic algorithm/support vector regression and BERT. In: Proceedings of the International Conference on Big Data – BigData 2020, pp. 23–37 (2020)

    Google Scholar 

  5. Kilimci, Z.H., Duvar, R.: An efficient word embedding and deep learning based model to forecast the direction of stock exchange market using Twitter and financial news sites: a case of Istanbul stock exchange (BIST 100). IEEE Access 8, 188186–188198 (2020)

    Article  Google Scholar 

  6. Zhang, B., Hu, L.M., Hou, L., et al.: Chinese event detection and representation based on word vector. Pattern Recognit. Artif. Intell. 3, 275–282 (2018)

    Google Scholar 

  7. Ding, X., Zhang, Y., Liu, T., et al.: Deep learning for event-driven stock prediction. In: Proceedings of the International Joint Conferences on Artificial Intelligence Organization (IJCAI), pp. 2327–2333 (2015)

    Google Scholar 

  8. Lei, Y.X., Zhou, K.Y., Liu, Y.C.: Multi-category events driven stock price trends prediction. In: Proceedings of the 5th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS), pp. 497–501 (2018)

    Google Scholar 

  9. Wang, X.L., Xiao, H., Tai, W.P.: Research on text similarity detection system based on hadoop platform. Comput. Technol. Dev. 25(8), 90–93 (2015)

    Google Scholar 

  10. Atoum, I., Otoom, A.: Efficient hybrid semantic text similarity using Wordnet and a corpus. Int. J. Adv. Comput. Sci. Appl. 7(9), 124–130 (2016)

    Google Scholar 

  11. Rao, J., Liu, L., Tay, Y., et al.: Bridging the gap between relevance matching and semantic matching for short text similarity modeling. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 5369–5380 (2019)

    Google Scholar 

  12. Yang, J., Li, Y., Gao, C., et al.: Measuring the short text similarity based on semantic and syntactic information. Future Gener. Comput. Syst. 114, 169–180 (2021)

    Article  Google Scholar 

  13. Farouk, M.: Measuring text similarity based on structure and word embedding. Cogn. Syst. Res. 63, 1–10 (2020)

    Article  Google Scholar 

  14. Li, Z.: Research on Automatic Extraction of Key Words and Text Summaries in Text Mining. Qingdao University of Technology (2018)

    Google Scholar 

  15. Yang, X.: Research on Answer Optimization Method for Question Answering System. Harbin Institute of Technology (2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wu, J., Wang, Y. (2021). A Text Correlation Algorithm for Stock Market News Event Extraction. In: Zeng, J., Qin, P., Jing, W., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2021. Communications in Computer and Information Science, vol 1452. Springer, Singapore. https://doi.org/10.1007/978-981-16-5943-0_5

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-5943-0_5

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-5942-3

  • Online ISBN: 978-981-16-5943-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics