Abstract
People are spending huge amount of time on social media platforms these days. Through this they get engage in various real-world activities, either for awareness or just for participation. Twitter has 330 million active users, who generates around 6,000 tweet per second. This forms a huge corpus of data that is widely available for analysis, monitoring and research. Different forms of user behavior can be studied with this data. Analysis in this paper shows how simple machine learning and natural language processing techniques can be used to predict user interests based on his/her past tweets. The paper proposes to use a keyword extraction and semantic clustering based approach to do the analysis. The proposed approach has been tested on a dataset of 1,69,000 tweets and has achieved an accuracy of 80%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Twitter dataset of UK geolocated tweets. http://www.followthehashtag.com/datasets/170000-uk-geolocated-tweets-free-twitter-dataset/
Becker, H., Naaman, M., Gravano, L.: Beyond trending topics: real-world event identification on twitter. In: ICWSM 2011, pp. 438–441 (2011)
Dubien, J.L., Warde, W.D.: A mathematical comparison of the members of an infinite family of agglomerative clustering algorithms. Can. J. Stat. 7(1), 29–38 (1979)
Hu, Y., Farnham, S., Talamadupula, K.: Predicting user engagement on twitter with real-world events. In: ICWSM, pp. 168–178 (2015)
Hu, Y., Farnham, S.D., Monroy-Hernández, A.: Whoo.ly: facilitating information seeking for hyperlocal communities using social media. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 3481–3490. ACM (2013)
Hu, Y., John, A., Seligmann, D.D., Wang, F.: What were the tweets about? Topical associations between public events and twitter feeds. In: ICWSM (2012)
Jain, A.K.: Data clustering: 50 years beyond k-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)
Kwak, H., Lee, C., Park, H., Moon, S.: What is twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, pp. 591–600. ACM (2010)
Li, Y., Bandar, Z.A., McLean, D.: An approach for measuring semantic similarity between words using multiple information sources. IEEE Trans. Knowl. Data Eng. 15(4), 871–882 (2003)
MacQueen, J., et al.: Some methods for classification and analysis of multivariate observations (1967)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Advances in Neural Information Processing Systems, pp. 849–856 (2002)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, pp. 851–860. ACM (2010)
Varelas, G., Voutsakis, E., Raftopoulou, P., Petrakis, E.G., Milios, E.E.: Semantic similarity methods in wordnet and their application to information retrieval on the web. In: Proceedings of the 7th Annual ACM International Workshop on Web Information and Data Management, pp. 10–16. ACM (2005)
Vieweg, S., Hughes, A.L., Starbird, K., Palen, L.: Microblogging during two natural hazards events: what twitter may contribute to situational awareness. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 1079–1088. ACM (2010)
Zhao, W.X., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., Li, X.: Topical keyphrase extraction from twitter. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 379–388. Association for Computational Linguistics (2011)
Zhu, G., Iglesias, C.A.: Sematch: semantic similarity framework for knowledge graphs. Knowl. Based Syst. 130, 30–32 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Mittal, A., Arora, G., Tiwari, K., Kaushik, V.D., Gupta, P. (2018). User Engagement Prediction Using Tweets. In: Huang, DS., Jo, KH., Zhang, XL. (eds) Intelligent Computing Theories and Application. ICIC 2018. Lecture Notes in Computer Science(), vol 10955. Springer, Cham. https://doi.org/10.1007/978-3-319-95933-7_88
Download citation
DOI: https://doi.org/10.1007/978-3-319-95933-7_88
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95932-0
Online ISBN: 978-3-319-95933-7
eBook Packages: Computer ScienceComputer Science (R0)