DOI: 10.1145/3106426.3106450
research-article

Predicting citations from mainstream news, weblogs and discussion forums

Published: 23 August 2017

Abstract

The growth of alternative digital publishing is widening the breadth of scholarly impact beyond the conventional bibliometric community. Research is thus becoming more reachable both inside and outside academic institutions, and is shared, downloaded and discussed on social media. In this study, we linked the scientific articles found in mainstream news, weblogs and Stack Overflow to Scopus, a citation database of peer-reviewed literature. We then explored how standard graph-based influence metrics can be used to measure the social impact of scientific articles. We also proposed a variant of the Katz centrality metric, called the EgoMet score, to measure the local importance of a scientific article in its ego network. We then evaluated these graph-based influence metrics by using them to predict absolute citation counts. Our prediction models explain 34% of the variance in citations when predicting from blogs and mainstream news, and 44% when predicting from Stack Overflow.
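
The abstract does not define the EgoMet score, so the following is only a minimal sketch of the two ingredients it names: standard Katz centrality, restricted to the ego network of one article. It should not be read as the authors' metric. The toy edge list, the node names, the helper names and the parameter values (alpha = 0.1, beta = 1.0) are all assumptions made for illustration.

```python
from collections import defaultdict


def ego_network(edges, ego):
    """Keep only the directed edges among `ego` and its direct neighbours."""
    members = {ego}
    for src, dst in edges:
        if src == ego:
            members.add(dst)
        elif dst == ego:
            members.add(src)
    return [(s, d) for s, d in edges if s in members and d in members]


def katz_centrality(edges, alpha=0.1, beta=1.0, max_iter=1000, tol=1e-10):
    """Standard Katz centrality by fixed-point iteration.

    Repeats x_i = alpha * sum(x_j for every edge j -> i) + beta
    until no score changes by more than `tol`.
    """
    nodes = sorted({n for edge in edges for n in edge})
    if not nodes:
        return {}
    incoming = defaultdict(list)
    for src, dst in edges:
        incoming[dst].append(src)
    scores = {n: 0.0 for n in nodes}
    for _ in range(max_iter):
        updated = {
            n: alpha * sum(scores[j] for j in incoming[n]) + beta
            for n in nodes
        }
        if max(abs(updated[n] - scores[n]) for n in nodes) < tol:
            return updated
        scores = updated
    raise RuntimeError("Katz iteration did not converge; try a smaller alpha")


# Hypothetical mention graph: an edge (a, b) means "source a links to b".
edges = [
    ("news_1", "paper_A"),
    ("blog_1", "paper_A"),
    ("blog_1", "paper_B"),
    ("blog_2", "blog_1"),
    ("news_2", "paper_B"),
]

global_scores = katz_centrality(edges)
local_scores = katz_centrality(ego_network(edges, "paper_A"))
print(global_scores["paper_A"], local_scores["paper_A"])
```

In this toy graph the ego network of paper_A drops blog_2, which is two hops away, so the local score ignores the extra weight that blog_2 lends to blog_1 in the global computation. Scores of this kind could then serve as features in a regression on citation counts, which is the evaluation step the abstract describes.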

      Published In

      WI '17: Proceedings of the International Conference on Web Intelligence
      August 2017
      1284 pages
      ISBN: 9781450349512
      DOI: 10.1145/3106426
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. altmetrics
      2. centrality
      3. graphs
      4. impact
      5. prediction

      Qualifiers

      • Research-article

      Funding Sources

      • Elsevier
      • Science Foundation of Ireland

      Conference

      WI '17

      Acceptance Rates

      WI '17 Paper Acceptance Rate 118 of 178 submissions, 66%;
      Overall Acceptance Rate 118 of 178 submissions, 66%

      Bibliometrics & Citations

      Article Metrics

      • Downloads (last 12 months): 2
      • Downloads (last 6 weeks): 0
      Reflects downloads up to 11 Jan 2025

      Cited By

      • (2023) Neural age screening on question answering communities. Engineering Applications of Artificial Intelligence 123 (106219). DOI: 10.1016/j.engappai.2023.106219. Online publication date: Aug-2023
      • (2021) What identifies different age cohorts in Yahoo! Answers? Knowledge-Based Systems (107278). DOI: 10.1016/j.knosys.2021.107278. Online publication date: Jul-2021
      • (2021) Early indicators of scientific impact: Predicting citations with altmetrics. Journal of Informetrics 15:2 (101128). DOI: 10.1016/j.joi.2020.101128. Online publication date: May-2021
      • (2019) Heat diffusion approach for scientific impact analysis in social media. Social Network Analysis and Mining 9:1. DOI: 10.1007/s13278-019-0560-3. Online publication date: 25-Apr-2019
      • (2018) A 2-Layered Graph Based Diffusion Approach for Altmetric Analysis. 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) (463-466). DOI: 10.1109/ASONAM.2018.8508290. Online publication date: Aug-2018
