Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3201064.3201080acmconferencesArticle/Chapter ViewAbstractPublication PageswebsciConference Proceedingsconference-collections
research-article

Observing Burstiness in Wikipedia Articles during New Disease Outbreaks

Published: 15 May 2018 Publication History

Abstract

Wikipedia can be conceptualized as an open sociotechnical environment that supports communities of humans and bots that update and contest information in Wikipedia articles. This environment affords a view to community or domain interactions and reactions to salient topics, such as disease outbreaks. But do reactions to different topics vary, and how can we measure them? One widely-used approach when answering these questions is to delineate levels of burstiness-communication flows characterized by repeated bursts instead of a continuous stream-in the construction of a Wikipedia article. A literature review, however, reveals that current burstiness approaches do not fully support efforts to compare Wikipedia community reactions to different articles. Through an empirical analysis of the construction of Wikipedia health-related articles, we both extend and refine burstiness as an analytical technique to understand the community dynamics underlying the construction of Wikipedia articles. We define a method by which we can categorize burstiness as high medium and low. Our empirical results suggest a proposed a model of burstiness.

References

[1]
George Box and Gwilym Jenkins. 1970. Time Series Analysis: Forecasting and Control. San Francisco: Holden-Day.
[2]
Lawrence K. Altman. 1976. Legion Disease Tests Increase Mystery. The New York Times. Retrieved February 14, 2018 from https://www.nytimes.com/1976/09/05/archives/legion-disease-tests-increase-mystery-legion-disease-tests-increase.html
[3]
Albert-László Barabási. 2005. The origin of bursts and heavy tails in human dynamics. Nature 435, 7039: 207--211.
[4]
R. Crane and D. Sornette. 2008. Robust dynamic classes revealed by measuring the response function of a social system.
[5]
Sara Davies and Jeremy Youde. The Politics of Surveillance and Response to Disease Outbreaks: The New Frontier for States and Non-state Actors (Hardback) - Routledge. Routledge Taylor & Francis Group, NY, USA. Retrieved February 14, 2018 from https://www.routledge.com/The-Politics-of-Surveillance-and-Response-to-Disease-Outbreaks-The-New/Davies-Youde/p/book/9781409467182
[6]
Cambridge Dictionary. burst Meaning in the Cambridge English Dictionary. Retrieved February 14, 2018 from https://dictionary.cambridge.org/dictionary/english/burst
[7]
Catherine L. Dumas, Daniel LaManna, Teresa M. Harrison, S. S. Ravi, Loni Hagen, Christopher Kotfila, and Feng Chen. 2015. E-petitioning as Collective Political Action in We the People. Scholar Commons: 1.
[8]
Milad Eftekhar, Nick Koudas, and Yashar Ganjali. 2013. Bursty Subgraphs in Social Networks. In Proceedings of the Sixth ACM International Conference on Web Search and Data Mining (WSDM '13), 213--222.
[9]
Niles Eldredge and Stephen Gould. 1972. Punctuated equilibria: an alternative to phyletic gradualism. In Models in paleobiology. 82--115. Retrieved February 14, 2018 from https://archive.org/details/B-001-004--118
[10]
David F. Prindle. 2012. Importing Concepts from Biology into Political Science: The Case of Punctuated Equilibrium. Policy Studies Journal 40: 21--44.
[11]
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Philip S. Yu, and Hongjun Lu. 2005. Parameter Free Bursty Events Detection in Text Streams. In Proceedings of the 31st International Conference on Very Large Data Bases (VLDB '05), 181--192. Retrieved February 14, 2018 from http://dl.acm.org/citation.cfm?id=1083592.1083616
[12]
Nicholas Generous, Geoffrey Fairchild, Alina Deshpande, Sara Y. Del Valle, and Reid Priedhorsky. 2014. Global disease monitoring and forecasting with Wikipedia. PLoS computational biology 10, 11: e1003892.
[13]
Connie J. G. Gersick. 1991. Revolutionary Change Theories: A Multilevel Exploration of the Punctuated Equilibrium Paradigm. The Academy of Management Review 16, 1: 10--36.
[14]
A. Grabowski, N. Kruszewska, and R. A. Kosi'ski. 2008. Dynamic phenomena and human activity in an artificial society. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics 78, 6 Pt 2: 066110.
[15]
A. A. Grace and B. S. Bunney. 1984. The control of firing pattern in nigral dopamine neurons: burst firing. The Journal of Neuroscience: The Official Journal of the Society for Neuroscience 4, 11: 2877--2890.
[16]
Jheser Guzman and Barbara Poblete. 2013. On-line Relevant Anomaly Detection in the Twitter Stream: An Efficient Bursty Keyword Detection Model. In Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description (ODD '13), 31--39.
[17]
Hai-Bo Hu and Ding-Yi Han. 2008. Empirical analysis of individual popularity and activity on an online music service system. Physica A: Statistical Mechanics and its Applications 387, 23: 5916--5921.
[18]
Bin Jiang. 2013. Head/Tail Breaks: A New Classification Scheme for Data with a Heavy-Tailed Distribution. The Professional Geographer 65, 3: 482--494.
[19]
Bin Jiang and Daniel Z. Sui. 2014. A New Kind of Beauty Out of the Underlying Scaling of Geographic Space. The Professional Geographer 66, 4: 676--686.
[20]
Brian Keegan, Darren Gergle, and Noshir Contractor. 2011. Hot off the Wiki: Dynamics, Practices, and Structures in Wikipedia's Coverage of the TOHoku Catastrophes. In Proceedings of the 7th International Symposium on Wikis and Open Collaboration (WikiSym '11), 105--113.
[21]
Jon Kleinberg. 2003. Bursty and Hierarchical Structure in Streams. Data Mining and Knowledge Discovery 7, 4: 373--397.
[22]
Celine Klemm, Enny Das, and Tilo Hartmann. 2016. Swine flu and hype: A systematic review of media dramatization of the H1N1 influenza pandemic. Journal of Risk Research 19, 1: 1--20.
[23]
Daniel Krewski, Michelle C. Turner, Louise Lemyre, and Jennifer E. C. Lee. 2012. Expert vs. public perception of population health risks in Canada. Journal of Risk Research 15, 6: 601--625.
[24]
Ravi Kumar, Jasmine Novak, Prabhakar Raghavan, and Andrew Tomkins. 2005. On the Bursty Evolution of Blogspace. World Wide Web 8, 2: 159--178.
[25]
Theodoros Lappas, Benjamin Arai, Manolis Platakis, Dimitrios Kotsakos, and Dimitrios Gunopulos. 2009. On Burstiness-aware Search for Document Sequences. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '09), 477--486.
[26]
Theodoros Lappas, Marcos R. Vieira, Dimitrios Gunopulos, and Vassilis J. Tsotras. 2013. STEM: A Spatio-temporal Miner for Bursty Activity. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data (SIGMOD '13), 1021--1024.
[27]
Michaël R. Laurent and Tim J. Vickers. 2009. Seeking Health Information Online: Does Wikipedia Matter? Journal of the American Medical Informatics Association?: JAMIA 16, 4: 471--479.
[28]
Sungjun Lee, Sangjin Lee, Kwanho Kim, and Jonghun Park. 2012. Bursty Event Detection from Text Streams for Disaster Management. In Proceedings of the 21st International Conference on World Wide Web (WWW '12 Companion), 679--682.
[29]
Yue Lin. 2013. A Comparison Study on Natural and Head/tail Breaks Involving Digital Elevation Models. Retrieved February 15, 2018 from http://urn.kb.se/resolve?urn=urn:nbn:se:hig:diva-15609
[30]
Miles Lincoln. 2012. Death and Change Tracking: Wikipedia Edit Bursts. Retrieved from https://www.ocf.berkeley.edu/~milesl/content/deathedits.pdf
[31]
Christoph H. Loch and Bernardo A. Huberman. 1999. A Punctuated-Equilibrium Model of Technology Diffusion. Management Science 45, 2: 160--177.
[32]
Xinjiang Lu, Zhiwen Yu, Bin Guo, Jiafan Zhang, Alvin Chin, Jilei Tian, and Yang Cao. 2014. Trending Words Based Event Detection in Sina Weibo. In Proceedings of the 2014 International Conference on Big Data Science and Computing (BigDataScience '14), 4:1--4:6.
[33]
Helen Margetts, Peter John, Scott Hale, and Taha Yasseri. 2015. Political Turbulence: How Social Media Shape Collective Action. Princeton University Press, Princeton, NJ, USA.
[34]
Ningfang Mi, Giuliano Casale, Ludmila Cherkasova, and Evgenia Smirni. 2008. Burstiness in Multi-tier Applications: Symptoms, Causes, and New Models. In Middleware 2008 (Lecture Notes in Computer Science), 265--286.
[35]
Patricia Niland and Antonia C. Lyons. 2011. Uncertainty in medicine: meanings of menopause and hormone replacement therapy in medical textbooks. Social Science & Medicine (1982) 73, 8: 1238--1245.
[36]
orld Health Organization. World Health Organization, Disease outbreaks. SEARO. Retrieved February 14, 2018 from http://www.searo.who.int/topics/disease_outbreaks/en/
[37]
Mauro Politi and Enrico Scalas. 2008. Fitting the empirical distribution of intertrade durations. Physica A: Statistical Mechanics and its Applications 387, 8: 2025--2034.
[38]
Yla Tausczik, Kate Faasse, James W. Pennebaker, and Keith J. Petrie. 2012. Public anxiety and information seeking following the H1N1 outbreak: blogs, newspaper articles, and Wikipedia visits. Health Communication 27, 2: 179--185.
[39]
Ramine Tinati, Markus Luczak-Roesch, and Wendy Hall. 2016. Finding Structure in Wikipedia Edit Activity: An Information Cascade Approach. In Proceedings of the 25th International Conference Companion on World Wide Web (WWW '16 Companion), 1007--1012.
[40]
Ramine Tinati, Thanassis Tiropanis, and Lesie Carr. 2013. An Approach for Using Wikipedia to Measure the Flow of Trends Across Countries. In Proceedings of the 22Nd International Conference on World Wide Web (WWW '13 Companion), 1373--1378.
[41]
J.L. True, B.D. Jones, and F.R. Baumgartner. 2007. Punctuated Equilibrium Theory: Explaining stability and change in public policymaking. Em Paul A. Sabatier (Org.) (2007): 155--187.
[42]
Michail Vlachos, Christopher Meek, Zografoula Vagena, and Dimitrios Gunopulos. 2004. Identifying Similarities, Periodicities and Bursts for Online Search Queries. In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data (SIGMOD '04), 131--142.
[43]
Morten Warncke-Wang, Vivek Ranjan, Loren Terveen, and Brent Hecht. 2015. Misalignment between supply and demand of quality content in peer production communities. In Proceedings of the 9th International Conference on Web and Social Media, ICWSM 2015. Retrieved February 14, 2018 from https://experts.umn.edu/en/publications/misalignment-between-supply-and-demand-of-quality-content-in-peer
[44]
Xiaoxue Zhao, Takahiro Omi, Nanae Matsuno, and Shigeru Shinomoto. 2010. A non-universal aspect in the temporal occurrence of earthquakes. New Journal of Physics 12, 6: 063010.
[45]
Xin Zhao, Jing Jiang, Jing He, Xiaoming Li, Hongfei Yan, and Dongdong Shan. 2010. Context Modeling for Ranking and Tagging Bursty Features in Text Streams. 1769.
[46]
Tao Zhou, Hoang Anh Tuan Kiet, Beom Jun Kim, Bing-Hong Wang, and Petter Holme. 2007. Role of Activity in Human Dynamics.
[47]
Leptokurtosis Definition from Financial Times Lexicon. Retrieved February 14, 2018 from http://lexicon.ft.com/Term?term=leptokurtosis

Cited By

View all
  • (2022)Why does the president tweet this? Discovering reasons and contexts for politicians’ tweets from news articlesInformation Processing and Management: an International Journal10.1016/j.ipm.2022.10289259:3Online publication date: 1-May-2022
  • (2021)Volunteer contributions to Wikipedia increased during COVID-19 mobility restrictionsScientific Reports10.1038/s41598-021-00789-311:1Online publication date: 2-Nov-2021
  • (2021)Unified approach to retrospective event detection for event- based epidemic intelligenceInternational Journal on Digital Libraries10.1007/s00799-021-00308-9Online publication date: 9-Oct-2021
  • Show More Cited By

Index Terms

  1. Observing Burstiness in Wikipedia Articles during New Disease Outbreaks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WebSci '18: Proceedings of the 10th ACM Conference on Web Science
    May 2018
    399 pages
    ISBN:9781450355636
    DOI:10.1145/3201064
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 15 May 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. burstiness
    2. collective behavior
    3. human-machine network
    4. online community
    5. peer production
    6. wikipedia.

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    WebSci '18
    Sponsor:
    WebSci '18: 10th ACM Conference on Web Science
    May 27 - 30, 2018
    Amsterdam, Netherlands

    Acceptance Rates

    WebSci '18 Paper Acceptance Rate 30 of 113 submissions, 27%;
    Overall Acceptance Rate 245 of 933 submissions, 26%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)7
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 22 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Why does the president tweet this? Discovering reasons and contexts for politicians’ tweets from news articlesInformation Processing and Management: an International Journal10.1016/j.ipm.2022.10289259:3Online publication date: 1-May-2022
    • (2021)Volunteer contributions to Wikipedia increased during COVID-19 mobility restrictionsScientific Reports10.1038/s41598-021-00789-311:1Online publication date: 2-Nov-2021
    • (2021)Unified approach to retrospective event detection for event- based epidemic intelligenceInternational Journal on Digital Libraries10.1007/s00799-021-00308-9Online publication date: 9-Oct-2021
    • (2020)The Role of Wikipedia in providing information on coronavirus to Societies during the COVID-19 PandemicMiddle Black Sea Journal of Health Science10.19127/mbsjohs.7819306:3(316-324)Online publication date: 31-Dec-2020
    • (2020)Bursts of Activity: Temporal Patterns of Help-Seeking and Support in Online Mental Health ForumsProceedings of The Web Conference 202010.1145/3366423.3380056(2906-2912)Online publication date: 20-Apr-2020
    • (2020)Online Workload Burst Detection for Efficient Predictive Autoscaling of ApplicationsIEEE Access10.1109/ACCESS.2020.29882078(73730-73745)Online publication date: 2020
    • (2020)Detection of Editing Bursts and Extraction of Significant Keyphrases from Wikipedia Edit HistoryBig Data Analyses, Services, and Smart Data10.1007/978-981-15-8731-3_4(45-65)Online publication date: 11-Sep-2020
    • (2019)Towards Linked Data for Wikidata Revisions and Twitter Trending HashtagsProceedings of the 21st International Conference on Information Integration and Web-based Applications & Services10.1145/3366030.3366048(166-175)Online publication date: 2-Dec-2019
    • (2019)Efficient, but Effective?Proceedings of the ACM on Human-Computer Interaction10.1145/33592793:CSCW(1-35)Online publication date: 7-Nov-2019
    • (2019)Detection of Bursty and Significant Keyphrases from Wikipedia edit history2019 IEEE International Conference on Big Data and Smart Computing (BigComp)10.1109/BIGCOMP.2019.8679105(1-4)Online publication date: Feb-2019

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media