Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3184558.3186357acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article
Free access

Disease Tracking in GCC Region Using Arabic Language Tweets

Published: 23 April 2018 Publication History

Abstract

Several prior studies have demonstrated the possibility of tracking the outbreak and spread of diseases using public tweets and other social media platforms. However, almost all such prior studies were restricted to geographically filtered English language tweets only. This study is the first to attempt a similar approach for Arabic language tweets originating from the Gulf Cooperation Council (GCC) countries. We obtained a list of commonly occurring diseases in the region from the Saudi Ministry of Health. We used both the English disease names as well as their Arabic translations to filter the stream of tweets. We acquired old tweets for a period spanning 29 months. All tweets were geographically filtered for the Middle East and the list of disease names in both English and Arabic languages. We observed that only a small fraction of tweets were in English, demonstrating that prior approaches to disease tracking relying on English language features are less effective for this region. We also demonstrate how Arabic language tweets can be used rather effectively to track the spread of some infectious diseases in the region. We verified our approach by demonstrating that a high degree of correlation between the occurrence of MERS-Coronavirus cases and Arabic language tweets on the disease. We also show that infectious diseases generating fewer tweets and non-infectious diseases do not exhibit the same high correlation. We also verify the usefulness of tracking cases using Twitter mentions by comparing against a ground truth data set of MERS-CoV cases obtained from the Saudi Ministry of Health.

References

[1]
Theresa Marie Bernardo, Andrijana Rajic, Ian Young, Katie Robiadek, Mai T Pham, and Julie A Funk. 2013. Scoping review on search queries and social media for disease surveillance: a chronology of innovation. Journal of medical Internet research Vol. 15, 7 (2013), e147.
[2]
John S Brownstein, Clark C Freifeld, and Lawrence C Madoff. 2009. Digital disease detectionharnessing the Web for public health surveillance. New England Journal of Medicine Vol. 360, 21 (2009), 2153--2157.
[3]
Declan Butler. 2013. When Google got flu wrong. Nature Vol. 494, 7436 (2013), 155.
[4]
Rumi Chunara, Jason R Andrews, and John S Brownstein. 2012. Social and news media enable estimation of epidemiological patterns early in the 2010 Haitian cholera outbreak. The American journal of tropical medicine and hygiene Vol. 86, 1 (2012), 39--45.
[5]
Courtney D Corley, Diane J Cook, Armin R Mikler, and Karan P Singh. 2010. Using Web and social media for influenza surveillance. In Advances in Computational Biology. Springer, 559--564.
[6]
Aron Culotta. 2010. Towards detecting influenza epidemics by analyzing Twitter messages Proceedings of the first workshop on social media analytics. ACM, 115--122.
[7]
Center for Disease Control. last accessed: January 15, 2018. FastStats - Infectious Disease. https://www.cdc.gov/nchs/fastats/infectious-disease.htm (. last accessed: January 15, 2018).
[8]
Google. last accessed 18 June 2017. Google Trends. https://trends.google.com/trends/ (. last accessed 18 June 2017).
[9]
Ministry of Health Kingdom of Saudi Arabia. last accessed June 10, 2017 a. Diseases - Diseases List. http://www.moh.gov.sa/en/HealthAwareness/EducationalContent/Diseases/ Pages/ (. last accessed June 10, 2017).
[10]
Ministry of Health Kingdom of Saudi Arabia. last accessed June 10, 2017 b. Health Days 2016 - Breast Cancer Awareness Month. http://www.moh.gov.sa/en/HealthAwareness/healthDay/2016/Pages/HealthDay-2016--10-01--31.aspx (. last accessed June 10, 2017).
[11]
Ministry of Health Kingdom of Saudi Arabia. last accessed November 14, 2017 c. Statistics - Statistics. https://www.moh.gov.sa/en/ccc/pressreleases/pages/default.aspxPageIndex=1 (. last accessed November 14, 2017).
[12]
Microsoft. last accessed 18 June 2017. Bing - Keyword Research. https://www.bing.com/toolbox/keywords (. last accessed 18 June 2017).
[13]
World Health Organization. 2018. WHO | Global Health Observatory (GHO) Data. http://www.who.int/gho/en/ (2018).
[14]
Camille Pelat, Clement Turbelin, Avner Bar-Hen, Antoine Flahault, and Alain-Jacques Valleron. 2009. More diseases tracked by using Google Trends. Emerging infectious diseases Vol. 15, 8 (2009), 1327--8.
[15]
Susan Young Rojahn. 2012. Pakistan Uses Smartphone Data to Head Off Dengue Outbreak. MIT Technology Review (10. 2012).
[16]
Marcel Salathé, Clark C Freifeld, Sumiko R Mekaru, Anna F Tomasulo, and John S Brownstein. 2013. Influenza A (H7N9) and the importance of digital epidemiology. The New England journal of medicine Vol. 369, 5 (2013), 401.
[17]
Charles W. Schmidt. 2012. Using social media to predict and track disease outbreaks. Environmental health perspectives Vol. 120, 1 (2012), A31.
[18]
Antonio Valdivia and Susana Monge-Corella. 2010. Diseases tracked by using Google trends, Spain. Emerging infectious diseases Vol. 16, 1 (2010), 168.

Cited By

View all
  • (2021)Tracking sentiment towards news entities from Arabic news on social mediaFuture Generation Computer Systems10.1016/j.future.2021.01.015118(467-484)Online publication date: May-2021
  • (2020)Sehaa: A Big Data Analytics Tool for Healthcare Symptoms and Diseases Detection Using Twitter, Apache Spark, and Machine LearningApplied Sciences10.3390/app1004139810:4(1398)Online publication date: 19-Feb-2020
  • (2020)Social media based surveillance systems for healthcare using machine learning: A systematic reviewJournal of Biomedical Informatics10.1016/j.jbi.2020.103500108(103500)Online publication date: Aug-2020

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '18: Companion Proceedings of the The Web Conference 2018
April 2018
2023 pages
ISBN:9781450356404
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • IW3C2: International World Wide Web Conference Committee

In-Cooperation

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 23 April 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. arabic
  2. disease tracking
  3. epidemiology
  4. gulf region
  5. twitter

Qualifiers

  • Research-article

Conference

WWW '18
Sponsor:
  • IW3C2
WWW '18: The Web Conference 2018
April 23 - 27, 2018
Lyon, France

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)82
  • Downloads (Last 6 weeks)10
Reflects downloads up to 01 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2021)Tracking sentiment towards news entities from Arabic news on social mediaFuture Generation Computer Systems10.1016/j.future.2021.01.015118(467-484)Online publication date: May-2021
  • (2020)Sehaa: A Big Data Analytics Tool for Healthcare Symptoms and Diseases Detection Using Twitter, Apache Spark, and Machine LearningApplied Sciences10.3390/app1004139810:4(1398)Online publication date: 19-Feb-2020
  • (2020)Social media based surveillance systems for healthcare using machine learning: A systematic reviewJournal of Biomedical Informatics10.1016/j.jbi.2020.103500108(103500)Online publication date: Aug-2020
  • (2019)An Enhanced Algorithm for Matching Arabic Names Entered by Mobile Phones2019 First International Conference of Intelligent Computing and Engineering (ICOICE)10.1109/ICOICE48418.2019.9035148(1-8)Online publication date: Dec-2019

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media