Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3404835.3462930acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Understanding and Mitigating Bias in Online Health Search

Published: 11 July 2021 Publication History

Abstract

Search engines are perceived as a reliable source for general information needs. However, finding the answer to medical questions using search engines can be challenging for an ordinary user. Content can be biased and results may present different opinions. In addition, interpreting medically related content can be difficult for users with no medical background. All of these can lead users to incorrect conclusions regarding health related questions.
In this work we address this problem from two perspectives. First, to gain insight on users' ability to correctly answer medical questions using search engines, we conduct a comprehensive user study. We show that for questions regarding medical treatment effectiveness, participants struggle to find the correct answer and are prone to overestimating treatment effectiveness. We analyze participants' demographic traits according to age and education level and show that this problem persists in all demographic groups. We then propose a semi-automatic machine learning approach to find the correct answer to queries on medical treatment effectiveness as it is viewed by the medical community. The model relies on the opinions presented in medical papers related to the queries, as well as features representing their impact. We show that, compared to human behaviour, our method is less prone to bias. We compare various configurations of our inference model and a baseline method that determines treatment effectiveness based solely on the opinion of medical papers. The results bolster our confidence that our approach can pave the way to developing automatic bias-free tools that can help mediate complex health related content to users.

Supplementary Material

MP4 File (sigir.mp4)
Presentation Video for the SIGIR paper: Understanding and Mitigating Bias in Online Health Search

References

[1]
[n.d.]. Cochrane | Trusted evidence. Informed decisions. Better health. https://www.cochrane.org/. (Accessed on 02/06/2021).
[2]
2019. Majority of Adults Look Online for Health Information | PEW ResearchCenter. https://www.pewresearch.org/fact-tank/2013/02/01/majority-of-adults-look-online-for-health-information/. https://www.pewresearch.org/fact-tank/2013/02/01/majority-of-adults-look-online-for-health-information/
[3]
Asma Ben Abacha and Pierre Zweigenbaum. 2015. MEANS: A medical question-answering system combining NLP techniques and semantic Web technologies. Information processing & management, Vol. 51, 5 (2015), 570--594.
[4]
Lutz Bornmann and Hans-Dieter Daniel. 2007. What do we know about the h index? Journal of the American Society for Information Science and technology, Vol. 58, 9 (2007), 1381--1385.
[5]
US Census Bureau. 2020. Educational Attainment in the United States: 2018. https://www.census.gov/data/tables/2018/demo/education-attainment/cps-detailed-tables.html
[6]
Nitesh V Chawla, Kevin W Bowyer, Lawrence O Hall, and W Philip Kegelmeyer. 2002. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research, Vol. 16 (2002), 321--357.
[7]
Amira Ghenai and Yelena Mejova. 2017. Catching Zika Fever: Application of Crowdsourcing and Machine Learning for Tracking Health Misinformation on Twitter. (jul 2017). arxiv: 1707.03778 http://arxiv.org/abs/1707.03778
[8]
Amira Ghenai and Yelena Mejova. 2018. Fake Cures: User-centric Modeling of Health Misinformation in Social Media., Vol. 2, November (2018). arxiv: 1809.00557 http://arxiv.org/abs/1809.00557
[9]
Patty Kostkova, Vino Mano, Heidi J. Larson, and William S. Schulz. 2016. VAC Medi+ board. In Proceedings of the 6th International Conference on Digital Health Conference - DH '16. ACM Press, New York, New York, USA, 163--164. https://doi.org/10.1145/2896338.2896370
[10]
Minsuk Lee, James Cimino, Hai Ran Zhu, Carl Sable, Vijay Shanker, John Ely, and Hong Yu. 2006. Beyond information retrieval-medical question answering. In AMIA annual symposium proceedings, Vol. 2006. American Medical Informatics Association, 469.
[11]
Jimmy Lin. 2008. PageRank without hyperlinks: Reranking with related article networks for biomedical text retrieval. BMC bioinformatics, Vol. 9, 1 (2008), 270.
[12]
Susan Michie, James Thomas, Johnston, et al. 2017. The Human Behaviour-Change Project: Harnessing the power of artificial intelligence and machine learning for evidence synthesis and interpretation. Implementation Science, Vol. 12 (12 2017). https://doi.org/10.1186/s13012-017-0641--5
[13]
Frances A Pogacar, Amira Ghenai, Mark D Smucker, and Charles LA Clarke. 2017. The Positive and Negative Influence of Search Results on People's Decisions about the Efficacy of Medical Treatments. In Proceedings of the ACM SIGIR International Conference on Theory of Information Retrieval. ACM, 209--216.
[14]
Hinrich Schütze, Christopher D Manning, and Prabhakar Raghavan. 2007. An introduction to information retrieval .Cambridge University Press,.
[15]
Joseph D Tucker, Suzanne Day, Weiming Tang, and Barry Bayus. 2019. Crowdsourcing in medical research: concepts and applications. PeerJ, Vol. 7 (2019), e6762.
[16]
Yuxi Wang, Martin McKee, Aleksandra Torbica, and David Stuckler. 2019. Systematic literature review on the spread of health-related misinformation on social media. Social Science & Medicine, Vol. 240 (2019), 112552.
[17]
Ryen White. 2013. Beliefs and biases in web search. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. 3--12.
[18]
Ryen W White and Ahmed Hassan. 2014. Content bias in online health search. ACM Transactions on the Web (TWEB), Vol. 8, 4 (2014), 1--33.
[19]
Elliot J Yates and Louise C Dixon. 2015. PageRank as a method to rank biomedical literature by importance. Source code for biology and medicine, Vol. 10, 1 (2015), 16.
[20]
Bei Yu, Matt Willis, Peiyuan Sun, and Jun Wang. 2013. Crowdsourcing participatory evaluation of medical pictograms using Amazon Mechanical Turk. Journal of medical Internet research, Vol. 15, 6 (2013), e108.
[21]
Haijun Zhai, Todd Lingren, Louise Deleger, Qi Li, Megan Kaiser, Laura Stoutenborough, and Imre Solti. 2013. Web 2.0-based crowdsourcing for high-quality gold standard development in clinical natural language processing. Journal of medical Internet research, Vol. 15, 4 (2013), e73.

Cited By

View all
  • (2024)Health Insurance Literacy Among Young AdultsProceedings of the Association for Information Science and Technology10.1002/pra2.116861:1(1005-1007)Online publication date: 15-Oct-2024
  • (2023)UNFair: Search Engine Manipulation, Undetectable by Amortized InequityProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594046(830-839)Online publication date: 12-Jun-2023
  • (2023)Not Just Skipping: Understanding the Effect of Sponsored Content on Users' Decision-Making in Online Health SearchProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591744(1056-1065)Online publication date: 19-Jul-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2021
2998 pages
ISBN:9781450380379
DOI:10.1145/3404835
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 July 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. biases
  2. health search
  3. machine learning

Qualifiers

  • Research-article

Funding Sources

  • NSF IIS
  • Israel Innovation Authority

Conference

SIGIR '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)43
  • Downloads (Last 6 weeks)5
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Health Insurance Literacy Among Young AdultsProceedings of the Association for Information Science and Technology10.1002/pra2.116861:1(1005-1007)Online publication date: 15-Oct-2024
  • (2023)UNFair: Search Engine Manipulation, Undetectable by Amortized InequityProceedings of the 2023 ACM Conference on Fairness, Accountability, and Transparency10.1145/3593013.3594046(830-839)Online publication date: 12-Jun-2023
  • (2023)Not Just Skipping: Understanding the Effect of Sponsored Content on Users' Decision-Making in Online Health SearchProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591744(1056-1065)Online publication date: 19-Jul-2023
  • (2023)Behavioral Economics in IRA Behavioral Economics Approach to Interactive Information Retrieval10.1007/978-3-031-23229-9_6(155-180)Online publication date: 18-Feb-2023
  • (2023)Is googling risky? A study on risk perception and experiences of adverse consequences in web searchJournal of the Association for Information Science and Technology10.1002/asi.2480275:5(567-580)Online publication date: 6-Jun-2023
  • (2022)How Misinformation Density Affects Health Information SearchProceedings of the ACM Web Conference 202210.1145/3485447.3512141(2668-2677)Online publication date: 25-Apr-2022
  • (2022)Learning Trustworthy Web Sources to Derive Correct Answers and Reduce Health Misinformation in SearchProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531812(2099-2104)Online publication date: 6-Jul-2022
  • (2022)Monant Medical Misinformation DatasetProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3477495.3531726(2949-2959)Online publication date: 6-Jul-2022
  • (2021)Misbeliefs and Biases in Health-Related SearchesProceedings of the 30th ACM International Conference on Information & Knowledge Management10.1145/3459637.3482141(2894-2899)Online publication date: 26-Oct-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media