Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article
Free access

Bias on the web

Published: 23 May 2018 Publication History

Abstract

Bias in Web data and use taints the algorithms behind Web-based applications, delivering equally biased results.

Supplemental Material

PDF File
Further Readings

References

[1]
ACM U.S. Public Policy Council. Statement on Algorithmic Transparency and Accountability, ACM, Washington, D.C., Jan. 2017; https://www.acm.org/binaries/content/assets/public-policy/2017_usacm_statement_algorithms.pdf
[2]
Agarwal, D., Chen, B-C., and Elango, P. Explore/exploit schemes for Web content optimization. In Proceedings of the Ninth IEEE International Conference on Data Mining (Miami, FL, Dec. 6--9). IEEE Computer Society Press, 2009.
[3]
Baeza-Yates, R., Castillo, C., and López, V. Characteristics of the Web of Spain. Cybermetrics 9, 1 (2005), 1--41.
[4]
Baeza-Yates, R. and Castillo, C. Relationship between Web links and trade (poster). In Proceedings of the 15th International Conference on the World Wide Web (Edinburgh, U.K., May 23--26). ACM Press, New York, 2006, 927--928.
[5]
Baeza-Yates, R., Castillo, C., and Efthimiadis, E.N. Characterization of national Web domains. ACM Transactions on Internet Technology 7, 2 (May 2007), article 9.
[6]
Baeza-Yates, R., Pereira, Á., and Ziviani, N. Genealogical trees on the Web: A search engine user perspective. In Proceedings of the 17th International Conference on the World Wide Web (Beijing, China, Apr 21--25). ACM Press, New York, 2008, 367--376.
[7]
Baeza-Yates, R. Incremental sampling of query logs. In Proceedings of the 38th ACM SIGIR Conference (Santiago, Chile, Aug. 9--13). ACM Press, New York, 2015, 1093--1096.
[8]
Baeza-Yates, R. and Saez-Trumper, D. Wisdom of the crowd or wisdom of a few? An analysis of users' content generation. In Proceedings of the 26th ACM Conference on Hypertext and Social Media (Guzelyurt, TRNC, Cyprus, Sept. 1--4). ACM Press, New York, 2015, 69--74.
[9]
Bolukbasi, R., Chang, K.W., Zou, J., Saligrama, V., and Kalai, A. Man is to computer programmer as woman is to homemaker? De-biasing word embeddings. In Proceedings of the 30th Conference on Neural Information Processing Systems (Barcelona, Spain, Dec. 5--10). Curran Associates, Inc., Red Hook, NY, 2016, 4349--4357.
[10]
Caliskan, A., Bryson, J.J., and Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356, 6334 (Apr. 2017), 183--186.
[11]
Chapelle, O. and Zhang, Y. A dynamic Bayesian network click model for Web search ranking. In Proceedings of the 18th International Conference on the World Wide Web (Madrid, Spain, Apr. 20--24). ACM Press, New York, 2009, 1--10.
[12]
Dupret, G.E. and Piwowarski, B. A user-browsing model to predict search engine click data from past observations. In Proceedings of the 31st ACM SIGIR Conference (Singapore, July 20--24). ACM Press, New York, 2008, 331--338.
[13]
Fetterly, D., Manasse, M., and Najork, M. 0n the evolution of clusters of near-duplicate webpages. Journal of Web Engineering 2, 4 (Oct. 2003), 228--246.
[14]
Gong, W., Lim, E.-P., and Zhu, F. Characterizing silent users in social media communities. In Proceedings of the Ninth International AAAI Conference on Web and Social Media (Oxford, U.K., May 26--29). AAAI, Fremont, CA, 2015, 140--149.
[15]
Graells-Garrido, E. and Lalmas, M. Balancing diversity to countermeasure geographical centralization in microblogging platforms. In Proceedings of the 25th ACM Conference on Hypertext and Social Media (Santiago, Chile, Sept. 1--4). ACM Press, New York, 2014, 231--236.
[16]
Graells-Garrido, E., Lalmas, M., and Menczer, F. First women, second sex: Gender bias in Wikipedia. In Proceedings of the 26th ACM Conference on Hypertext and Social Media (Guzelyurt, TRNC, Cyprus, Sept. 1--4). ACM Press, New York, 2015, 165--174.
[17]
Lazer, D.M.J. et al. The science of fake news. Science 359, 6380 (Mar. 2018), 1094--1096.
[18]
Mediative. The Evolution of Google's Search Results Pages & Effects on User Behaviour, White paper, 2014; http://www.mediative.com/SERP
[19]
Mercer, A., Deane, C., and McGeeney, K. Why 2016 Election Polls Missed Their Mark, Pew Research Center, Washington, D.C., Nov 2016; http://www.pewresearch.org/fact-tank/2016/11/09/why-2016-election-polls-missed-their-mark/
[20]
Olteanu, A., Castillo, C., Diaz, F., and Kiciman, E. Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries, SSRN, Rochester, NY, Dec. 20, 2016; https://ssrn.com/abstract=2886526
[21]
Pariser, E. The Filter Bubble: How the New Personalized Web Is Changing What We Read and How We Think, Penguin, London, U.K., 2011.
[22]
Saez-Trumper, D., Castillo, C., and Lalmas, M. Social media news communities: Gatekeeping, coverage, and statement bias. In Proceedings of the ACM International Conference on Information and Knowledge Management (San Francisco, CA, Oct. 27-Nov. 1). ACM Press, New York, 2013, 1679--1684.
[23]
Silberzahn, R. and Uhlmann, E.L. Crowdsourced research: Many hands make tight work. Nature 526, 7572 (Oct. 2015), 189--191; https://psyarxiv.com/qkwst/
[24]
Smith, M., Patil, D.J., and Muñoz, C. Big Data: A Report on Algorithmic Systems, Opportunity, and Civil Rights. Executive Office of the President, Washington, D.C., 2016; https://obamawhitehouse.archives.gov/sites/default/files/microsites/ostp/2016_0504_data_discrimination.pdf
[25]
Wagner, C., Garcia, D., Jadidi, M., and Strohmaier, M. It's a man's Wikipedia? Assessing gender inequality in an online encyclopedia. In Proceedings of the Ninth International AAAI Conference on Web and Social Media (Oxford, U.K., May 26--29). AAAI, Fremont, CA, 2015, 454--463.
[26]
Wang, T. and Wang, D. Why Amazon's ratings might mislead you: The story of herding effects. Big Data 2, 4 (Dec. 2014), 196--204.
[27]
White, R. Beliefs and biases in Web search. In Proceedings of the 36th ACM SIGIR Conference (Dublin, Ireland, July 28-Aug. 1). ACM Press, New York, 2013, 3--12.
[28]
Wu, S., Hofman, J.M., Mason, W.A., and Watts, D.J. Who says what to whom on Twitter. In Proceedings of the 20th International Conference on the World Wide Web (Hyderabad, India, Mar. 28--Apr. 1). ACM Press, New York, 2011, 705--714.
[29]
Zipf, G.K. Human Behavior and the Principle of Least Effort, Addison-Wesley Press, Cambridge, MA, 1949.

Cited By

View all
  • (2025)Properties of Group Fairness Measures for RankingsACM Transactions on Social Computing10.1145/36748838:1-2(1-45)Online publication date: 17-Jan-2025
  • (2025)Toward Equitable Progress: A Review of Equity Assessment and Perspectives in Emerging Technologies and Mobility Innovations in TransportationJournal of Transportation Engineering, Part A: Systems10.1061/JTEPBS.TEENG-8675151:1Online publication date: Jan-2025
  • (2025)Fairness for machine learning software in educationJournal of Systems and Software10.1016/j.jss.2024.112244219:COnline publication date: 1-Jan-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Communications of the ACM
Communications of the ACM  Volume 61, Issue 6
June 2018
97 pages
ISSN:0001-0782
EISSN:1557-7317
DOI:10.1145/3229066
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 May 2018
Published in CACM Volume 61, Issue 6

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article
  • Popular
  • Refereed

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3,686
  • Downloads (Last 6 weeks)167
Reflects downloads up to 26 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Properties of Group Fairness Measures for RankingsACM Transactions on Social Computing10.1145/36748838:1-2(1-45)Online publication date: 17-Jan-2025
  • (2025)Toward Equitable Progress: A Review of Equity Assessment and Perspectives in Emerging Technologies and Mobility Innovations in TransportationJournal of Transportation Engineering, Part A: Systems10.1061/JTEPBS.TEENG-8675151:1Online publication date: Jan-2025
  • (2025)Fairness for machine learning software in educationJournal of Systems and Software10.1016/j.jss.2024.112244219:COnline publication date: 1-Jan-2025
  • (2025)Enhancing recommender systems with provider fairness through preference distribution-awarenessInternational Journal of Information Management Data Insights10.1016/j.jjimei.2024.1003115:1(100311)Online publication date: Jun-2025
  • (2025)“What are they not telling me?” Learning machine learning: Understanding the challenges for novicesInternational Journal of Human-Computer Studies10.1016/j.ijhcs.2024.103438196(103438)Online publication date: Mar-2025
  • (2025)Artificial intelligence governance: Understanding how public organizations implement itGovernment Information Quarterly10.1016/j.giq.2024.10200342:1(102003)Online publication date: Mar-2025
  • (2025)Preference eigensystems for fair rankingExpert Systems with Applications10.1016/j.eswa.2024.126324269(126324)Online publication date: Apr-2025
  • (2024)Navigating Information Overload on Social Media: Opportunities and Misadventures for Clinicians and ProfessionalsNewborn10.5005/jp-journals-11002-01083:4(292-296)Online publication date: 20-Dec-2024
  • (2024)A Multifaceted Approach at Discerning Redditors Feelings Towards ChatGPTEAI Endorsed Transactions on Internet of Things10.4108/eetiot.644710Online publication date: 28-Jun-2024
  • (2024)Forms of Bias in Online Reviews and Their Implications for Management of Customer KnowledgeConvergence of Digitalization, Innovation, and Sustainable Development in Business10.4018/979-8-3693-0798-4.ch010(206-236)Online publication date: 23-Feb-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Digital Edition

View this article in digital edition.

Digital Edition

Magazine Site

View this article on the magazine site (external)

Magazine Site

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media