Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3627043.3659558acmconferencesArticle/Chapter ViewAbstractPublication PagesumapConference Proceedingsconference-collections
research-article
Open access

User Perception of Fairness-Calibrated Recommendations

Published: 22 June 2024 Publication History

Abstract

The research community has become increasingly aware of possible undesired effects of algorithmic biases in recommender systems. One common bias in such systems is to over-proportionally expose certain items to users, which may ultimately result in a system that is considered unfair to individual stakeholders. From a technical perspective, calibration approaches are commonly adopted in such situations to ensure that the individual user’s preferences are better taken into account, thereby also leading to a more balanced exposure of items overall. Given the known limitations of today’s predominant offline evaluation approaches, our work aims to contribute to a better understanding of the users’ perception of the fairness and quality of recommendations when these are served in a calibrated way. Therefore, we conducted an online user study (N=500) in which we exposed the treatment groups with recommendations calibrated for fairness in terms of two different item characteristics. Our results show that calibration can indeed be effective in guiding the users’ choices towards the “fairness items” without negatively impacting the overall quality perception of the system. We however also found that calibration did not measurably impact the users’ fairness perceptions unless explanatory information is provided by the system. Finally, our study points to challenges when applying calibration approaches in practice in terms of finding appropriate parameters.

References

[1]
Himan Abdollahpouri, Masoud Mansoury, R. Burke, and Bamshad Mobasher. 2019. The Impact of Popularity Bias on Fairness and Calibration in Recommendation. ArXiv abs/1910.05755 (2019).
[2]
Himan Abdollahpouri, Masoud Mansoury, Robin Burke, and Bamshad Mobasher. 2020. The Connection Between Popularity Bias, Calibration, and Fairness in Recommendation. In Fourteenth ACM Conference on Recommender Systems. 726–731.
[3]
Himan Abdollahpouri, Masoud Mansoury, Robin Burke, Bamshad Mobasher, and Edward C. Malthouse. 2021. User-centered Evaluation of Popularity Bias in Recommender Systems. In Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization, UMAP 2021. ACM, 119–129.
[4]
Gabrielle Alves, Dietmar Jannach, Rodrigo Ferrari, Daniela Damian, and Marcelo Garcia Manzato. 2023. Digitally Nudging Users to Explore Off-Profile Recommendations: Here Be Dragons. User Modeling and User-Adapted Interaction online first (2023).
[5]
Vito Walter Anelli, Alejandro Bellogin, Tommaso Di Noia, Dietmar Jannach, and Claudio Pomo. 2022. Top-N Recommendation Algorithms: A Quest for the State-of-the-Art. In 30th ACM Conference on User Modeling, Adaptation and Personalization (UMAP 2022).
[6]
Julia Angwin, Jeff Larson, Surya Mattu, and Lauren Kirchner. 2016. Machine bias. In Ethics of Data and Analytics. Auerbach Publications, 254–264.
[7]
Solon Barocas, Moritz Hardt, and Arvind Narayanan. 2019. Fairness and Machine Learning. fairmlbook.org. http://www.fairmlbook.org.
[8]
Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He†. 2022. Bias and Debias in Recommender System: A Survey and Future Directions. ACM Trans. Inf. Syst. (2022).
[9]
Li Chen and Pearl Pu. 2014. Experiments on user experiences with recommender interfaces. Behaviour & Information Technology 33 (2014), 372 – 394.
[10]
Diego Corrêa da Silva, Marcelo Garcia Manzato, and Frederico Araújo Durão. 2021. Exploiting personalized calibration and metrics for fairness recommendation. Expert Syst. Appl. 181 (2021), 115112.
[11]
Yashar Deldjoo, Dietmar Jannach, Alejandro Bellogin, Alessandro Difonzo, and Dario Zanzonelli. 2023. Fairness in Recommender Systems: Research Landscape and Future Directions. User Modeling and User-Adapted Interaction online first (2023).
[12]
Karlijn Dinnissen and Christine Bauer. 2023. Amplifying Artists’ Voices: Item Provider Perspectives on Influence and Fairness of Music Streaming Platforms. In Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization, UMAP 2023. 238–249.
[13]
Joseph A. Durlak. 2009. How to select, calculate, and interpret effect sizes.Journal of pediatric psychology 34 9 (2009), 917–28.
[14]
Michael D. Ekstrand, Anubrata Das, Robin Burke, and Fernando Diaz. 2022. Fairness in Information Access Systems. Found. Trends Inf. Retr. 16, 1-2 (2022), 1–177.
[15]
Michael D. Ekstrand, Anubrata Das, Robin Burke, and Fernando Diaz. 2022. Fairness in Recommender Systems. In Recommender Systems Handbook, Francesco Ricci, Lior Rokach, and Bracha Shapira (Eds.). 679–707.
[16]
Mehdi Elahi, Himan Abdollahpouri, Masoud Mansoury, and Helma Torkamaan. 2021. Beyond Algorithmic Fairness in Recommender Systems. In Adjunct Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization (Utrecht, Netherlands) (UMAP ’21). 41–46.
[17]
Mehdi Elahi, Dietmar Jannach, Lars Skjærven, Erik Knudsen, Helle Sjøvaag, Kristian Tolonen, Øyvind Holmstad, Igor Pipkin, Eivind Throndsen, Agnes Stenbom, Eivind Fiskerud, Adrian Oesch, Loek Vredenberg, and Christoph Trattner. 2021. Towards Responsible Media Recommendation. AI and Ethics 2, 1 (2021), 103–114.
[18]
Satu Elo and Helvi Aulikki Kyngäs. 2008. The qualitative content analysis process.Journal of advanced nursing 62 1 (2008), 107–15.
[19]
Daniel Fleder and Kartik Hosanagar. 2009. Blockbuster Culture’s Next Rise or Fall: The Impact of Recommender Systems on Sales Diversity. Management Science 55, 5 (2009), 697–712.
[20]
Carlos A. Gomez-Uribe and Neil Hunt. 2015. The Netflix Recommender System: Algorithms, Business Value, and Innovation. Transactions on Management Information Systems 6, 4 (2015), 13:1–13:19.
[21]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In WWW ’17. 173–182.
[22]
Larry V Hedges. 1981. Distribution Theory for Glass’s Estimator of Effect size and Related Estimators. Journal of Educational Statistics 6 (1981), 107 – 128.
[23]
H. F. Hsieh and Sarah Elizabeth Shannon. 2005. Three Approaches to Qualitative Content Analysis. Qualitative Health Research 15 (2005), 1277 – 1288.
[24]
Nyi Nyi Htun, Elisa Lecluse, and Katrien Verbert. 2021. Perception of Fairness in Group Music Recommender Systems. In 26th International Conference on Intelligent User Interfaces. 302–306.
[25]
Dietmar Jannach and Markus Zanker. 2021. Impact and Value of Recommender Systems. In Recommender Systems Handbook, Francesco Ricci, Bracha Shapira, and Lior Rokach (Eds.). Springer US.
[26]
Yucheng Jin, Li Chen, Wanling Cai, and Pearl Pu. 2021. Key Qualities of Conversational Recommender Systems: From Users’ Perspective. Proceedings of the 9th International Conference on Human-Agent Interaction (2021).
[27]
Michael Jugovac, Dietmar Jannach, and Lukas Lerche. 2017. Efficient Optimization of Multiple Recommendation Quality Factors According to Individual User Tendencies. Expert Systems With Applications 81 (2017), 321–331.
[28]
Mesut Kaya and Derek G. Bridge. 2019. A comparison of calibrated and intent-aware recommendations. Proceedings of the 13th ACM Conference on Recommender Systems (2019).
[29]
Anastasiia Klimashevskaia, Mehdi Elahi, Dietmar Jannach, Lars Skjærven, Astrid Tessem, and Christoph Trattner. 2023. Evaluating The Effects of Calibrated Popularity Bias Mitigation: A Field Study. In 17th ACM Conference on Recommender Systems (Late Breaking Results).
[30]
Anastasiia Klimashevskaia, Mehdi Elahi, Dietmar Jannach, Christoph Trattner, and Lars Skjærven. 2022. Mitigating Popularity Bias in Recommendation: Potential and Limits of Calibration Approaches. In Advances in Bias and Fairness in Information Retrieval, Ludovico Boratto, Stefano Faralli, Mirko Marras, and Giovanni Stilo (Eds.). 82–90.
[31]
Kibeom Lee and Kyogu Lee. 2015. Escaping your comfort zone: A graph-Based recommender system for finding novel recommendations among relevant items. Expert Systems with Applications 42, 10 (2015), 4851–4858.
[32]
Oleg Lesota, Gustavo Escobedo, Yashar Deldjoo, Bruce Ferwerda, Simone Kopeinik, Elisabeth Lex, Navid Rekabsaz, and Markus Schedl. 2023. Computational Versus Perceived Popularity Miscalibration in Recommender Systems. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR ’23). 1889–1893.
[33]
Dawen Liang, Rahul G Krishnan, Matthew D Hoffman, and Tony Jebara. 2018. Variational Autoencoders for Collaborative Filtering. In WWW ’18. 689–698.
[34]
Ninareh Mehrabi, Fred Morstatter, Nripsuta Saxena, Kristina Lerman, and Aram Galstyan. 2021. A Survey on Bias and Fairness in Machine Learning. ACM Comput. Surv. 54, 6, Article 115 (2021).
[35]
Lien Michiels, Jorre Vannieuwenhuyze, Jens Leysen, Robin Verachtert, Annelien Smets, and Bart Goethals. 2023. How Should We Measure Filter Bubbles? A Regression Model and Evidence for Online News. In Proceedings of the 17th ACM Conference on Recommender Systems(RecSys ’23). 640–651.
[36]
Arvind Narayanan. 2018. Translation tutorial: 21 fairness definitions and their politics. In Proc. Conf. Fairness Accountability Transp., New York, USA, Vol. 1170. 3.
[37]
Xia Ning and George Karypis. 2011. SLIM: Sparse linear methods for top-n recommender systems. In Proceedings of ICDM ’11. 497–506.
[38]
Jinoh Oh, Sun Park, Hwanjo Yu, Min Song, and Seung-Taek Park. 2011. Novel Recommendation Based on Personal Popularity Tendency. In ICDM ’11. 507–516.
[39]
Vincenzo Paparella, Vito Walter Anelli, Franco Maria Nardini, R. Perego, and T. D. Noia. 2023. Post-hoc Selection of Pareto-Optimal Solutions in Search and Recommendation. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management (2023).
[40]
Pearl Pu, Li Chen, and Rong Hu. 2011. A User-centric Evaluation Framework for Recommender Systems. In Proceedings of the 5th ACM Conference on Recommender Systems. 157–164.
[41]
Robert Rosenthal. 1984. Meta-analytic procedures for social research.
[42]
Dimitris Serbos, Shuyao Qi, Nikos Mamoulis, Evaggelia Pitoura, and Panayiotis Tsaparas. 2017. Fairness in Package-to-Group Recommendations. In Proceedings of the 26th International Conference on World Wide Web, WWW 2017. 371–379.
[43]
Nasim Sonboli, Jessie J. Smith, Florencia Cabral Berenfus, Robin Burke, and Casey Fiesler. 2021. Fairness and Transparency in Recommendation: The Users’ Perspective. In Proceedings of the 29th ACM Conference on User Modeling, Adaptation and Personalization. 274–279.
[44]
Harald Steck. 2011. Item popularity and recommendation accuracy. In Proceedings of the 2011 ACM Conference on Recommender Systems (RecSys ’11). Chicago, Illinois, USA, 125–132.
[45]
Harald Steck. 2018. Calibrated recommendations. In Proceedings of the 12th ACM Conference on Recommender Systems. 154–162.
[46]
Maria Stratigi, Haridimos Kondylakis, and Kostas Stefanidis. 2017. Fairness in Group Recommendations in the Health Domain. In 33rd IEEE International Conference on Data Engineering, ICDE 2017. 1481–1488.
[47]
Tom Sühr, Sophie Hilgard, and Himabindu Lakkaraju. 2021. Does Fair Ranking Improve Minority Outcomes? Understanding the Interplay of Human and Algorithmic Biases in Online Hiring. 989–999.
[48]
Ruotong Wang, F. Maxwell Harper, and Haiyi Zhu. 2020. Factors Influencing Perceived Fairness in Algorithmic Decision-Making: Algorithm Outcomes, Development Procedures, and Individual Differences. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (2020).
[49]
Yifan Wang, Weizhi Ma, Min Zhang, Yiqun Liu, and Shaoping Ma. 2023. A Survey on the Fairness of Recommender Systems. ACM Trans. Inf. Syst. 41, 3 (2023).
[50]
Haolun Wu, Chen Ma, Bhaskar Mitra, Fernando Diaz, and Xue Liu. 2021. A Multi-Objective Optimization Framework for Multi-Stakeholder Fairness-Aware Recommendation. ACM Transactions on Information Systems 41 (2021), 1 – 29.
[51]
Ye Yuan, Xin Luo, and Mingsheng Shang. 2018. Effects of preprocessing and training biases in latent factor models for recommender systems. Neurocomputing 275 (2018), 2019–2030.
[52]
Ziwei Zhu, Jianling Wang, and James Caverlee. 2020. Measuring and Mitigating Item Under-Recommendation Bias in Personalized Ranking Systems. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020).

Index Terms

  1. User Perception of Fairness-Calibrated Recommendations

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    UMAP '24: Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization
    June 2024
    338 pages
    ISBN:9798400704338
    DOI:10.1145/3627043
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 22 June 2024

    Check for updates

    Author Tags

    1. Fairness
    2. Recommender systems
    3. User Study

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Funding Sources

    • FAPESP

    Conference

    UMAP '24
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 162 of 633 submissions, 26%

    Upcoming Conference

    UMAP '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 277
      Total Downloads
    • Downloads (Last 12 months)277
    • Downloads (Last 6 weeks)47
    Reflects downloads up to 25 Jan 2025

    Other Metrics

    Citations

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media