Influence of AI’s Uncertainty in the Dawid-Skene Aggregation for Human-AI Crowdsourcing

Tamura, Takumi; Ito, Hiroyoshi; Oyama, Satoshi; Morishima, Atsuyuki

doi:10.1007/978-3-031-57867-0_17

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14598))

Included in the following conference series:

International Conference on Information

590 Accesses

Abstract

The power and expressiveness of AIs are rapidly increasing, and now AIs have the ability to complete tasks in crowdsourcing as if they were human crowd workers. Therefore, the development of methods to effectively aggregate the results of tasks performed by AIs and humans is becoming a critical problem. In this study, we revisit the Dawid-Skene model that has been used to aggregate human votes to obtain better results in classification problems. Most of the state-of-the-art AI classifiers predict the class probabilities as their output. Considering the probabilities represent their uncertainty, utilizing them in Dawid-Skene aggregation may provide higher-quality annotations. To this end, we introduce a variation of the Dawid-Skene model to directly use the probabilities without discarding them and conduct experiments with two real-world datasets of different domains. Experimental results show that the Dawid-Skene model with probabilities improves the overall accuracy. Moreover, a detailed analysis shows that the aggregation results were improved for classification tasks with high uncertainty.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 159.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Managing Uncertainty in Crowdsourcing with Interval-Valued Labels

An Evidential Semi-supervised Label Aggregation Approach

Crowd Label Aggregation Under a Belief Function Framework

Notes

1.
https://huggingface.co/models.
2.
https://www.kaggle.com/.
3.
Available at https://github.com/Evgeneus/screening-classification-datasets/.

References

Amer-Yahia, S., et al.: Making AI machines work for humans in FoW. ACM SIGMOD Rec. 49(2), 30–35 (2020)
Article Google Scholar
Bi, W., Wang, L., Kwok, J.T., Tu, Z.: Learning to predict from crowdsourced data. In: Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence (UAI), pp. 82–91 (2014)
Google Scholar
Branson, S., Horn, G.V., Perona, P.: Lean crowdsourcing: combining humans and machines in an online system. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6109–6118 (2017)
Google Scholar
Correia, A., et al.: Designing for hybrid intelligence: a taxonomy and survey of crowd-machine interaction. Appl. Sci. 13(4), 2198 (2023)
Article Google Scholar
Dawid, A.P., Skene, A.M.: Maximum likelihood estimation of observer error-rates using the EM algorithm. J. Roy. Stat. Soc. Ser. C (Appl. Stat.) 28(1), 20–28 (1979)
Google Scholar
He, X., et al.: AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators. arXiv preprint arXiv:2303.16854 (2023)
Kanda, T., Ito, H., Morishima, A.: Efficient evaluation of AI workers for the human+AI crowd task assignment. In: Proceedings of IEEE International Conference on Big Data (BigData), pp. 3995–4001 (2022)
Google Scholar
Kobayashi, M., Wakabayashi, K., Morishima, A.: Human+AI crowd task assignment considering result quality requirements. In: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing (HCOMP), vol. 9, pp. 97–107 (2021)
Google Scholar
Krivosheev, E., Casati, F., Baez, M., Benatallah, B.: Combining crowd and machines for multi-predicate item screening. In: Proceedings of the ACM on Human-Computer Interaction (CSCW), vol. 2, pp. 1–18 (2018)
Google Scholar
Le, Y., Yang, X.: Tiny imagenet visual recognition challenge. CS 231N 7(7) (2015)
Google Scholar
Liao, Y.H., Kar, A., Fidler, S.: Towards good practices for efficiently annotating large-scale image classification datasets. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4348–4357 (2021)
Google Scholar
Nakov, P., Ritter, A., Rosenthal, S., Sebastiani, F., Stoyanov, V.: SemEval-2016 task 4: sentiment analysis in Twitter. In: Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval), pp. 1–18 (2016)
Google Scholar
Nguyen, D.Q., Vu, T., Nguyen, A.T.: BERTweet: a pre-trained language model for English Tweets. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP), pp. 9–14 (2020)
Google Scholar
Oyama, S., Baba, Y., Sakurai, Y., Kashima, H.: Accurate integration of crowdsourced labels using workers’ self-reported confidence scores. In: Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence (IJCAI), pp. 2554–2560 (2013)
Google Scholar
Pérez, J.M., Giudici, J.C., Luque, F.: pysentimiento: A Python Toolkit for Sentiment Analysis and SocialNLP tasks. arXiv preprint arXiv:2106.09462 (2021)
Ramírez, J., Baez, M., Casati, F., Benatallah, B.: Understanding the impact of text highlighting in crowdsourcing tasks. In: Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing (HCOMP), vol. 7, pp. 144–152 (2019)
Google Scholar
Whitehill, J., Ruvolo, P., Wu, T., Bergsma, J., Movellan, J.: Whose vote should count more: optimal integration of labels from labelers of unknown expertise. In: Proceedings of the 22nd International Conference on Neural Information Processing Systems (NIPS), pp. 2035–2043 (2009)
Google Scholar
Yamashita, Y., Ito, H., Wakabayashi, K., Kobayashi, M., Morishima, A.: HAEM: obtaining higher-quality classification task results with AI workers. In: Proceedings of the 14th ACM Web Science Conference (WebSci), pp. 118–128 (2022)
Google Scholar

Download references

Acknowledgements

This work was supported by JSPS KAKENHI Grant Number JP21H03552, JP22H00508, JP22K17944, JP23H03405, JST CREST Grant Number JPMJCR21D1, and JPMJCR22M2.

Author information

Authors and Affiliations

School of Informatics, University of Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki, 305-8577, Japan
Takumi Tamura
Faculty of Library, Information and Media Science, University of Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki, 305-8577, Japan
Hiroyoshi Ito & Atsuyuki Morishima
School of Data Science, Nagoya City University, 1 Yamanobata, Mizuho-cho, Mizuho-ku, Nagoya, Aichi, 467-8501, Japan
Satoshi Oyama

Authors

Takumi Tamura
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyoshi Ito
View author publications
You can also search for this author in PubMed Google Scholar
Satoshi Oyama
View author publications
You can also search for this author in PubMed Google Scholar
Atsuyuki Morishima
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Takumi Tamura .

Editor information

Editors and Affiliations

iSchool organization, Berlin, Germany
Isaac Sserwanga
University of Tsukuba, Tsukuba, Japan
Hideo Joho
Jilin University, Changchun, China
Jie Ma
Stockholm University, Kista, Sweden
Preben Hansen
Wuhan University, Wuhan, China
Dan Wu
University of Tsukuba, Tsukuba, Japan
Masanori Koizumi
University of California, Los Angeles, CA, USA
Anne J. Gilliland

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tamura, T., Ito, H., Oyama, S., Morishima, A. (2024). Influence of AI’s Uncertainty in the Dawid-Skene Aggregation for Human-AI Crowdsourcing. In: Sserwanga, I., et al. Wisdom, Well-Being, Win-Win. iConference 2024. Lecture Notes in Computer Science, vol 14598. Springer, Cham. https://doi.org/10.1007/978-3-031-57867-0_17

Download citation

DOI: https://doi.org/10.1007/978-3-031-57867-0_17
Published: 10 April 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-57866-3
Online ISBN: 978-3-031-57867-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Influence of AI’s Uncertainty in the Dawid-Skene Aggregation for Human-AI Crowdsourcing