DOI: 10.1145/3442381.3449933
research-article
Open access

Online Label Aggregation: A Variational Bayesian Approach

Published: 03 June 2021

Abstract

Noisy labels are the norm rather than the exception in crowdsourced content. Aggregating the answers of multiple crowd workers is an effective way to filter out noise and infer the correct labels. To keep results timely and to cope with slow worker responses, online label aggregation is increasingly in demand, which calls for solutions that incrementally infer the true label distribution from subsets of data items. In this paper, we propose a novel online label aggregation framework, BiLA, which employs variational Bayesian inference and a novel stochastic optimization scheme for incremental training. BiLA can accommodate any generating distribution of labels through the exact computation of its posterior distribution. We also derive a convergence bound for the proposed optimizer. We compare BiLA with state-of-the-art methods based on minimax entropy, neural networks, and expectation maximization on synthetic and real-world datasets. Our evaluation on various online scenarios shows that BiLA effectively infers the true labels, reducing the error rate by at least 10 and 1.5 percentage points on synthetic and real-world datasets, respectively.
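
To make the idea of incremental aggregation concrete, the following is a minimal illustrative sketch and not the BiLA algorithm itself. It assumes a much simpler setting than the paper: binary labels, a "one-coin" worker model in which each worker has a single accuracy, and a Beta prior on that accuracy. The class name OnlineOneCoinAggregator, its methods, and the prior values are hypothetical and chosen only for illustration. Each incoming batch is aggregated with the current worker estimates, and the resulting soft labels are then folded back into the workers' Beta pseudo-counts.

```python
import math
from collections import defaultdict


def digamma(x):
    """Digamma for x > 0: recurrence up to x >= 6, then asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    series = inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return result + math.log(x) - 0.5 * inv - series


class OnlineOneCoinAggregator:
    """Hypothetical streaming aggregator: Beta(alpha, beta) per worker accuracy."""

    def __init__(self, prior_correct=2.0, prior_wrong=1.0):
        # Assumed prior: workers are mildly more likely right than wrong.
        self.alpha = defaultdict(lambda: prior_correct)
        self.beta = defaultdict(lambda: prior_wrong)

    def mean_accuracy(self, worker):
        a, b = self.alpha[worker], self.beta[worker]
        return a / (a + b)

    def posterior(self, votes):
        """votes: list of (worker, label in {0, 1}); returns P(true label = 1)."""
        score = 0.0  # log-odds of label 1 versus label 0
        for worker, label in votes:
            a, b = self.alpha[worker], self.beta[worker]
            # E[log acc] - E[log(1 - acc)] under Beta(a, b).
            weight = digamma(a) - digamma(b)
            score += weight if label == 1 else -weight
        if score >= 0:  # numerically stable sigmoid
            return 1.0 / (1.0 + math.exp(-score))
        e = math.exp(score)
        return e / (1.0 + e)

    def update(self, batch):
        """batch: dict item -> votes. Returns item -> P(label = 1)."""
        estimates = {}
        for item, votes in batch.items():
            p1 = self.posterior(votes)
            estimates[item] = p1
            for worker, label in votes:
                # Soft count of how likely this vote was correct.
                p_correct = p1 if label == 1 else 1.0 - p1
                self.alpha[worker] += p_correct
                self.beta[worker] += 1.0 - p_correct
        return estimates


if __name__ == "__main__":
    agg = OnlineOneCoinAggregator()
    # Two batches arriving over time; worker "c" disagrees with "a" and "b".
    print(agg.update({"item1": [("a", 1), ("b", 1), ("c", 0)]}))
    print(agg.update({"item2": [("a", 0), ("b", 0), ("c", 1)]}))
    print({w: round(agg.mean_accuracy(w), 3) for w in "abc"})
```

The sketch processes each batch exactly once and keeps only per-worker pseudo-counts, which is what makes it usable on a stream; it has none of the generality or convergence guarantees claimed for BiLA in the abstract.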

Cited By

  • (2024) A Lightweight, Effective, and Efficient Model for Label Aggregation in Crowdsourcing. ACM Transactions on Knowledge Discovery from Data 18:4 (1-27). DOI: 10.1145/3630102. Online publication date: 13-Feb-2024
  • (2023) Crowdsourcing Truth Inference via Reliability-Driven Multi-View Graph Embedding. ACM Transactions on Knowledge Discovery from Data 17:5 (1-26). DOI: 10.1145/3565576. Online publication date: 27-Feb-2023

    Published In

    WWW '21: Proceedings of the Web Conference 2021
    April 2021
    4054 pages
    ISBN:9781450383127
    DOI:10.1145/3442381
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. convergence bound
    2. label aggregation
    3. online
    4. stochastic optimizer
    5. variational bayesian inference

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    WWW '21
    WWW '21: The Web Conference 2021
    April 19 - 23, 2021
    Ljubljana, Slovenia

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
