Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/3077136.3080679acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper

Label Aggregation for Crowdsourcing with Bi-Layer Clustering

Published: 07 August 2017 Publication History

Abstract

This paper proposes a novel general label aggregation method for both binary and multi-class labeling in crowdsourcing, namely Bi-Layer Clustering (BLC), which clusters two layers of features - the conceptual-level and the physical-level features - to infer true labels of instances. BLC first clusters the instances using the conceptual-level features extracted from their multiple noisy labels and then performs clustering again using the physical-level features. It can facilitate tracking the uncertainty changes of the instances, so that the integrated labels that are likely to be falsely inferred on the conceptual layer can be easily corrected using the estimated labels on the physical layer. Experimental results on two real-world crowdsourcing data sets show that BLC outperforms seven state-of-the-art methods.

References

[1]
W. Bi, L. Wang, J. T. Kwok, and Z. Tu. Learning to predict from crowdsourced data. In UAI, pages 82--91, 2014.
[2]
A. P. Dawid and A. M. Skene. Maximum likelihood estimation of observer error-rates using the EM algorithm. Applied Statistics, 28(1): 20--28, 1979.
[3]
G. Demartini, D. E. Difallah, and P. Cudré-Mauroux. ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. In WWW, pages 469--478, 2012.
[4]
E. Kamar, A. Kapoor, and E. Horvitz. Identifying and accounting for task-dependent bias in crowdsourcing. In AAAI HCOMP, pages 92--101, 2015.
[5]
D. R. Karger, S. Oh, and D. Shah. Iterative learning for reliable crowdsourcing systems. In NIPS, 24: 1953--1961, 2011.
[6]
A. Kurve, D. J. Miller, and G. Kesidis. Multicategory crowdsourcing accounting for variable task difficulty, worker skill, and worker intention. IEEE TKDE, 27(3): 794--809, 2015.
[7]
H. Li and B. Yu. Error rate bounds and iterative weighted majority voting for crowdsourcing. arXiv preprint arXiv:1411.4086, 2014.
[8]
V. C. Raykar, S. Yu, L. H. Zhao, C. Florin, G. H. Valadez, L. Bogoni, and L. Moy. Learning from crowds. JMLR, 11: 1297--1322, 2010.
[9]
A. Sheshadri and M. Lease. SQUARE: a benchmark for research on computing crowd consensus. In AAAI HCOMP, pages 56--164, 2013.
[10]
M. Venanzi, J. Guiver, G. Kazai, P. Kohli, and M. Shokouhi. Community-based bayesian aggregation models for crowdsourcing. In WWW, pages 155--164, 2014.
[11]
P. Welinder, S. Branson, P. Perona, and S. J. Belongie. The multidimensional wisdom of crowds. In NIPS, 23: 2424--2432, 2010.
[12]
J. Whitehill, P. Ruvolo, T. Wu, J. Bergsma, and J. Movella. Whose vote should count more: optimal integration of labels from labelers of unknown expertise. In NIPS, 22: 2035--2043, 2009.
[13]
Y. Zhang, X. Chen, D. Zhou, and M. I. Jordan. Spectral methods meet EM: a provably optimal algorithm for crowdsourcing. In NIPS, 27: 1260--1268, 2014.
[14]
J. Zhang, V. S. Sheng, J. Wu, and X. Wu. Multi-class ground truth inference in crowdsourcing with clustering. IEEE TKDE, 28(4): 1080--1085, 2016.
[15]
Y. Zheng, G. Li, Y. Li, C. Shan, and R. Cheng. Truth inference in crowdsourcing: Is the problem solved? In VLDB Endowment, 10(5), 2017.
[16]
D. Zhou, S. Basu, Y. Mao, and J. C. Platt. Learning from the wisdom of crowds by minimax entropy. In NIPS, pages 2195--2203, 2012.

Cited By

View all
  • (2024)A Little Truth Injection but a Big Reward: Label Aggregation With Graph Neural NetworksIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.3338216(1-14)Online publication date: 2024
  • (2023)TIRA: Truth Inference via Reliability Aggregation on Object-Source GraphIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.322530835:11(11967-11981)Online publication date: 1-Nov-2023
  • (2023)A Generative Answer Aggregation Model for Sentence-Level Crowdsourcing TasksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.314282135:4(3299-3312)Online publication date: 1-Apr-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '17: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval
August 2017
1476 pages
ISBN:9781450350228
DOI:10.1145/3077136
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 August 2017

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. clustering
  2. crowdsourcing
  3. inference
  4. label aggregation

Qualifiers

  • Short-paper

Funding Sources

Conference

SIGIR '17
Sponsor:

Acceptance Rates

SIGIR '17 Paper Acceptance Rate 78 of 362 submissions, 22%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)1
Reflects downloads up to 03 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Little Truth Injection but a Big Reward: Label Aggregation With Graph Neural NetworksIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.3338216(1-14)Online publication date: 2024
  • (2023)TIRA: Truth Inference via Reliability Aggregation on Object-Source GraphIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.322530835:11(11967-11981)Online publication date: 1-Nov-2023
  • (2023)A Generative Answer Aggregation Model for Sentence-Level Crowdsourcing TasksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.314282135:4(3299-3312)Online publication date: 1-Apr-2023
  • (2022)Crowdsourcing Truth Inference via Reliability-Driven Multi-View Graph EmbeddingACM Transactions on Knowledge Discovery from Data10.1145/356557617:5(1-26)Online publication date: 4-Oct-2022
  • (2022)Knowledge Learning With Crowdsourcing: A Brief Review and Systematic PerspectiveIEEE/CAA Journal of Automatica Sinica10.1109/JAS.2022.1054349:5(749-762)Online publication date: May-2022
  • (2020)CrowdWTACM Transactions on Knowledge Discovery from Data10.1145/342171215:1(1-24)Online publication date: 7-Dec-2020
  • (2020)A Review of Judgment Analysis Algorithms for Crowdsourced OpinionsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.290406432:7(1234-1248)Online publication date: 1-Jul-2020
  • (2020)Label similarity-based weighted soft majority voting and pairing for crowdsourcingKnowledge and Information Systems10.1007/s10115-020-01475-yOnline publication date: 14-May-2020
  • (2019)Crowdsourced Label Aggregation Using Bilayer Collaborative ClusteringIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2018.289014830:10(3172-3185)Online publication date: Oct-2019
  • (2019)Multi-Label Truth Inference for Crowdsourcing Using Mixture ModelsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2019.2951668(1-1)Online publication date: 2019

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media