Evidential uncertainty sampling for active learning

Authors: Arthur Hoarau, Vincent Lemaire, Arnaud Martin, Jean-Christophe Dubois, Yolande Le Gall

Abstract: Recent studies in active learning, particularly in uncertainty sampling, have focused on the decomposition of model uncertainty into reducible and irreducible uncertainties. In this paper, the aim is to simplify the computational process while eliminating the dependence on observations. Crucially, the inherent uncertainty in the labels, that is, the uncertainty of the oracles, is considered. Two strategies are proposed, both using the theory of belief functions: sampling by Klir uncertainty, which tackles the exploration-exploitation dilemma, and sampling by evidential epistemic uncertainty, which extends the concept of reducible uncertainty within the evidential framework. Experimental results in active learning demonstrate that our proposed method can outperform uncertainty sampling.

Submitted 25 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

arXiv:2303.04548 [pdf, other]

Estimation of the qualification and behavior of a contributor and aggregation of his answers in a crowdsourcing context

Authors: Constance Thierry, Arnaud Martin, Jean-Christophe Dubois, Yolande Le Gall

Abstract: Crowdsourcing is the outsourcing of tasks to a crowd of contributors on a dedicated platform. The crowd on these platforms is very diverse and includes various contributor profiles, which generates data of uneven quality. However, majority voting, the aggregation method commonly used on platforms, gives equal weight to each contribution. To overcome this problem, we propose a method, MONITOR, which estimates the contributor's profile and aggregates the collected data, taking into account their possible imperfections by means of the theory of belief functions. To do so, MONITOR starts by estimating the profile of the contributor through his qualification for the task and his behavior. Crowdsourcing campaigns have been carried out to collect the data needed to test MONITOR on real data and compare it to existing approaches. The results of the experiments show that MONITOR yields a higher rate of correct answers after aggregation of the contributions than majority voting. Our contributions in this article are, first, a model that takes into account both the qualification of the contributor and his behavior in the estimation of his profile and, second, the weakening and aggregation of the answers according to the estimated profiles.

Submitted 8 March, 2023; originally announced March 2023.

Journal ref: Expert Systems with Applications, 2023

arXiv:2211.11809 [pdf, other]

Real bird dataset with imprecise and uncertain values

Authors: Constance Thierry, Arthur Hoarau, Arnaud Martin, Jean-Christophe Dubois, Yolande Le Gall

Abstract: The theory of belief functions allows the fusion of imperfect data from different sources. Unfortunately, few real, imprecise and uncertain datasets exist to test approaches using belief functions. We have built real bird datasets from numerous human contributions, and we make them available to the scientific community. The interest of our datasets is that they are made of human contributions, so the information is naturally uncertain and imprecise. These imperfections are given directly by the contributors. This article presents the data, their collection through crowdsourcing, and how to obtain belief functions from the data.

Submitted 21 November, 2022; originally announced November 2022.

Journal ref: 7th International Conference on Belief Functions, Oct 2022, Paris, France

arXiv:2002.11717 [pdf, other]

Modelisation de l'incertitude et de l'imprecision de donnees de crowdsourcing : MONITOR

Authors: Constance Thierry, Jean-Christophe Dubois, Yolande Le Gall, Arnaud Martin

Abstract: Crowdsourcing is defined as the outsourcing of tasks to a crowd of contributors. The crowd is very diverse on these platforms and includes malicious contributors attracted by the remuneration of tasks and not conscientiously performing them. It is essential to identify these contributors in order to avoid considering their responses. As not all contributors have the same aptitude for a task, it seems appropriate to give weight to their answers according to their qualifications. This paper, published at the ICTAI 2019 conference, proposes a method, MONITOR, for estimating the profile of the contributor and aggregating the responses using belief function theory.

Submitted 26 February, 2020; originally announced February 2020.

Comments: in French. Extraction et Gestion des Connaissances (EGC), Jan 2020, Bruxelles, Belgique

arXiv:1907.10588 [pdf, other]

Measuring the Expertise of Workers for Crowdsourcing Applications

Authors: Jean-Christophe Dubois, Laetitia Gros, Mouloud Kharoune, Yolande Le Gall, Arnaud Martin, Zoltán Miklós, Hosna Ouni

Abstract: Crowdsourcing platforms enable companies to propose tasks to a large crowd of users. The workers receive a compensation for their work according to the serious of the tasks they managed to accomplish. The evaluation of the quality of responses obtained from the crowd remains one of the most important problems in this context. Several methods have been proposed to estimate the expertise level of crowd workers. We propose an innovative measure of expertise assuming that we possess a dataset with an objective comparison of the items concerned. Our method is based on the definition of four factors with the theory of belief functions. We compare our method to the Fagin distance on a dataset from a real experiment, where users have to assess the quality of some audio recordings. Then, we propose to fuse both the Fagin distance and our expertise measure.

Submitted 24 June, 2019; originally announced July 2019.

Journal ref: Advances in Knowledge Discovery and Management, pp.139-157, 2019

arXiv:1811.07536 [pdf, other]

Contributors profile modelization in crowdsourcing platforms

Authors: Constance Thierry, Jean-Christophe Dubois, Yolande Le Gall, Arnaud Martin

Abstract: Crowdsourcing consists of outsourcing tasks to a crowd of people who are paid to carry them out. The crowd, usually diverse, can include users without the qualification and/or motivation for the tasks. In this paper we introduce a new method for modeling user expertise on crowdsourcing platforms, based on the theory of belief functions, in order to identify serious and qualified users.

Submitted 19 November, 2018; originally announced November 2018.

Comments: in French, 27èmes rencontres francophones sur la logique floue et ses applications, Nov 2018, Arras, France

arXiv:1808.00495 [pdf, other]

Semantic Classification of 3D Point Clouds with Multiscale Spherical Neighborhoods

Authors: Hugues Thomas, Jean-Emmanuel Deschaud, Beatriz Marcotegui, François Goulette, Yann Le Gall

Abstract: This paper introduces a new definition of multiscale neighborhoods in 3D point clouds. This definition, based on spherical neighborhoods and proportional subsampling, allows the computation of features with a consistent geometrical meaning, which is not the case when using k-nearest neighbors. With an appropriate learning strategy, the proposed features can be used in a random forest to classify 3D points. In this semantic classification task, we show that our multiscale features outperform state-of-the-art features using the same experimental conditions. Furthermore, their classification power competes with more elaborate classification approaches including Deep Learning methods.

Submitted 1 August, 2018; originally announced August 2018.

Comments: 3DV2018

arXiv:1501.04792 [pdf, other]

doi 10.1007/978-3-319-11191-9_15

Designing a Belief Function-Based Accessibility Indicator to Improve Web Browsing for Disabled People

Authors: Jean-Christophe Dubois, Yolande Le Gall, Arnaud Martin

Abstract: The purpose of this study is to provide an accessibility measure of web pages, in order to draw disabled users to the pages that have been designed to be accessible to them. Our approach is based on the theory of belief functions, using data supplied by reports from automatic web content assessors that test the validity of criteria defined by the WCAG 2.0 guidelines proposed by the World Wide Web Consortium (W3C). These tools detect errors with gradual degrees of certainty and their results do not always converge. For these reasons, to fuse information coming from the reports, we choose to use an information fusion framework which can take into account the uncertainty and imprecision of information as well as divergences between sources. Our accessibility indicator covers four categories of deficiencies. To validate the theoretical approach in this context, we propose an evaluation completed on a corpus of the 100 most visited French news websites and 2 evaluation tools. The results obtained illustrate the interest of our accessibility indicator.

Submitted 20 January, 2015; originally announced January 2015.

Journal ref: Belief 2014, Sep 2014, Oxford, United Kingdom. Lecture Notes in Artificial Intelligence, Lecture Notes in Computer Science, Vol. 8764, pp.134 - 142, Belief Functions: Theory and Applications

Showing 1–8 of 8 results for author: Gall, Y L