LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

Ahmed, Sajid; Rayhan, Farshid; Mahbub, Asif; Jani, Md. Rafsan; Shatabda, Swakkhar; Farid, Dewan Md.; Rahman, Chowdhury Mofizur

Computer Science > Machine Learning

arXiv:1711.05365 (cs)

[Submitted on 15 Nov 2017]

Title:LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

Authors:Sajid Ahmed, Farshid Rayhan, Asif Mahbub, Md. Rafsan Jani, Swakkhar Shatabda, Dewan Md. Farid, Chowdhury Mofizur Rahman

View PDF

Abstract:The problem of class imbalance along with class-overlapping has become a major issue in the domain of supervised learning. Most supervised learning algorithms assume equal cardinality of the classes under consideration while optimizing the cost function and this assumption does not hold true for imbalanced datasets which results in sub-optimal classification. Therefore, various approaches, such as undersampling, oversampling, cost-sensitive learning and ensemble based methods have been proposed for dealing with imbalanced datasets. However, undersampling suffers from information loss, oversampling suffers from increased runtime and potential overfitting while cost-sensitive methods suffer due to inadequately defined cost assignment schemes. In this paper, we propose a novel boosting based method called LIUBoost. LIUBoost uses under sampling for balancing the datasets in every boosting iteration like RUSBoost while incorporating a cost term for every instance based on their hardness into the weight update formula minimizing the information loss introduced by undersampling. LIUBoost has been extensively evaluated on 18 imbalanced datasets and the results indicate significant improvement over existing best performing method RUSBoost.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1711.05365 [cs.LG]
	(or arXiv:1711.05365v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1711.05365

Submission history

From: Sajid Ahmed [view email]
[v1] Wed, 15 Nov 2017 00:44:41 UTC (13 KB)

Computer Science > Machine Learning

Title:LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators