Manifold Distance-Based Over-Sampling Technique for Class Imbalance Learning

Authors

  • Lingkai Yang China University of Mining and Technology
  • Yinan Guo China University of Mining and Technology
  • Jian Cheng China University of Mining and Technology

DOI:

https://doi.org/10.1609/aaai.v33i01.330110071

Abstract

Over-sampling technology for handling the class imbalanced problem generates more minority samples to balance the dataset size of different classes. However, sampling in original data space is ineffective as the data in different classes is overlapped or disjunct. Based on this, a new minority sample is presented in terms of the manifold distance rather than Euclidean distance. The overlapped majority and minority samples apt to distribute in fully disjunct subspaces from the view of manifold learning. Moreover, it can avoid generating samples between the minority data locating far away in manifold space. Experiments on 23 UCI datasets show that the proposed method has the better classification accuracy.

Downloads

Published

2019-07-17

How to Cite

Yang, L., Guo, Y., & Cheng, J. (2019). Manifold Distance-Based Over-Sampling Technique for Class Imbalance Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 10071-10072. https://doi.org/10.1609/aaai.v33i01.330110071

Issue

Section

Student Abstract Track