Incremental Ensemble Learning Model for Imbalanced Data: a Case Study of Credit Scoring
Abstract
Imbalanced data is a challenge for classification models. It reduces the overall performance of traditional learning algorithms. Besides, the minority class of imbalanced datasets is misclassified with a high ratio even though this is a crucial object of the classification process. In this paper, a new model called the Lasso-Logistic ensemble is proposed to deal with imbalanced data by utilizing two popular techniques, random over-sampling and random under-sampling. The model was applied to two real imbalanced credit data sets. The results show that the Lasso-Logistic ensemble model offers better performance than the single traditional methods, such as random over-sampling, random under-sampling, Synthetic Minority Oversampling Technique (SMOTE), and cost-sensitive learning.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium provided the original work is properly cited.
Keywords
Full Text:
PDFTime cited: 0
DOI: http://dx.doi.org/10.55579/jaec.202372.407
Refbacks
- There are currently no refbacks.
Copyright (c) 2023 Journal of Advanced Engineering and Computation
This work is licensed under a Creative Commons Attribution 4.0 International License.