A Novel Multi-Stage Stacked Ensemble Classifier using Heterogeneous Base Learners

A Novel Multi-Stage Stacked Ensemble Classifier using Heterogeneous Base Learners

  IJETT-book-cover           
  
© 2023 by IJETT Journal
Volume-71 Issue-4
Year of Publication : 2023
Author : Pavitha N, Shounak Sugave
DOI : 10.14445/22315381/IJETT-V71I4P206

How to Cite?

Pavitha N, Shounak Sugave , "A Novel Multi-Stage Stacked Ensemble Classifier using Heterogeneous Base Learners," International Journal of Engineering Trends and Technology, vol. 71, no. 4, pp. 65-71, 2023. Crossref, https://doi.org/10.14445/22315381/IJETT-V71I4P206

Abstract
The assessment of credit risk is essential to contemporary economies. Historically, statistical techniques and manual auditing have been used to measure credit risk associated with credit grants. Recent developments in financial AI are the result of a new generation of machine learning (ML)-driven credit risk models that have drawn a lot of interest from both business and academia. In this research, we aimed to improve ML algorithms' performance by optimizing hyperparameters. Also, we proposed a novel multi-stage heterogeneous stacked ensemble ML algorithm for predicting credit risk. Experimental results show a significant improvement in the performance of the proposed algorithm as compared to other hyperparameter-optimized ML algorithms. Two real-time data sets from emerging market economies are used to evaluate our model. Different evaluation metrics, namely precision, recall, f1-score and accuracy, are used for evaluating model performance.

Keywords
Machine Learning, Ensemble algorithm, Heterogeneous ensemble, Statistical modelling, Credit risk.

References
[1] Wei-Yang Lin et al., “Machine Learning in Financial Crisis Prediction: A Survey,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 42, no. 4, pp. 421–436, 2012.
[CrossRef] [Google Scholar] [Publisher Link]
[2] Shawni Dutta et al., “A Hybrid Machine Learning Model for Bank Customer Churn Prediction,” International Journal of Engineering Trends and Technology, vol. 70, no. 6, pp. 13–23, 2022.
[CrossRef] [Publisher Link]
[3] Swati Warghade, Shubhada Desai, and Vijay Patil, "Credit Card Fraud Detection from Imbalanced Dataset Using Machine Learning Algorithm," International Journal of Computer Trends and Technology, vol. 68, no. 3, pp. 22-28, 2020.
[CrossRef] [Google Scholar] [Publisher Link]
[4] Anton Markov, Zinaida Seleznyova, and Victor Lapshin, “Credit scoring methods: Latest Trends and Points to Consider,” The Journal of Finance and Data Science, vol. 8, pp. 180–201, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[5] Arijit Bhattacharya, Saroj Kr. Biswas, and Ardhendu Mandal, “Credit Risk Evaluation: A Comprehensive Study,” Multimedia Tools and Applications, pp. 1–51, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[6] Sudhansu R. Lenka, Bikram K. Ratha, and Biswaranjan Nayak, “A Review on Novel Approach to Handle Imbalanced Credit Card Transactions,” International Journal of Engineering Trends and Technology, vol. 62, no. 2, pp. 80-95, 2018.
[CrossRef] [Publisher Link]
[7] Salvatore Carta et al., “Credit Scoring by Leveraging an Ensemble Stochastic Criterion in a Transformed Feature Space,” Progress in Artificial Intelligence, vol. 10, no. 4, pp. 417–432, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[8] Mayorga Lira Sergio Dennis, Laberiano Andrade-Arenas, and Miguel Angel Cano Lengua, "Credit Risk Analysis: Using Artificial Intelligence in a Web Application," International Journal of Engineering Trends and Technology, vol. 71, no. 1, pp. 305-316, 2023.
[CrossRef] [Publisher Link]
[9] Wenyu Zhang, Dongqi Yang, and Shuai Zhang “A New Hybrid Ensemble Model with Voting-Based Outlier Detection and Balanced Sampling for Credit Scoring,” Expert Systems with Applications, vol. 174, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[10] Wenyu Zhang et al., “A Novel Multi-Stage Ensemble Model with Enhanced Outlier Adaptation for Credit Scoring,” Expert Systems With Applications, vol. 165, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[11] Yilun Jin et al., “A Novel Multi-Stage Ensemble Model with a Hybrid Genetic Algorithm for Credit Scoring on Imbalanced Data,” IEEE Access, vol. 9, pp. 143593–143607, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[12] Jianrong Yao et al., “Novel Hybrid Ensemble Credit Scoring Model with Stacking-Based Noise Detection and Weight Assignment,” Expert Systems with Applications, vol. 198, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[13] Himanshu Thakur, and Aman Kumar Sharma, "Supervised Machine Learning Classifiers: Computation of Best Result of Classification Accuracy," International Journal of Computer Trends and Technology, vol. 68, no. 10, pp. 1-8, 2020.
[CrossRef] [Publisher Link]
[14] Abhijit Das, Pramod, "Exploratory Analysis on Anomaly-based IDS Data Using DASK and Ensemble Learning: A Data Parallelization Approach,” International Journal of Engineering Trends and Technology, vol. 70, no. 12, pp. 370-391, 2022.
[CrossRef] [Publisher Link]
[15] Indu Singh et al., “A Multi-level Classification and Modified PSO Clustering Based Ensemble Approach for Credit Scoring,” Applied Soft Computing, vol. 111, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[16] S. Yamashkin et al., “Classification of Metageosystems by Ensembles of Machine Learning Models,” International Journal of Engineering Trends and Technology, vol. 70, no. 9, pp. 258–268, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[17] A. Lakshmanarao, and M. Shashi, “An Efficient Android Malware Detection Framework with Stacking Ensemble Model,” International Journal of Engineering Trends and Technology, vol. 70, no. 4, pp. 294–302, 2022.
[CrossRef] [Publisher Link]
[18] Zarapala Sunitha Bai, and Sreelatha Malempati, “An Enhanced Text Mining Approach using Ensemble Algorithm for Detecting Cyber Bullying,” International Journal of Engineering Trends and Technology, vol. 70, no. 9, pp. 393–399, 2022.
[CrossRef] [Publisher Link]
[19] Diwakar Tripathi et al., “Credit Scoring Models Using Ensemble Learning and Classification Approaches: A Comprehensive Survey,” Wireless Personal Communications, vol. 123, no. 1, pp. 785–812, 2021.
[CrossRef] [Google Scholar] [Publisher Link]
[20] G.Gowrishankar, S.Balgani, and R.Aruna, "Multiselfish Attacks Detection Based on Credit Risk Information in Cognitive Radio Ad-hoc Networks," SSRG International Journal of Computer Science and Engineering, vol. 2, no. 4, pp. 1-7, 2015.
[CrossRef] [Publisher Link]
[21] Sudhansu R. Lenka et al., “Empirical Analysis of Ensemble Learning for Imbalanced Credit Scoring Datasets: A Systematic Review,” Wireless Communications & Mobile Computing, vol. 2022, 2022.
[CrossRef] [Google Scholar] [Publisher Link]
[22] Akinbohun Folake, Akinbohun Ambrose, and E. Oyinloye Oghenerukevwe, "Stacked Ensemble Model for Hepatitis in Healthcare System," International Journal of Computer and Organization Trends, vol. 9, no. 4, pp. 25-29, 2019.
[CrossRef] [Publisher Link]
[23] Q. Sun, and B. Pfahringer, “Bagging Ensemble Selection,” Advances in Artificial Intelligence, vol. 7106, pp. 251–260, 2011.
[CrossRef] [Google Scholar] [Publisher Link]
[24] J. Lu et al., “Ensemble-Based Discriminant Learning with Boosting for Face Recognition,” IEEE Transactions on Neural Networks, vol. 17, no. 1, pp. 166–178, 2006.
[CrossRef] [Google Scholar] [Publisher Link]
[25] Smitha Rajagopal, Poornima Panduranga Kundapur, and Katiganere Siddaramappa Hareesha, “A Stacking Ensemble for Network Intrusion Detection Using Heterogeneous Datasets,” Security and Communication Networks, vol. 2020, 2020.
[CrossRef] [Google Scholar] [Publisher Link]