The Machine Learning Methods For Micro-Credit Scoring: The Case Of Micro-Financing In Mongolia

Authors

  • Bayarmaa Dashnyam Senior Lecturer of Finance Department, National University of Mongolia
  • Gerelt-Od Uvgunkhuu Associate Professor of Finance Department, National University of Mongolia
  • Burmaa Sosorbaram 2MBA, Department of Finance, National University of Mongolia

DOI:

https://doi.org/10.59188/eduvest.v4i5.1137

Keywords:

Micro-Credit, Micro-Finance, Credit Scoring, Microloan, Machine Learning

Abstract

As a result of growing digital technologies in the financial sector, the traditional slow lending process is being replaced by fast and easy digital lending systems that can make decisions in real time. Both lenders and borrowers have experienced the benefits of digital lending, the activities of microfinance institutions have expanded rapidly and the volume of digital microloans has increased significantly worldwide, including Mongolia. At the same time with the growing volume of digital microloans in Mongolia, the rationality of credit risk management has been becoming more critical. Credit quality is the most important factor in optimal credit risk management. It depends on determining the customer's creditworthiness and making accurate credit decisions. This research focuses on a credit scoring system to improve the digital loan evaluation system of the Mongolian microfinance institute. This study aims to contribute to the development of possible credit scoring systems for Mongolian microfinance institutions by comparing several machine-learning approaches based on loan datasets of a non-banking microfinance institute in Mongolia. The result shows the ensemble methods Random Forest and XGBoost Tree's accuracies are higher than other machine learning models for the microloan borrowers' repayment status prediction.

References

An-Hsing Chang, L.-K. Y.-H.-K. (2022). Machine learning and artificial neural networks to construct P2P lending credit-scoring model: A case using Lending Club data . Quantitative Finance and Economics, 6(2), 303-325. doi:10.3934/QFE.2022013

Anil Kumar, S. S. (2021). Machine Learning (ML) Technologies for Digital Credit Scoring in Rural Finance: A Literature Review. Risks, 9(192). doi:https://doi.org/10.3390/risks9110192

Apostolos Ampountolas, T. N. (2021). A Machine Learning Approach for Micro-Credit Scoring. Risks, 9(3), 50. doi:https://doi.org/10.3390/risks9030050

B. Ghaddar and J. Naoum-Sawaya. (2018). High dimensional data classification and feature selection using support vector machines. European Journal of Operational Research,, 265(3), 993–1004.

Bhilare, A. C. (2018). Application of Ensemble Models in Credit Scoring Models. Business Perspectives and Research, 6(2), 129–141.

Breiman, L. (2001). Random Forests. Machine Learning, 45(5-32).

Chau, C. W. (2011). Rainfall–runoff modeling using artificial neural network coupled with singular spectrum analysis. Journal of Hydrology, 399(3-4), 394-409.

Chen, T. a. (2016). Xgboost: A scalable tree boosting system. 22nd ACM Sigkdd International International Conference on Knowledge Discovery and Data Mining, 13–17, pp. 785–94. San Francisco, CA, USA.

Dumitrescu E., ,. H. (2022). Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research, 297(3), 1178-1192. doi:https://doi.org/10.1016/j.ejor.2021.06.053

Financial Regulatory Commission. (2022). FInancial Market Review 2022. Ulaanbaatar, Mongolia: Financial Regulatory Commission. Retrieved from http://www.frc.mn/resource/frc/Document/2023/03/31/0gt0qm9v25qrpo9z/MARKET%20REVIEW%202022.pdf

Germanno Teles, J. J. (2020). Artificial neural network and Bayesian network models for credit risk prediction. Journal of Artificial Intelligence and Systems, 2, 118-132. doi:https://doi.org/10.33969/AIS.2020.21008

Gernmanno Teles, J. J. (2020). Machine learning and decision support system on credit scoring. Neural Computing and Applications, 32, 9809–9826. doi:https://doi.org/10.1007/s00521-019-04537-7

Geurts, P. D. (2006). Extremely randomized trees. Machine Learning, 63, 3-42.

Gilbert, R. (2013). CHAID and Earlier Supervised Tree Methods. Contemporary Issues in Exploratory Data Mining in the Behavioral Sciences, 48–74.

Green, D. S. (1966). Signal detection theory and psychophysics. New York: John Wiley and Sons.

J. Vaidya, H. Y. (2007). “Privacy-preserving SVM classification. Knowledge and Information Systems, 14(2), 161–178.

Keramati, A. &. (2011). A proposed classification of data mining techniques in credit scoring. In Proc. 2011 Int. Conf. on Industrial Engineering and Operations Management, (pp. 416-424). Kuala Lumpur, Malaysia . Retrieved 2011

Khalil Masmoudi, L. A. (2019). Credit risk modeling using Bayesian network with a latent variable. Expert Systems with Applications, 127, 157–166.

Leong, C. (2016). Credit Risk Scoring with Bayesian Network Models. Computional Economics, 423–446. doi:https://doi.org/10.1007/s10614-015-9505-8

Lkhagvadorj M., O.-E. N. (2018). Credit Scoring with Deep Learning. 4th International Conference on Information, System and Convergence Applications. Bangkok, Thailand.

Lkhagvadorj Munkhdalai, T. M.-E. (2019). An Empirical Comparison of Machine-Learning Methods on Bank Client Credit Assessments. Sustainability, 11(3), 699. doi:https://doi.org/10.3390/su11030699

Loh, W.-Y. (2011). Classification and regression trees. Data Mining and Knowledge Discovery, 1(1), 14-23. doi:https://doi.org/10.1002/widm.8

Mandukhai Ganbat, E. B.-E. (2021). Effect of Psychological Factors on Credit Risk: A Case Study of the Microlending Service in Mongolia. Behavioral Sciences, 11(4). doi:https://doi.org/10.3390/bs11040047

Mohammad Amini, J. R. (2015). A Cluster-Based Data Balancing Ensemble Classifier for Response Modeling in Bank Direct Marketing. International Journal of Computational Intelligence and Applications, 14(04), 1550022. doi:https://doi.org/10.1142/S1469026815500224

Pan, Y. W. (2021). Application Analysis of Credit Scoring of Financial Institutions Based on Machines Learning Model. Complexity, 2021, 12. doi:https://doi.org/10.1155/2021/9222617

Poon, M. (2007). Scorecards as Devices for Consumer Credit: The Case of Fair, Isaac & Company Incorporated. The Sociological Review , 55(2), 284-306. doi:https://doi.org/10.1111/j.1467-954X.2007.00740.x

Pradeep R., ,. K. (2023). Digital Lending Market Research, Global Opportunity Analysis and Industry Forecast, 2023-2032. Allied Market Research.

Shin, S. K. (2022). Two stage credit scoring using Bayesian approach. Journal of Big Data, 9, 106. doi:https://doi.org/10.1186/s40537-022-00665-5

Shobana A., K. N. (2023). Bank loan prediction using KNN algorithm. International Research Journal of Modernization in Engineering Technology and Science, 5(03). doi:https://www.doi.org/10.56726/IRJMETS34927

Somvanshi M., a. C. (2016). A review of machine learning techniques using decision tree and support vector machine. Proceedings of the 2016 International Conference on Computing Communication Control and Automation (ICCUBEA) , (pp. 1–7). Pune, India.

V. Anantha Nageswaran, S. K. (2021). White paper on Digital Lending: Issues, Challenges and Proposed Solutions. Indicus Centre for Financial Inclusion. Retrieved from https://indicus.org/admin/pdf_doc/White-Paper-DIgital-Lending-April-2021.pdf

Vapnik, V. N. (1997). The Support Vector method. Artificial Neural Networks — ICANN'97 (pp. 261–271). Berlin: Springer. doi:https://doi.org/10.1007%2FBFb0020166

Zhang, G. B. (1998). Forecasting with artificial neural networks: The state of the art. International Journal of Forecasting, 14, 35-62.

Downloads

Published

2024-05-25