Article Open Access

Naïve Bayes Algorithm Based on Forward and Backward Selection for Predicting Mortgage Loan Application Status

(1) * Andriano Andriano Mail (Institut Teknologi Sepuluh Nopember, Surabaya, Indonesia, 60111, Indonesia)
(2) Harison Harison Mail (Universitas Riau, Pekanbaru, Indonesia, 28131, Indonesia)
*Corresponding author

Abstract


This study discusses about the classification analysis of the status of home ownership credit at one of the banks in Pekanbaru City using the naïve Bayes algorithm. The purpose of this study is to compare the results of naïve Bayes classification with variable selection using forward and backward selection without variable selection. Variables from the selection results that are included in the best model with transform data use Weight of Evidence method are the type of customer, residential status, last education, and occupation. The results show that the naïve bayes algorithm with variable selection is better than without variable selection. This is shown from the value of the confusion matrix, namely accuracy and specificity, which reached 57,10% and 72,32%, although the sensitivity value was relatively small, 33,10%.


Keywords


Naïve bayes algorithm; forward selection; backward selection; weight of evidence; confusion matrix

   

DOI

https://doi.org/10.33122/ejeset.v6i1.684
      

Article metrics

Abstract views : 314

   

Cite

   

References


Aduda, J., & Obondy, S. (2020). Credit Risk Management and Efficiency of Savings and Credit Cooperative Societies: A Review of Literature. Journal of Applied Finance and Banking, 11(1), 99–120. https://doi.org/10.47260/jafb/1117

Aggrawal, R., & Pal, S. (2021). Elimination and backward selection of features (p- value technique) in prediction of heart disease by using machine learning algorithms. Turkish Journal of Computer and Mathematics Education, 12(6), 2650–2665

Anderson, A., & Semmelroth, D. (2019). Statistics for big data for dummies. New Jersey: Wiley.

Bhatia, P. (2019). Data mining and data warehousing: principles and practical techniques (1st ed.). Cambridge: Cambridge University Press.

Bramer, M. (2016). Principle of data mining (3rd ed.). Hampshire: Springer.

Bustami. (2014). Penerapan algoritma naive bayes untuk mengklasifikasi data nasabah asuransi. Jurnal Informatika, 8(1), 885–898.

Chatterjee, S., & Hadi, A. S. (2012). Regression analysis by example (5th ed.). New Jersey: Wiley.

Christensen, R. (2016). Analysis of variance, design, and regression: Linear modeling for unbalanced data (2nd ed.). New York: CRC Press.

Desyanita, L., & Wibowo, A. (2020). Pemodelan sistem prediksi kelayakan pengajuan kredit kepemilikan rumah dengan metode C4.5 dan Naive Bayes. Jurnal Ilmiah Elektronika dan Komputer, 13(2), 10–22

Eweoya, I. O., Adebiyi, A. A., Azeta, A. A., Chidozie, F., Agono, F. O., & Guembe,

Eweoya, I. O., Adebiyi, A. A., Azeta, A. A., Chidozie, F., Agono, F. O., & Guembe, B. (2019). A Naive Bayes approach to fraud prediction in loan default. Journal of Physics: Conference Series, 1299(1), 012038. https://doi.org/10.1088/1742-6596/1299/1/012038

Han, J., Kamber, M., & Pei, J. (2012). Data mining: Data mining concepts and techniques (3rd ed.). Waltham: Morgan Kaufmann.

Hasan, M. (2017). Prediksi tingkat kelancaran pembayaran kredit bank menggunakan algoritma Naive Bayes berbasis forward selection. ILKOM Jurnal Ilmiah, 9(3), 317–324.

Kantardics, M. (2020). Data mining: Concept, models, methods, and algorithms (3rd ed.). New Jersey: Wiley-IEEE Press.

Kasmir. (2003). Dasar - dasar perbankan (1st ed.). Jakarta: PT RajaGrafindo Persada.

Kusuma, A. S., Welda, W., & Juliana, I. K. (2021). Penentuan Lokasi Fasilitas Kesehatan Strategis Menggunakan Metode Naive Bayes pada RSU Bintang. INFORMAL Informatics Journal, 6(2), 52. https://doi.org/10.19184/isj.v6i2.23798

Larner, A. J. (2021). The 2x2 matrix: contingency, confusion, and the metrics of binary classification. Liverpool: Springer.

Larose, D., & Larose, C. (2014). Discovering knowledge in data: An introduction to data mining (2nd ed.). Hoboken: John Wiley & Sons, Inc.

Lavrushin, O., & Sokolinskaya, N. (2020). Confidence Level and Credit Risk Analysis in Russian Banks. Banks and Bank Systems, 15(2), 38–46. https://doi.org/10.21511/bbs.15(2).2020.04

Marcos, H., & Hidayah, I. (2014). Evaluasi algoritme untuk klasifikasi nasabah kredit Bank "X" menggunakan beberapa algoritme klasifikasi. Prosiding Seminar Nasional Teknologi Informasi dan Multimedia 2014, STMIK AMIKOM Yogyakarta.

Mendenhall, W., & Sincich, T. (2011). A second course in statistics regression analysis (7th ed.). Boston: Prentice Hall.

Montgomery. (2001). Design and analysis of experiments (5th ed.). New York: John Wiley & Sons.

Montgomery, Peck, E. A., & Vining, G. G. (2012). Introduction to linear regression analysis (5th ed.). New Jersey: Wiley.

Pattekari, S. A., & Parveen, A. (2012). Prediction system for heart disease using naive bayes. International Journal of Advanced Computer and Mathematical Sciences, 3(3), 290–294.

Purnomo, A., Barata, M. A., Soeleman, M. A., & Alzami, F. (2020). Adding feature selection on naïve bayes to increase accuracy on classification heart attack disease. Journal of Physics: Conference Series, 1511(1).

Rachmat. (2020). Prediksi kelayakan kredit kepemilikan rumah menggunakan metode K-NN berbasis forward selection. JESSI (Journal of Embedded System, Security and Intelligent System), 1(2), 1–8.

Rivai, V., & Veithzal, A. (2007). Credit management hanbook: Teori, konsep, prosedur, dan aplikasi panduan praktis mahasiswa, bankir, dan nasabah (2nd ed.). Jakarta: PT. RajaGrafindo Persada.

Saber, G. A. F., & Lee, A. J. (2003). Linear regression analysis (2nd ed.). New Jersey: Wiley.

Setiawan, J. T. (2025). Analisis keputusan pembelian rumah dengan meminimalkan risiko kredit dengan metode Naïve Bayes. Cerdika: Jurnal Ilmiah Indonesia, 5(4), 1705–1717. https://cerdika.publikasiindonesia.id/index.php/cerdika/index

Sidiqqi, N. (2005). Credit risk scorecards developing and implementing intelligent credit scoring. New Jersey: John Wiley & Sons.

Situmorang, D. J., & Riyanti, R. S. (2023). Analisis pengaruh risiko kredit dan risiko likuiditas terhadap profitabilitas perbankan pada masa pandemi Covid-19. Jurnal Ilmiah MEA (Manajemen, Ekonomi, dan Akuntansi), 7(2), 394–405. https://doi.org/10.31955/mea.v7i2.2988

Suhartono, S., Halimah, A., Kuspriyono, T., Abdurrachman, Hakim, L., & Yanti, V. A. (2023). Pengaruh Resiko Kredit Terhadap Profitabilitas Bank (Studi pada Bank Mandiri, Bank BNI dan Bank BRI). Jurnal Akuntansi dan Ekonomi, 8(1), 20–30. https://doi.org/10.29407/jae.v8i2.18906

Sumanto, S., Marita, L. S., Mazia, L., & Ratnasari, T. W. (2021). Analisis kelayakan kredit rumah menggunakan metode Naive Bayes untuk mengurangi kredit macet. Applied Information Systems and Management (AISM), 4(1), 17–22.

Suryono, C. D. P., Handayani, S. R., & Dwiatmanto. (2016). Evaluasi sistem pemberian kredit pemilikan rumah (KPR) dalam mengantisipasi terjadinya kredit bermasalah (Studi pada PT. Bank Jatim (Persero) Tbk Cabang Malang). Jurnal Administrasi Bisnis (JAB), 32(1), 1–7.

Wahyuningsih, S., & Utari, D. R. (2018). Perbandingan metode K-Nearest Neighbor, Naïve Bayes dan Decision Tree untuk prediksi kelayakan pemberian kredit. Konferensi Nasional Sistem Informasi 2018, STMIK Atma Luhur Pangkalpinang, 8–9 Maret 2018, 619–624.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Andriano Andriano, Harison Harison

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

 
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0