A Comparison Between Naïve Bayes and The K-Means Clustering Algorithm for The Application of Data Mining on The Admission of New Students
Main Article Content
Abstract
The process of admitting new students at Universitas Islam Negeri Raden Fatah each year produces a lot of new student data. so that there is an accumulation of student data continuously. The purpose of this study is to compare the K-Means Clustering Algorithm and Naïve Bayes on the admission of new students as well as being one of the bases for making decisions to determine the promotion strategy of each study program. The data mining method used is Knowledge Discovery in Database (KDD). The tools used are Rapid Miner. The attributes used are national examination score, school origin, and study programs. The new student data used from 2016 to 2019 was an 18.930 item. The results of this study used the K-Means Clustering Algorithm to produce 3 clusters, while the Naïve Bayes results resulted in an accuracy value of 9.08%.
Article Details
How to Cite
[1]
“A Comparison Between Naïve Bayes and The K-Means Clustering Algorithm for The Application of Data Mining on The Admission of New Students”, intelektualita, vol. 9, no. 1, pp. 51–62, Apr. 2020, doi: 10.19109/intelektualita.v9i1.5574.
Section
Articles
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work
How to Cite
[1]
“A Comparison Between Naïve Bayes and The K-Means Clustering Algorithm for The Application of Data Mining on The Admission of New Students”, intelektualita, vol. 9, no. 1, pp. 51–62, Apr. 2020, doi: 10.19109/intelektualita.v9i1.5574.
References
Ahmad, Fadhilah, Nur Hafieza Ismail, and Azwa Abdul Aziz. ‘The Prediction of Students’ Academic Performance Using Classification Data Mining Techniques’. Applied Mathematical Sciences 9, no. 129 (2015): 6415–26. https://doi.org/10.12988/ams.2015.53289.
Doshi, Mital, and Setu K Chaturvedi. ‘Correlation Based Feature Selection (CFS) Technique to Predict Student Perfromance’. International Journal of Computer Networks & Communications 6, no. 3 (2014): 197–206. https://doi.org/10.5121/ijcnc.2014.6315.
Fayyad, Usama, Gregory Piatetsky-Shapiro, and Padhraic Smyth. ‘From Data Mining to Knowledge Discovery in Databases’. AI Magazine 17, no. 3 (1996): 37–53.
He, Hong, and Yonghong Tan. ‘Corrigendum to “A Two-Stage Genetic Algorithm for Automatic Clustering” [Neurocomputing 81 (2012) 49-59]’. Neurocomputing, 2012. https://doi.org/10.1016/j.neucom.2012.02.009.
Hermawati. ‘Analisis Faktor-Faktor Self-Care Terhadap Status Nutrisi Pada Pasien Hemodialisa Di RSUD Dr. Moewardi Surakarta’. Jurnal IImiah Rekamedis Dan Informatika Kesehatan 7, no. 2 (2017): 29–35.
Jiawei Han, and Micheline Kamber. Data Mining: Concepts and Techniques Second Edition. Morgan Kaufmann. Vol. 53, 2013. https://doi.org/10.1017/CBO9781107415324.004.
Joko Suntoro. Data Mining Algoritma Dan Implementasi Dengan Pemrograman PHP. Jakarta, 2019.
Khotimah, Tutik. ‘PENGELOMPOKAN SURAT DALAM AL QUR�AN MENGGUNAKAN ALGORITMA K-MEANS’. Simetris: Jurnal Teknik Mesin, Elektro Dan Ilmu Komputer 5, no. 1 (2014): 83–88. https://doi.org/10.24176/simet.v5i1.141.
Mujawar, Sairabi, and H. P. R. Devale. ‘Prediction of Hear t Disease Using Modified K-Means and by Using Naive Bayes’. International Journal of Innovative Research in Computer and Communication Engineering 3, no. 11 (2015): 0396–0400. https://doi.org/10.15680/IJIRCCE.2015.
Rahayu, Flourensia Sapty, Rangga Deputra Ginantaka, and Y Sigit Purnomo Wp. ‘Analisis Manfaat Sistem Informasi Penerimaan Mahasiswa Baru Dengan Metode IT Balanced Scorecard’, no. January 2019 (2017). https://doi.org/10.21460/jutei.2017.12.21.
Soni, Jyoti, Ujma Ansari, Dipesh Sharma, and Sunita Soni. ‘Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction’. International Journal of Computer Applications 17, no. 8 (2011): 43–48. https://doi.org/10.5120/2237-2860.
Sundar, P.V.Praveen. ‘A COMPARATIVE STUDY FOR PREDICTING STUDENT’S ACADEMIC PERFORMANCE USING BAYESIAN NETWORK CLASSIFIERS’. IOSR Journal of Engineering 03, no. 02 (2013): 37–42. https://doi.org/10.9790/3021-03213742.
Sutabri, Tata. ‘Improving Naïve Bayes in Sentiment Analysis For Hotel Industry in Indonesia’. 2018 Third International Conference on Informatics and Computing (ICIC), 2016, 1–6.
Suyanto. Data Mining Untuk Klasifikasi Dan Klasterisasi Data. SpringerReference, 2017. https://doi.org/10.1007/SpringerReference_5414.
Vulandari, Tri Retno. ‘Pengertian Data Mining’. In Data Mining, Teori Dan Aplikasi Rapirminer, 1, 2017.
Yathongchai, Wilairat, Chusak Yathongchai, Kittisak Kerdprasop, and Nittaya Kerdprasop. ‘Factor Analysis with Data Mining Technique in Higher Educational Student Drop Out’. Latest Advances in Educational Technologies, 2012, 111–16.
Doshi, Mital, and Setu K Chaturvedi. ‘Correlation Based Feature Selection (CFS) Technique to Predict Student Perfromance’. International Journal of Computer Networks & Communications 6, no. 3 (2014): 197–206. https://doi.org/10.5121/ijcnc.2014.6315.
Fayyad, Usama, Gregory Piatetsky-Shapiro, and Padhraic Smyth. ‘From Data Mining to Knowledge Discovery in Databases’. AI Magazine 17, no. 3 (1996): 37–53.
He, Hong, and Yonghong Tan. ‘Corrigendum to “A Two-Stage Genetic Algorithm for Automatic Clustering” [Neurocomputing 81 (2012) 49-59]’. Neurocomputing, 2012. https://doi.org/10.1016/j.neucom.2012.02.009.
Hermawati. ‘Analisis Faktor-Faktor Self-Care Terhadap Status Nutrisi Pada Pasien Hemodialisa Di RSUD Dr. Moewardi Surakarta’. Jurnal IImiah Rekamedis Dan Informatika Kesehatan 7, no. 2 (2017): 29–35.
Jiawei Han, and Micheline Kamber. Data Mining: Concepts and Techniques Second Edition. Morgan Kaufmann. Vol. 53, 2013. https://doi.org/10.1017/CBO9781107415324.004.
Joko Suntoro. Data Mining Algoritma Dan Implementasi Dengan Pemrograman PHP. Jakarta, 2019.
Khotimah, Tutik. ‘PENGELOMPOKAN SURAT DALAM AL QUR�AN MENGGUNAKAN ALGORITMA K-MEANS’. Simetris: Jurnal Teknik Mesin, Elektro Dan Ilmu Komputer 5, no. 1 (2014): 83–88. https://doi.org/10.24176/simet.v5i1.141.
Mujawar, Sairabi, and H. P. R. Devale. ‘Prediction of Hear t Disease Using Modified K-Means and by Using Naive Bayes’. International Journal of Innovative Research in Computer and Communication Engineering 3, no. 11 (2015): 0396–0400. https://doi.org/10.15680/IJIRCCE.2015.
Rahayu, Flourensia Sapty, Rangga Deputra Ginantaka, and Y Sigit Purnomo Wp. ‘Analisis Manfaat Sistem Informasi Penerimaan Mahasiswa Baru Dengan Metode IT Balanced Scorecard’, no. January 2019 (2017). https://doi.org/10.21460/jutei.2017.12.21.
Soni, Jyoti, Ujma Ansari, Dipesh Sharma, and Sunita Soni. ‘Predictive Data Mining for Medical Diagnosis: An Overview of Heart Disease Prediction’. International Journal of Computer Applications 17, no. 8 (2011): 43–48. https://doi.org/10.5120/2237-2860.
Sundar, P.V.Praveen. ‘A COMPARATIVE STUDY FOR PREDICTING STUDENT’S ACADEMIC PERFORMANCE USING BAYESIAN NETWORK CLASSIFIERS’. IOSR Journal of Engineering 03, no. 02 (2013): 37–42. https://doi.org/10.9790/3021-03213742.
Sutabri, Tata. ‘Improving Naïve Bayes in Sentiment Analysis For Hotel Industry in Indonesia’. 2018 Third International Conference on Informatics and Computing (ICIC), 2016, 1–6.
Suyanto. Data Mining Untuk Klasifikasi Dan Klasterisasi Data. SpringerReference, 2017. https://doi.org/10.1007/SpringerReference_5414.
Vulandari, Tri Retno. ‘Pengertian Data Mining’. In Data Mining, Teori Dan Aplikasi Rapirminer, 1, 2017.
Yathongchai, Wilairat, Chusak Yathongchai, Kittisak Kerdprasop, and Nittaya Kerdprasop. ‘Factor Analysis with Data Mining Technique in Higher Educational Student Drop Out’. Latest Advances in Educational Technologies, 2012, 111–16.