Nguyen Minh Khiem * , Huynh Van Tu and Nguyen Hung Dung

* Corresponding author (nmkhiem@cit.ctu.edu.vn)

Main Article Content

Abstract

A number of factors influence a student's attainment of graduation. Besides scholastic performance within the academic curriculum, other variables such as living circumstances, gender, and choice of major significantly contribute to the probability of achieving graduation. The capacity to forecast academic performance at the time of graduation holds profound importance for universities, especially in discerning the influential factors that contribute to a student's successful completion of their educational pursuits. This study employs multiple machine learning algorithms, including K-nearest neighbor, Neural network, Decision tree, Random forest, and Gradient boosting, to prognosticate the graduation outcomes of 7,837 undergraduate students from Can Tho University during the academic year 2022. These selected students were enrolled in 16 colleges and institutes affiliated with Can Tho University. The efficacy of the employed algorithms was assessed through performance evaluation metrics encompassing accuracy, precision, recall, and F-measure. Furthermore, a 15-fold cross-validation technique was employed for validation. The findings revealed that the Random forest model yielded the most reliable predictions. The factors that significantly impact graduation grades comprise GPA, training point, residential address, college, major, and gender. Based on the experimental findings, these factors were ranked to ascertain their effects on student graduation.

Keywords: Academic influenced factor, graduation grade, machine learning

Article Details

References

AlHarthi, H., & Kadhim, A. (2011). Predicting the difficulties faced by students living outside the university campus in light of some demographic variables. Journal of Qualitative Educational Research, 18(3), 306–430.

Abdul-Wahab, S. A., Salem, N. M., Yetilmezsoy, K., & Fadlallah, S. O. (2019). Students’ reluctance to attend Office hours: Reasons and suggested solutions. Journal of Educational and Psychological Studies, 13(4), 715–732.

Al-Mahrouqia, & R., Karadsheh, M. A. (2016). Sultan Qaboos University students reasons of being underObservation. Humanities and Social Sciences, 43(3), 2343–2360.

Alsayed, A.O., Rahim, M.S.M., AlBidewi, I., Hussain, M., Jabeen, S.H., Alromema, N., Hussain,S., & Jibril, M. L. (2021). Selection of the right undergraduate major by students using supervised learning techniques. Appl. Sci., 11, 10639. DOI: 10.3390/app112210639

AlGhanboosi, S., & Kadhim, A. (2004). Problems of Academic Supervision at Sultan Qaboos University from Professors and students perspectives. Journal of Education, 10(2), 39–75.

Al-Alawi, L., Al Shaqsi, J., & Tarhini, A. (2023). Using machine learning to predict factors affecting academic performance: the case of college students on academic probation. Educ. Inf. Technol. DOI: https://doi.org/10.1007/s10639-023-11700-0

Breiman, L. (2001). Random forests. Mach. Learn. DOI: https://doi.org/10.1023/A:1010933404324

Green, L., & Celkan, G. (2011). Student demographic characteristics and how they relate to student achievement. Procedia - Social and Behavioral Sciences, 15, 341-345.

Ibrahim, A., & Al-Barwani, T. A. (1993). A study of Omani secondary school Certificate Examination as a predictor of academic performance of Sultan Qaboos University. Research in College Teaching Practicum Research in Sultan Qaboos University, 1(1), 1–29.

Kursa, M., & Rudnicki, W. (2011). The All Relevant Feature Selection using Random Forest. Computer science, Artificial Intelligence. https://doi.org/10.48550/arXiv.1106.5112

Kumar, R., & Sharma, A. (2017). Data mining in education: A review. International Journal of Mechanical Engineering and Information Technology, 5(1), 1843–1845.

Mustafa, Y. (2022). Educational data mining: prediction of students' academic performance using machine learning algorithms. Smart Learn. Environ. 9(11). https://doi.org/10.1186/s40561-022-00192-z

Natekin, A., & Knoll, A. (2013). Gradient boosting machines. Front Neuro-robot, 7, 21. https://doi.org/10.3389/fnbot.2013.00021

Nick, D. (2016). The effect of university attended on graduates’ labour market prospects: A field study of Great Britain. Economics of Education Review, 52. DOI: https://doi.org/10.1016/j.econedurev.2016.03.001.

Nguyen, H. D., Duc, T., Sang, V., Diem, N., Hung, N., & Nha, T. T (2023). Knowledge Management for Information Querying System in Education via the Combination of Rela-Ops Model and Knowledge Graph. Journal of Cases on Information Technology, 25(1) pp.1-17. http://doi.org/10.4018/JCIT.324113

Priyadarshini, M., Gurnam, K. S., Hoon, T. S., Geethanjali, N., & Fook, C. Y. (2022). Key Factors Influencing Graduation on Time Among Postgraduate Students: A PLS-SEM Approach. Asian Journal of University Education, 18(1), 51-64. DOI: https://doi.org/10.24191/ajue.v18i1.17169.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O.,… & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. The Journal of machine Learning research, 12, 2825-2830

Powers, D. M. W. (2020). Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. ArXiv abs/2010.16061.

Roksa, J., & Kinsley, P. (2019). The Role of Family Support in Facilitating Academic Success of Low-Income Students. Research in Higher Education, 60(4), 415–437. http://www.jstor.org/stable/45180388

Solis, O., & Durband, D. B. (2015). Financial support and its impact on undergraduate student financial satisfaction. College Student Journal, 49, 93-105.

Sekeroglu, B., Abiyev, R., Ilhan, A., Arslan, M., & Idoko, J. B. (2021). Systematic literature review on machine learning and student performance prediction: Critical gaps and possible remedies. Applied Sciences, 11(22), 10907

Vidiyala, R. (2023, Jul 18). Performance metrics for classification machine learning problems. https://towardsdatascience.com/performance-metrics-for-classification-machine-learning-problems-97e7e774a007.

Wook, M., Yusof, Z. M., & Nazri, M. Z. A. (2017) Educational data mining acceptance among undergraduate students. Educ. Inf. Technol., 22, 1195–1216.

Yakubu, M. N., & Abubakar, A. M. (2022). Applying machine learning approach to predict students’performance in higher educational institutions. Kybernetes, 51(2), 916–934. https://doi.org/10.1108/K-12-2020-0865.

Zhu, Y., Xu, S., Wang, W., Zhang, L., Liu, D., Liu, Z., & Xu, Y. (2022). The impact of online and offline learning motivation on learning performance: the mediating role of positive academic emotion. Education and Information Technologies, 27(7), 8921-8938.

Zhang, Z. (2016). Introduction to machine learning: k-nearest neighbors. Annals of Translational Medicine, 4(11), 218. https ://doi.org/10.21037 /atm.2016.03.37

Zou J, Han Y, & So, S. S. (2008). Overview of artificial neural networks. Methods Mol Biol., 458, 15–23.