Of Students Academic Performance Rates Using Artificial Neural Networks (ANNs)
O. C. Asogwa1, A. V. Oladugba2
1Department of Mathematics, Computer Science, Statistics and Informatics, Federal University Ndufu-Alike Ikwo
2Department of Statistics, University of Nigeria, Nsukka
Abstract
A model based on the multilayer perceptron algorithm was programmed. Evaluation on the test data showed that the programmed Artificial Neural Network model was able to correctly predict and classify the performance of students, with a Mean Correct Classification Rate (CCR) of 97.07%.
Keywords: mean correct classification rate, Artificial Neural Networks (ANNs), Predictive models
Received June 10, 2015; Revised July 07, 2015; Accepted July 23, 2015
Copyright © 2015 Science and Education Publishing. All Rights Reserved.
Cite this article:
- O. C. Asogwa, A. V. Oladugba. Of Students Academic Performance Rates Using Artificial Neural Networks (ANNs). American Journal of Applied Mathematics and Statistics. Vol. 3, No. 4, 2015, pp 151-155. https://pubs.sciepub.com/ajams/3/4/3
1. Introduction
Universities and other higher institutions of learning are known as breeding and grooming grounds for students’ academic pursuits. The paths of students’ academic performance therefore need to be monitored, in order to sustain the important roles students play in society upon graduation, which may be either to continue their studies in a post-graduate program or to become manpower for industry, government and the private sector. Students’ academic performance is thus critical in ensuring that these roles are maintained. This has motivated some higher institutions of learning to develop an interest in predicting the paths of students, thereby identifying which students will require assistance in order to graduate at the stipulated time, which will maintain their studies, and which may drop out of school. This interest is driven by the academic failure rate among students, which has led to a large number of debates [5].
Soft computing techniques, of which artificial neural networks are a part, have been recognized as attractive alternatives to the standard, well-established hard computing paradigms. Soft computing techniques, which emphasize gains in understanding system behavior in exchange for unnecessary precision, have been shown to solve complicated problems efficiently. They have also enabled the development of more efficient models that predict student academic performance more accurately than previously possible. Several soft computing techniques are popularly known and used, including Artificial Neural Networks, Decision Trees, and K-Means clustering.
The main objective of this work is to evaluate the accuracy, as a classifier, of the Artificial Neural Network model architecture developed in a statistical program (MATLAB R2009a), using its Mean Correct Classification Rate (CCR, %). A further objective is to report the order of significance of the predictors to the programmed model.
Several researchers have studied students’ academic performance. [13, 16, 19] and many others have carried out scholarly research on the prediction of students’ academic performance, whereas others, such as [3, 21], have used Artificial Neural Networks to model, predict and classify, using a number of factors that influence the academic performance of university students. [12] utilized a Multilayer Perceptron Neural Network to predict student performance. They used the average point scores of grade 12 students as inputs and the first-year college results as output. Their research showed that an Artificial Neural Network-based model is able to predict student performance in semester one with high accuracy. [14] used multiple Feed-Forward Neural Networks to predict students’ final achievements and to place them into two groups.
2. Materials and Methods
The data used in this work were secondary data collected from the student files of the Faculty of Veterinary Medicine, University of Ibadan, from 2009 to 2013. All student records with blank fields or incomplete information were deleted from the compiled data. A total of 420 records were collected and used.
Through an extensive review of the literature, a number of socio-economic, biological, environmental, academic, and other related factors, such as pre-admission requirements, that were considered to influence the academic performance and promotion rates of university students were identified. Some of these factors can be found in [4, 7, 9, 10, 15, 17].
The factors collected from the students’ records were gender, parents’ marital status, parents’ educational status, parents’ occupation, O’level result, age at entry, time delay before admission, type of secondary school attended, location of secondary school attended, post-UTME, physical impairment and performance outcome. They were carefully studied and harmonized into a manageable number suitable for computer coding within the context of Artificial Neural Network modeling. These influencing factors were categorized as input variables (units). The output variable, on the other hand, represents the performance rate (students that were promoted to final year, students that repeated fifth year and students that dropped down to fourth year). That is, the output was categorical. MATLAB R2009a was used as the statistical tool for the analysis.
A Multi-Layer Artificial Neural Network (ANN) model with a logistic transfer function, trained with the back-propagation algorithm, was programmed. The model was able to predict and classify students into three categories: those that were promoted to final year, those that repeated the class and those that were demoted to fourth year. A six-year course was considered in this work. This choice extends the research of [20], which recommended that the work be extended to other disciplines and that many more factors influencing students’ academic performance or pre-admission status be considered.
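The categorical output described above has to be turned into numbers before a network can be trained on it. The following is a minimal sketch of one common way to do this (one-hot coding), not the coding scheme used by the authors, which the text does not specify. The label strings and the function name `one_hot` are hypothetical and chosen only for illustration.

```python
# Hypothetical labels for the three performance outcomes named in the text.
CLASSES = ["promoted", "repeated", "demoted"]


def one_hot(label, classes=CLASSES):
    """Return a 0/1 vector with a single 1 at the position of `label`."""
    if label not in classes:
        raise ValueError(f"unknown performance outcome: {label!r}")
    return [1.0 if label == name else 0.0 for name in classes]


# Encode a few illustrative (made-up) outcomes, one target vector per student.
outcomes = ["promoted", "demoted", "repeated", "promoted"]
targets = [one_hot(label) for label in outcomes]
```

With this coding each output neuron corresponds to one performance category, which matches a network with three output units.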
3. The Neural Network Model
The Artificial Neural Network (ANN) model proposed by [2] was used in this research work. The model is given below, with a logistic transfer function.
(1) |
(2) |
(3) |
(4) |
4. Network Architecture and Design
Multilayer Perceptrons (MLPs) are layered feed-forward networks typically trained with static back-propagation. These networks have found their way into countless applications requiring static pattern classification. Therefore, given the computational capabilities of a multilayer perceptron as a classifier, a three-layered feed-forward neural network was programmed in this research work. The first layer (input level) comprised 17 neurons (processing elements), one for each profile parameter (input). The third layer (output level) comprised 3 neurons: one denoting “promoted students”, another “repeating students” and the third “dropped down students”, as seen in Fig. 1.
However, based on the recommendations of [6] and [11] that a network with one hidden layer is sufficient to model any complex system, the designed network model has only one hidden layer (see Fig. 1).
In this work, 70 neurons were used in the hidden layer, as the network performance was best at this number. The back-propagation learning algorithm was used for training the network. The logistic activation function was used at the hidden layer, which can be seen inside the hidden-layer shape in the figure together with the symbol of the summing unit, and the identity activation function was used at the output layer.
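The architecture described in this section (17 inputs, 70 logistic hidden units, 3 identity output units) can be sketched as a forward pass. This is an illustrative sketch only, not the authors' MATLAB program: the weight initialization, the bias terms, the random input vector and the helper names (`init_layer`, `forward`, `predict`) are assumptions, and training by back-propagation is not shown.

```python
import math
import random

# Layer sizes as stated in the text.
N_INPUT, N_HIDDEN, N_OUTPUT = 17, 70, 3


def logistic(t):
    """Logistic transfer function used at the hidden layer."""
    return 1.0 / (1.0 + math.exp(-t))


def init_layer(n_in, n_out, rng):
    """Small random weights and zero biases (initialization is an assumption)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def forward(x, hidden, output):
    """Propagate one input vector through the hidden and output layers."""
    w_h, b_h = hidden
    w_o, b_o = output
    # Hidden layer: weighted sum followed by the logistic function.
    h = [logistic(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(w_h, b_h)]
    # Output layer: weighted sum with identity activation.
    return [sum(w * hi for w, hi in zip(row, h)) + b
            for row, b in zip(w_o, b_o)]


def predict(x, hidden, output):
    """Index of the output neuron with the largest score."""
    scores = forward(x, hidden, output)
    return max(range(len(scores)), key=scores.__getitem__)


rng = random.Random(0)
hidden = init_layer(N_INPUT, N_HIDDEN, rng)
output = init_layer(N_HIDDEN, N_OUTPUT, rng)
x = [rng.random() for _ in range(N_INPUT)]  # made-up input profile
scores = forward(x, hidden, output)
label = predict(x, hidden, output)
```

Taking the largest of the three output scores as the predicted category is one simple decision rule; the text does not state which rule was used.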
5. The Data Set Grouping
In supervised training, the data are divided into three categories: the training set, the verification set (hold-out) and the testing set. The training set enables the system to observe relationships between the input data and the resulting outputs, so that it can learn the mapping from the inputs to the expected output.
A common heuristic is that the number of training records should be at least a factor of 10 larger than the number of network weights in order to classify test data with 90% accuracy [1]. A total of 420 student records were used in the analysis. About 58.8% of the total data (i.e. 247 candidates) were used as the training set, 31.0% (i.e. 130 candidates) as the testing set, and 10.2% (i.e. 43 candidates) for cross validation. Each network was run for 100 epochs.
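The partition sizes quoted above can be reproduced with a short sketch. The text does not say how records were assigned to each set, so the random shuffle, the seed and the function name `split_records` are assumptions made for illustration; integer IDs stand in for the student records.

```python
import random


def split_records(records, n_train=247, n_test=130, n_val=43, seed=0):
    """Shuffle the records and cut them into training, testing and validation sets."""
    if n_train + n_test + n_val != len(records):
        raise ValueError("partition sizes must add up to the number of records")
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)  # random assignment is an assumption
    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    val = shuffled[n_train + n_test:]
    return train, test, val


records = list(range(420))  # placeholder IDs for the 420 student records
train, test, val = split_records(records)

# Share of each partition in percent, rounded to one decimal place.
shares = [round(100 * len(part) / len(records), 1) for part in (train, test, val)]
```

The rounded shares agree with the percentages reported in the text (58.8%, 31.0% and 10.2%).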
6. Model Performance Measures
There can be many performance measures for predictors; the most important measure of performance is the prediction accuracy that can be achieved with the training data. The most frequently used is the Mean Correct Classification Rate (CCR) [8, 18], which is defined as
(5) |
A model with a high Correct Classification Rate has better performance. In general, CCR is used to judge the performance of a functional network classifier: the better classifier is the one with the higher CCR value.
7. Results
After training and cross validation, the network was tested with the test data set and the following results were obtained. Testing involves giving the input variable data to the network without the output variable results. The output from the network is then compared with the actual output data, and the mean correct classification rate is evaluated using equation (5). The comparison is summarized in the matrix below.
In Table 1, each row represents a case of interest and each cell represents the number of cases of that interest as evaluated by the classifier. The network accurately predicted 83 out of 83 promoted students (candidates who were promoted to final year), 10 out of 16 repeated students (candidates who repeated the class), and 29 out of 31 demoted students (candidates who were demoted) used to test the network’s topology. This gives an accuracy of 100% for promoted students, 62.5% for repeated students, and 93.5% for demoted students. This indicates a Mean Correct Classification Rate, or accuracy, of about 97.07% for the Artificial Neural Network model architecture developed, which is a good performance according to results in the literature, such as [1], [17] and [20].
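The per-class accuracies can be checked directly from the counts of correctly classified test cases. The sketch below uses only the counts stated in the text; the variable names are illustrative. It also computes the pooled share of correct predictions over all 130 test records. That pooled figure is simple arithmetic on the stated counts and need not coincide with the reported mean CCR, whose exact definition is given by equation (5).

```python
# (correctly classified, total) per class, as reported for the 130 test records.
counts = {
    "promoted": (83, 83),
    "repeated": (10, 16),
    "demoted": (29, 31),
}

# Per-class accuracy in percent.
per_class = {name: 100.0 * correct / total
             for name, (correct, total) in counts.items()}

# Pooled share of correct predictions over all test records, in percent.
total_correct = sum(correct for correct, _ in counts.values())
total_cases = sum(total for _, total in counts.values())
pooled = 100.0 * total_correct / total_cases
```

Rounded to one decimal place, the per-class values are 100.0, 62.5 and 93.5, matching the figures in the text.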
8. Conclusion
This paper has shown, through the CCR value, the potential of the Artificial Neural Network for accurate prediction and classification of the academic performance of students in higher institutions of learning, especially in the case study considered. The model was developed from selected input variables in the pre-admission data contained in the students’ records. It achieved an accuracy of about 97.07%, which shows the potential efficacy of the Artificial Neural Network as a prediction tool and a selection criterion for classifying students according to their rates of performance.
Acknowledgement
The Faculty of Veterinary Medicine, University of Ibadan, is gratefully acknowledged for making the data for this research available.
References
[1] Adefowoju, B. S. & Osofisan, A. O. (2004) Cocoa Production Forecasting Using Artificial Neural Networks. International Centre for Mathematics and Computer Science Nigeria. ICMCS117-136.
[2] Anders, U. (1996) Model selection in neural networks, ZEW Discussion Papers 96-21. Retrieved from https://hdl.handle.net/10419/29449.
[3] Arinze, B., SubbaNarasimha, P. N., & Anandarajam, M. (2000) The predictive accuracy of artificial neural networks and multiple regressions in the case of skewed data: Exploration of some issues. Expert Systems with Applications, 19, 117-123.
[4] Arsad, P. M., Buniyamin, N., Manan, J. L. A., & Hamzah, N. (2011) Proposed Academic Students’ Performance prediction model: A Malaysian case study. 3rd International Congress on Engineering Education (ICEED).
[5] Bresfelean, N. & Ghisoiu, N. (2005) Determining Students’ Academic Failure Profile Founded on Data Mining Methods, Proceedings of the ITI 2005 30th Int. Conf. on Information Technology Interfaces, 2005, Cavtat, Croatia, June 23rd - 26th, 2008, pp. 317-322.
[6] Cybenko, G. (1989) Approximation by superpositions of a sigmoidal function. Mathematics of Control, Signals, and Systems, vol. 2, no. 4, pp. 303-314, Dec. 1989.
[7] Diaz, A. L. (2003) Personal, family, and academic factors affecting low achievement in secondary school. Electronic Journal of Research in Educational Psychology and Psychopedagogy, Vol. 1, No. 1, pp. 43-66, April 2003.
[8] El-Sebakhy, E. A., Hadi, A. S. & Faisal, K. A. (2007) Iterative Least Squares Functional Networks Classifier. IEEE Transactions on Neural Networks, Vol. 18, No. 3, May 2007, pp. 844-850.
[9] Golding, P., & Donaldson, O. (2006) Predicting Academic Performance. Paper presented at the Frontiers in Education Conference, 36th Annual, 27-31.
[10] Hedjazi, Y., & Omidi, M. (2008) Factors affecting the Academic success of Agricultural Students at University of Tehran, Iran. Journal of Agricultural Science and Technology, Vol. 10, No. 3, pp. 205-214, April 2008.
[11] Hornik, K., Stinchcombe, M. & White, H. (1989) “Multilayer feed forward networks are universal approximators,” Neural Networks, Vol. 2, no. 5, pp. 359-366, Jul. 1989.
[12] Kanakana, G., & Olanrewaju, A. (2011) Predicting student performance in engineering education using an artificial neural network at Tshwane University of Technology, Proceedings of the ISEM, Stellenbosch, South Africa.
[13] Karamouzis, S. T. and Vrettos, A. (2009) Sensitivity Analysis of neural Network parameters for identifying the factors for college students’ success. World Congress on Computer Science and Information Engineering, 2009, Los Angeles, USA, March 31st - April 2nd, 2009, pp. 671-675.
[14] Lykourentzou, I., Giannoukos, I., Mpardis, G., Nikolopoulos, V., and Loumos, V. (2009) Early and dynamic student achievement prediction in e-learning courses using neural networks. Journal of the American Society for Information Science and Technology, 60 (2), 372-380.
[15] Mohsin, M. F. M., Wahab, M. H. A., Zaiyadi, M. F., Norwawi, N. M., & Hibadullah, C. F. (2010) An Investigation into Influence Factor of Student Programming Grade Using Association Rule Mining, Advances in Information Sciences and Service Sciences, Vol. 2 (2.3), June 2010.
[16] Naik, B. & Ragothaman, S. (2004) Using Neural Networks to predict MBA student success, College Student Journal, vol. 38, no. 1, pp. 143-149.
[17] Oladokun, V. O., Charles-Owaba, O. E. & Adebanjo, O. E. (2008) Predicting Student’s Academic performance using artificial neural network: A case study of an engineering course. The Pacific Journal of Science and Technology, 9 (1), pp. 72-79.
[18] Ritchie, S. G. & Oh, C. (2007) Recognizing vehicle classification information from blade sensor signature, Pattern Recognition Letters, vol. 28, no. 9, pp. 1041-1049, Jul. 2007.
[19] Siraj, F., and Abdoulha, M. A. (2009) Uncovering Hidden Information within University’s Student Enrollment Data Using Data Mining. Proceedings of the 2009 Third Asia International Conference on Modeling & Simulation, Bali, Indonesia, May 25th-29th, 2009, pp. 413-418.
[20] Thipsuda, W. & Pusadee, S. (2010) A comparison of classical Discriminant Analysis and Artificial Neural Network in predicting student graduation outcomes. Proceedings of the Second International Conference of Knowledge and Smart Technologies: 24-25.
[21] Walczak, S., & Sincich, T. (1999) A comparative analysis of regression and neural networks for university admissions. Information Sciences, 119 (1-2), 1-20.
APPENDIX
The factors considered in this work are listed in the table below: