## An Analysis of Students Performance Using Genetic Algorithm

**T. Miranda Lakshmi**^{1}, **A. Martin**^{2,}, **V. Prasanna Venkatesan**^{2}

^{1}Department of Computer Science, Bharathiyar University, Coimbatore, India

^{2}Department of Banking Technology, Pondicherry University, Puducherry, India

### Abstract

Genetic algorithm plays a significant role, as search techniques for handling complex spaces, in many fields such as artificial intelligence, engineering, robotic, etc. The genetic processes on the natural evolution principles of populations have been fairly successful at solving problems and produce optimized solution from generation to generation. This is applied in students’ quantitative data analysis to identify the most impact factor in their performance in their curriculum. The results will help the educational institutions to improve the quality of teaching after evaluating the marks achieved by the students’ in academic career. This student analysis model considers the quantitative factors such as theoretical, mathematical, practical, departmental and other departmental marks to find the most impacting factor using genetic algorithm.

### At a glance: Figures

**Keywords:** students’ performance, quantitative factors, genetic algorithm, influencing parameter, students evaluation results

*Journal of Computer Sciences and Applications*, 2013 1 (4),
pp 75-79.

DOI: 10.12691/jcsa-1-4-3

Received December 31, 2012; Revised May 30, 2013; Accepted May 31, 2013

**Copyright**© 2013 Science and Education Publishing. All Rights Reserved.

### Cite this article:

- Lakshmi, T. Miranda, A. Martin, and V. Prasanna Venkatesan. "An Analysis of Students Performance Using Genetic Algorithm."
*Journal of Computer Sciences and Applications*1.4 (2013): 75-79.

- Lakshmi, T. M. , Martin, A. , & Venkatesan, V. P. (2013). An Analysis of Students Performance Using Genetic Algorithm.
*Journal of Computer Sciences and Applications*,*1*(4), 75-79.

- Lakshmi, T. Miranda, A. Martin, and V. Prasanna Venkatesan. "An Analysis of Students Performance Using Genetic Algorithm."
*Journal of Computer Sciences and Applications*1, no. 4 (2013): 75-79.

Import into BibTeX | Import into EndNote | Import into RefMan | Import into RefWorks |

### 1. Introduction

In present day’s educational system, performance in career growth is determined by the assessment and examination achievements. The assessment is carried out by curricular activities such as class test, viva, seminar, assignments, general proficiency, attendance, lab work and finally semester exams. On considering the quantitative factors such as attendance, CGPA (cumulative), marks obtained in theoretical , mathematical, elective and departmental papers, knowledge in learning, understanding capability, communication etc. thus the computation is done by real genetic algorithm is used to find the most influencing factor in those selected factors.

In order to formulate the equation that contains several important subjects and its marks, their priority values plays the major role in the sensitivity analysis. It can be added to the equation along with their parametric variables and the output returned from the equation is considered as the overall performance value. Now according to the passing criteria as fixed, the performance is classified as good, average and poor categories. The real genetic algorithm is applied RGSPAT (Real Genetic Student Performance Analysis Tool) that helps us to know where the variations happened can be identified. From the result, the focus towards the potential problems in the course is gained. Educators can also use this information to guide their way of evaluation and implementing according to curriculum changes.

The remaining section of this paper is organized as follows, section 2 describes the several techniques used in evaluation of student’s performance, section 3 discuss about designing principles of various components in our student performance analysis model, section 4 describes about application of real genetic algorithm to find the most important quantitative feature, section 5 about working model of our Analysis tool and section 6 concludes the paper with remarkable outcome.

### 2. Prior Research

This section discuss about various techniques that are used to evaluate and predict the performance of employees, staffs, teachers and students and implementation of real genetic algorithm in bank applications. They are as follows.

The most impact banking features in bankruptcy model are analyzed and results are produced. Deakin’s bankruptcy model and its various features such as net income, cash assets, current assets, sales and total assets are considered. On application of Genetic algorithm they predicted the most important feature in this Deakin model is cash assets/ total assets. This ratio is crucial for predicting the bankruptcy with 96% accuracy (Martin et al, 2011), similarly this approach is adapted here to find the most impacting parameter from the students results.

A fuzzy Expert System tested with 20 student’s marks obtained by semester-1 and semester-2 examinations, both inputs had same Triangular Membership Functions. The proposed a Fuzzy Expert System (FES) for student academic performance evaluation based on fuzzy logic techniques uses a suitable fuzzy inference mechanism and associated rule (Ramjeet et al, 2011). The academic performances are taken and compare the results (performance) with existing statistical method. Similarly FES was developed for evaluation of teachers’ performance in teaching activity is especially relevant for the academic institutions. It helps to define efficient plans to guarantee quality of teachers and the teaching learning process by considering students’ feedback,** **result, students’ attendance,** **teaching learning process and academic development of teachers (Chaudhari et al, 2012).

The various factors in his research that may likely influence the performance of a student were identified such as examination scores, age on admission, parental background and gender etc., The Artificial neural network (ANN) shown the potential for enhancing the effectiveness of a university admission system. The model was developed based on some selected input variables from the pre admission data of five different sets of university graduates. It achieved an accuracy of over 74%, which shows the potential efficacy of Artificial Neural Network as a prediction tool and a selection criterion for candidates seeking admission into a university (Oladokun et al, 2011). The adaptive Neuro-fuzzy technique introduced for prediction of student performance evaluation (Osman and Bahattin, 2009). Here Assessment of students’ achievements in the previous year to evaluate the learning process and performance of students is considered to be more meaningful than referring the marks individually. The previous and current student’s performance data are kept in a computer system that is an excellent source to achieve satisfactory results in evaluation of the students’ performance as a group, their learning level and evaluate the quality of education. It is illustrated with students’ achievements, academic performances, linguistic grades granted. It also gives the outcomes of the SAP for statistical and neuro-fuzzy modeling approaches with their linguistic values.

A methodology by the derivation of performance prediction indicators to deploying a simple student performance assessment and monitoring system focusing on performance monitoring of students’ continuous assessment (tests) and examination scores in order to predict their final achievement status upon graduation. This is done based on various data mining techniques (DMT) and the application of machine learning processes, rules are derived that enable the classification of students in their predicted classes (Emmanuel, 2007). The deployment of the prototyped solution, integrates measuring, ‘recycling’ and reporting procedures in the new system to optimize prediction accuracy. A fuzzy qualitative classification system for academic performance evaluation using the link analysis methodology (Tossapon et al, 2011), Unlike the conventional approach where fuzzy rules are used to encode information provided by training data, the proposed model considers involving variables, classes and their relations as elements of a social network that can be modeled as a weighted graph. The resulting linguistic descriptions of grade-specific likelihood are useful for further revisions regarding the formal partition of performance levels and students’ improvement.

Genetic algorithm along with decision tree is used in distant learning for analyzing the student academic performance. These concepts form the basis of the GATREE System (Papagelis and Kalles, 2001), which has been built on top of the GALIB library (Wall, 1996). The genetic operators on the tree representations are relatively straightforward. A mutation may modify the test attribute at a node or the class label at a leaf. A cross-over may substitute whole parts of a decision tree by parts of another decision tree. It is used to assist in the task of continuously monitoring a student’s performance with reference to the possibility of passing the final exam.

From these literature surveys, few researches have compared for analyzing the performance of student quantitative data and there is no considerable work on finding any equation model for to find the best parameter that influence the performance of the students. This research also analyzes the impact of quantitative parameters in student’s academic performance.

### 3. Genetic Approach for Students Performance Analysis

Genetic algorithms are now widely applied in science and engineering as adaptive algorithms for solving practical problems. The general acceptance is that GA is particularly suited to multidimensional global search problems where the search space potentially contains multiple local minima. Unlike other search methods, correlation between the search variables is not generally a problem. Genetic algorithm works modeling the parameters of a problem as real strings. The following figure depicts the various components of RGSPAT model.

**Figure 1**

**.**Design model of RGSPAT (Real genetic student performance analysis tool)

The proposed system consists of starting edge from the student analyzing data and ending edge by finding the most important parameter that is found by undergoing several processes as follows

● Analyzing the performance information in educational systems that are quantifiable such as attendance, internal assessment marks, project marks, previous semester marks, seminar, general proficiency, papers that are considered important in that course etc.

● On collecting those parameters, the equation construction that containing all those parameter values with weight values are designed.

● After designing the equation, the real-coded genetic algorithm is applied.

● The operations such as mutation and cross over are carried out.

● The best parameter is identified, once analyzing all the parameters through real genetic algorithm.

The quantitative factors that are considered into the analysis of students’ performance are given below:

● **PSM**** **Previous Semester Marks/Grade obtained in course

● **IA**** **Internal Assessment marks

● **SEM**** **Seminar Performance obtained. In each semester seminar are organized to check the performance of students.

● **ASS**** **Assignment marks. In each semester two assignments are given to students by each teacher.

● **GP**** **General Proficiency performance. Like seminar, in each semester general proficiency tests are organized* *

● **ATT** Attendance of Student. Minimum 70% attendance is compulsory to participate in End Semester Examination. But even through in special cases low attendance students also participate in End Semester Examination on genuine reason. Attendance is divided

● **PM**** **Project Work. Completion of the full project with report, presentation, system model.

● **T**** **Theoretical subject marks obtained in all semesters

● **M**** **Mathematical subject marks obtained in all semesters

● **E**** **Elective subject marks obtained in all semesters

● **D**** **Departmental subject marks obtained in all semesters

● **O**** **Other departmental subject marks in all semesters

● **PV**** **Performance value obtained from the equation as the result

Here RGSPAT tool that is developed for predicting student performance requires the quantitative attributes of student (i.e.) measurable variables like internal mark, seminar mark, attendance, daily test marks etc. are listed below.

Now the equation used to analyze and categorize the performance of the student consists of the input parameters such as attendance, internal assessments, Assignment, General proficiency, theory subject, mathematical subject, elective and departmental subject marks as mentioned above. The parameter which shows more variation in the evaluation of the performance using the algorithm, it is chosen as the optimum parameter. On finding the result for each student, we can generally find the most optimizing parameter for the student management and thereby taking necessary steps for improving the overall performance of the students.

### 4. Most Important Parameter Using RGA

This analysis is concerned with finding most important attribute that affects the performance of student. As the fore mentioned properties of RGA are highly advantageous, RGA for RGSPAT is designed using Crossover and Mutation process. For this experiment we have selected the quantitative factors among the student in college/school. The real-valued genetic algorithm (RGA) uses a real value as a parameter of the offspring in populations without performing coding and encoding process before calculates the fitness values of individuals. The performance analysis for RGSPAT model is given by the following equation.

(1) |

The working of real genetic algorithm in performance analysis is as follows.

a. [Start] Generate random population of attributes as chromosomes

b. [Fitness] Evaluate the fitness f(x) of each chromosome x in the population

c. [New population] Create a new population of attributes by repeating following steps until the new population is complete

i. [Selection] Select two parent chromosomes from a population according to their fitness (which satisfies the fitness function)

ii. [Crossover] with a crossover probability Pc crossover the parents to form a new offspring (children). For real values, linear crossover is performed. ax+(1-a)y, where a is priority value in the equation and x, y are real values

iii. [Mutation] with a mutation probability Pm mutate new offspring at each locus (position in chromosome), For real value mutation, add any random value as x + N(0,0.1)

iv [Accepting] Place new offspring in a new population

d. [Replace] Use new generated population for a further run of algorithm.

e. [Test] if the end conditions are satisfied, stop, and returns the best parameter in current population

f. [loop] go to step b.

Initially a population of chromosomes, each of which represents a potential solution to the problem at hand, is generated randomly and each of them is evaluated by finding its fitness. The next generation of same size is created by selecting more fit individuals from this population and by applying genetic operators like crossover and mutation to them. Mutation is an operator which creates a new individual by making a random change in the old one, whereas crossover creates new individuals by combining parts from multiple individuals. Classic mutation randomly alters a single gene, while crossover exchanges genetic material between two or more parents. This completes one generation and after repeating this procedure for a number of generations, due to selective pressure the algorithm converges and it yields a better solution.

### 5. Results and Discussion

Table 2 illustrates the sample iterative process of the genetic algorithm to find the best parameter from the population. The selected iterated values are given in the Table 2. The fitness value is chosen from these values. According to that the influencing parameter can be identified. In this application, we have chosen the performance fitness value as > 2.

The following diagram illustrates the working of RGSPAT model for analyzing the student performance in mathematical, technical, other department, and theoretical papers. The respective subject marks are loaded and their corresponding influence on students is shown in the graph. In fig 2, the mathematical subject marks are taken whose maximum value is 100, given in y axis. The x axis represents the corresponding quantitative factors in terms of numbers. Thus each subject marks are analyzed from their semester values and results are produced separately to find the most influencing parameter as given below.

In the above Figure 2, we have selected the 120 students of IT departments and the results are used to analyze the most important parameter. This graph represents that parameter 1 and 4 have high impact on the performance of the students (1 indicates mathematical subjects and 4 is Theoretical subjects). Thus those two subjects have given high importance to reach the better performance. It is observed from the graph that next most important subject is practical, other departmental, technical and internal assessments respectively. Thus this system can be implemented in the educational environment to evaluate the student results and to find the importance given to the subjects through analyzing their end results. Further research is to implement fuzzy clustering technique to group the students on the basis of their performance towards the selected parameters.

**Fig**

**ure**

**2**

**.**Working model of RGSPAT model

### 6. Conclusion

Thus real genetic process is successfully implemented to find the most influencing parameter that affects performance in the educational system. It will be a better solution for the classification, analyzing and evaluating the quantitative factors while course year and thereby predicting the students’ overall performance at the end of the semester examination. In this genetic process when we made changes in single parent characteristic we are able to find important ratios. This way of using Real Genetic Algorithm can be applied to any areas to find the most influencing ratios or parameters to predict the performance effectively. This research outlines the best features to develop a student performance model to analyze and can predict individual’s performance efficiently. It will also act as a self-assessment tool for the students to know their position and area where to get improve. Thus we will provide the successful tool to predict the student’s performance using genetic algorithm.

### References

[1] | A. Martin, V. Prasanna Venkatesan et al, “To find the most impact financial features for bankruptcy model using genetic algorithm”, International Conference on Advances in Engineering and Technology, (ICAET-2011), May 27-28, 2011. | ||

In article | |||

[2] | Ramjeet Singh Yadav, “Modeling Academic Performance Evaluation Using Soft Computing Techniques: A Fuzzy Logic Approach”, International Journal on Computer Science and Engineering (IJCSE), Vol. 3 No. 2 Feb 2011. | ||

In article | |||

[3] | O.K Chaudhari, P.G khot, “Soft computing model for academic performance of teachers using fuzzy logic”, British journal of applied science and technology, 2(2):213-226, 2012. | ||

In article | |||

[4] | V.O. Oladokun, “Predicting Students’ Academic Performance using Artificial Neural Network: A Case Study of an Engineering Course”, The Pacific Journal of Science and Technology, Volume 9. Number 1. May-June 2008 (Springer). | ||

In article | |||

[5] | Osman Taylan, Bahattin Karagozog, “An adaptive neuro-fuzzy model for prediction of student’s academic performance computers & Industrial Engineering”, 57 (2009) 732-741 | ||

In article | CrossRef | ||

[6] | Emmanuel N. Ogor “Student Academic Performance Monitoring and Evaluation Using Data Mining Techniques” published on Fourth Congress of Electronics, Robotics and Automotive Mechanics, 2009 | ||

In article | |||

[7] | Tossapon boongoe, Qiang shen and Chris price, , International Journal of Uncertainty, fuzzy qualitative link analysis for academic performance evaluation Fuzziness and Knowledge-Based Systems Vol. 19, No. 3 (2011) 559-585. | ||

In article | |||

[8] | Dimitris Kalles, Analyzing student performance in distance learning with genetic algorithms and decision trees, Hellenic Open University, Greece. | ||

In article | |||

[9] | Harvinder Singh Atwal, Predicting Individual StudentPerformance by Mining Assessment and Exam Data, 2010. | ||

In article | |||

[10] | Aavo Luuk Kersti Luuk, Predicting students’ academic performance in Aviation College from their admission test results, 2011 | ||

In article | |||

[11] | Mukta Paliwal, Usha A. Kumar, A study of academic performance of business school graduates using neural network and statistical techniques, Expert Systems with Applications 36 (2009) 7865-7872. | ||

In article | CrossRef | ||