A Hierarchical Linear Modelling of Teacher Effects on Academic Achievement in the Kenya Certificate of Primary Education Examination

Tables index

Veiw figure View Table

Kiswahili is the first language of the Swahili people (one of the 300-600 ethnic groups in Africa who speak Bantu languages ^[28]) and is a lingua franca of the African Great Lakes region and other parts of Southeast Africa, including Tanzania, Kenya, Uganda, Rwanda, Burundi, Mozambique, and the Democratic Republic of the Congo ^[29]. Kiswahili serves as a national language of four nations: Tanzania, Kenya, Uganda, and the Democratic Republic of the Congo and is also one of the working languages of the African Union and one of the official languages of the East African Community ^[30].

For Mathematics and Science, each of the 50 items has 2 points with 100 as the maximum score. The total score is 90 points for each of English and Kiswahili and the student’s final score is calculated as given in equation (1).

(1)

Where is the student’s cumulative score in English’s or Kiswahili’s sections A (scored out of 50) and B (scored out of 40) and is the maximum score in English while is the maximum score in Kiswahili.

The same applies for Social Studies and Religious Education with the final student’s score calculated as given in equation (2)

(2)

Where is the student’s cumulative score from the 90 items in Social Studies and Religious Education and is the maximum score in the same examination.

School, teacher and student questionnaires were fielded for data collection. Table 2 presents a description of the variables used in the three-level hierarchical linear modelling of teacher effects.

Table 2. Description of Variables Used in the Analysis of the Data

Download as

Tables index

Veiw figure View Table

For ease of interpretation, the outcome variable was transformed to a standard normal score with a Mean of zero (0) and Standard Deviation of one (1) so that the residuals at each level better approximate the normality assumptions of the models. This transformation allowed the effects of the covariates in the three-level HLM to be interpreted in terms of standard deviation units of our outcome variable ^{[25, 31]}. The untransformed variable ranged between 4 and 99 with mean score of 52.64 and standard deviation of 15.83.

2.3. Model Specification

As is usual for HLM, the starting point was to fit an unconditional model (also called intercept-only, null or empty model) in order to obtain the amounts of variance available for explanation at each level of the hierarchy ^{[5, 25]}. Consequently, a three-level variance components model was specified and fitted including only an intercept, school and teacher effects, and a student level residual error term. The model did not make any adjustments for predictor variables, only decomposing the total variance in the outcome variable (students’ running score on the five KCPE subjects) into separate school, teacher and student variance components. We followed Leckie ^[31] in specifying the unconditional/null model as:

(3)

Assuming that;

Where:

is the KCPE academic subject score for student nested within teacher in school , ;

is the mean score across all schools;

is the effect of school ;

is the effect of teacher ; and

is the student level residual error term.

The school, teacher effects and the student level residual errors are assumed independent and normally distributed with zero means and constant variances.

Table 3 presents the results of this null model. The random intercept, , predicts that a student’s z-score in any of the Five KCPE examination Subjects will be -0.02 (SE=0.09, p=.834). Since the outcome variable is approximately normalised, an estimated random intercept of zero, an estimated total variance of approximately one and a non significant intercept are all expected. The random part of the model presents the Variance Partition Coefficient (VPC) for each HLM level. Substituting the Variance Components into equation (4, 5 and 6), the VPC available for explanation at Student (), Teacher () and School () levels is 0.4388 (43.88%), 0.0493 (4.93%) and 0.5119 (51.19%) respectively.

(4)

(5)

(6)

The largest variance lay between schools (51.19%) while a substantial one lay among students within teachers (43.88%). Only 4.93% of the variance lay between teachers within schools suggesting that there was only modest variation in the five subjects between teachers. Most of the variation in students’ scores was seen between their schools and among themselves.

In adding predictors from the three levels to the unconditional model in equation (3), the authors followed Leckie ^[31] in specifying the full three-level random intercept slopes model as:

(7)

Table 3. Three Level Unconditional Model

Download as

Tables index

Veiw figure View Table

A description of these predictors is presented in Table 2.

Two new terms and were added to the model, so that the coefficients of the sex of the student and whether or not student kept negative company became and respectively and the community-level variance replaced by a matrix with three new parameters, , and . Three random intercept models were fitted in steps starting with Level-1 Student predictors subscripted that were estimated in Model-1. The Level-2 Teacher predictors subscripted were added in Model-2 while the Level-3 School predictors with the subscript were accounted for in Model-3. These predictor variables helped to explain the response variation allocated to the three levels as well as test the hypothesis regarding the relationship between teacher-level predictors and the outcome variable. The slope coefficients of these predictor variables were assumed fixed across Levels 2 and 3.

Table 4. Non-significant Variables Dropped from the Teacher-Model Only

Download as

Tables index

Veiw figure View Table

Models 4 and 5 fitted random slopes because an exploratory analysis indicated that the relationship between the students’ running score in the five subjects, the outcome variable (s17z), and student sex , 0=Male; 1=Female, and whether or not student kept negative company (), standardized score, -0.74 - 2.65, varied across Level-3. In model-4, three teacher-level predictors were omitted, i.e. teacher's age in years (t22a), number of in-service short courses attended by the teacher (t214) and the number of formal written tests in the teacher's subject (t227) but were included in the final Model-5 in order to determine their net value in explained variance.

Selection of “candidate predictors” to be included in the three-level models involved a two-step process informed by the need for parsimony in the final model. In the first step, a pair-wise correlation of all possible variables for each of the three levels was estimated. The second step involved running only those variables that were significantly correlated with the outcome variable in an exploratory Level-specific model while considering the hierarchical nature of the dataset ^{[5, 25, 32, 33]}. For the student-level, “candidate predictors” that were correlated with the outcome variable were fitted in a student-only model excluding teacher and school-level predictors. Only statistically significant variables at the 5% level were then preserved as the student-level predictors to be included in subsequent models and levels. This procedure was repeated at teacher and school levels. Table 4 presents the non-significant teacher-level predictors that were dropped leaving only t22a, t214 and t227 for modelling at the school level. STATA version 11.2 was used for data management and analysis with the “xtmixed” command.

3. Results and Discussion

3.1. Descriptive Statistics of the Variables used in the Modelling

Table 5 presents the descriptive statistics for the variables used in the modeling.

Table 5. Descriptive Statistics for Variables Used in the Modelling

Download as

Tables index

Veiw figure View Table

The focus in this paper was to assess the effect of teacher-level variables at Class 8. Three teacher predictors were modelled with the Class 8 teachers’ age having a Mean of 37.88 and standard deviation of 9.40, number of in-service courses attended in 2013 (M=0.88, SD=0.01) and the number of formal written tests in the teacher's subject (M=11.14, SD=6.44). All interval or ratio predictors under consideration had reasonably small standard errors of the mean suggesting that their calculated means were not quite far away from the true population mean. Using Multiple Correspondence Analysis with non-income or expenditure data as proposed by Filmer and Pritchett ^[34] and as computed in the Demographic Health Surveys ^{[35, 36]}, the students’ wealth index was determined from their reported home ownership of assets, such as cars motor cycles, electronics (including fridges), and bicycles among others; materials used for housing construction; source of lighting; and types of water access and sanitation facilities. This wealth index was then divided into three tertiles of 608 students each categorized as 1=High tertile (wealthiest of the three), 2=Middle tertile and 3= Low tertile (least wealthy of the three).

3.2. Bivariate Analysis

Pair-wise correlation was run between the students’ running z-score on the Five Subjects (s17z) and the full range of predictors as estimated in the final Model-5. Two of the school-level predictors: Sub-County (h16) and Boarding status at class 8 (h24a) presenting the ‘strongest’ correlation (r= -0.513, p<.001) and (r= -0.394, p<.001) respectively. These were however considered moderate using Taylor’s interpretation of correlation coefficients ^[37]. There was no added value in presenting the entire pair-wise matrix table since the rest of the predictors under consideration including the three teacher variables of interest in the models had weak but statistically significant correlations with the outcome variable: Teachers’ age, t22a, (r=-0.16, p>.001; Number of in-service courses attended, t214, (r=0.039, p>.001), and Number of formal written tests in the teacher’s subject, t227, (r=0.349, p>.001). No other correlation was stronger than r= -0.513.

The results of a an independent t-test with unequal variance, t(7885)= 56.62, p <.001, showed that Mumias Sub-County had a z-score of 0.43 standard deviation units above the mean compared with Kuria East Sub-County’s -0.61 units below the mean. The strength of the difference between the two z-score means as measured by R²was 0.29 which is considered a very large effect ^[38].

A one-way ANOVA was also run to determine if academic achievement in the Five Subjects was different comparing the schools’ boarding status at Class 8 where 1=Day school (n=7915, z=-0.17), 2=Boarding school, (n=395, z=1.25) and 3=Day and boarding school (n=810, z=1.00). There was a statistically significant difference between the groups as determined by the one-way ANOVA, F(9119) = 1013.48, p <0.001). The Bonferroni post-hoc test showed that the z-score unit difference between boarding and day schools was 1.41, p <.001, while that between Mixed day/ boarding and day schools was 1.17, p <.001. The difference between mixed day/ boarding and full boarding schools was -0.25, p <.001. The effect size, η²= 0.18 (measured using eta-squared), was considered large ^[38].

3.3. The Three-Level Random Slope School Model

Table 6 presents the HLM results. The effect of individual predictors remained pretty much the same, and statistically significant, across the three levels as well as through the Five Models.

Table 6. Three Level Random Slope School Model (Level-3)

Download as

Tables index

Veiw figure View Table