Do Larger Samples Really Lead to More Precise Estimates? A Simulation Study

Table 2. Correlation between Sample Size and Selected Statistics for the First 37 Samples

Table 3. Correlation between Sample Size and Selected Statistics for the Last 37 Samples

Table 4. Correlation between Sample Size and Significance of the Difference

Table 5. Supplementary Statistical Evidence

Table 2 shows the correlation between SS and each of the Mean, SD, SEM and CI for the first 37 samples, which represent the upper half of the data. Relative to the results for the whole data in Table 1, Table 2 shows stronger correlations between SS and Mean (r = -0.205, p = .224) and between SS and SD (r = -0.129, p = .447). The correlations between SS and SEM (r = -0.835, p < .001) and between SS and CI (r = -0.812, p < .001) are also stronger for the upper half than for the whole data in Table 1. Table 3 shows the correlation between SS and each of the Mean, SD, SEM and CI for the bottom half of the data. Relative to Table 1, Table 3 shows stronger, positive correlations between SS and Mean (r = 0.220, p = .198) and between SS and SD (r = 0.196, p = .252).
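The half-split correlations described above can be sketched in a few lines. This is a minimal illustration, not the study's procedure: the population (the integers 1 to 500, chosen only because its mean is 250.5), the grid of 74 sample sizes, the seed, and the use of 1.96 × SEM as the CI half-width are all assumptions, and `pearson_r` and `simulate` are hypothetical helper names.

```python
import math
import random
import statistics

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def simulate(sizes, population, seed=0):
    """Draw one sample per size; return (n, mean, sd, sem, ci_half_width) rows."""
    rng = random.Random(seed)
    rows = []
    for n in sizes:
        sample = rng.sample(population, n)  # sampling without replacement
        mean = statistics.fmean(sample)
        sd = statistics.stdev(sample)
        sem = sd / math.sqrt(n)             # no finite-population correction
        rows.append((n, mean, sd, sem, 1.96 * sem))
    return rows

population = list(range(1, 501))   # assumed population, mean 250.5
sizes = list(range(10, 380, 5))    # assumed grid: 74 sizes, 10 to 375
rows = simulate(sizes, population)

half = len(rows) // 2
for label, part in (("smaller half", rows[:half]), ("larger half", rows[half:])):
    ns = [row[0] for row in part]
    for name, col in (("Mean", 1), ("SD", 2), ("SEM", 3), ("CI", 4)):
        r = pearson_r(ns, [row[col] for row in part])
        print(f"{label:12s} SS vs {name:4s} r = {r:+.3f}")
```

Because SEM is SD divided by the square root of n, its correlation with sample size should come out strongly negative in both halves, while the SS-Mean and SS-SD correlations depend on the random draw.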

It is evident that the SS-Mean and SS-SD correlations for the upper half and those for the bottom half run in opposite directions and are nearly equal in strength. This explains the negligible correlations between SS and each of the Mean and SD in Table 1: two opposite correlations of almost the same strength in the two halves of a dataset cancel out when the halves are pooled. Hence the underlying SS-Mean and SS-SD correlations are actually stronger than Table 1 suggests and are considerable. The statistical insignificance of the SS-Mean and SS-SD correlations in Table 2 and Table 3 is attributable to the small number of data points involved in the computation (i.e. N = 73).
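The cancellation argument can be illustrated with a toy dataset. The sketch below uses made-up, deliberately exaggerated data (a perfect V shape, so the within-half correlations are -1 and +1 rather than the roughly ±0.2 reported here); `pearson_r` is a hypothetical helper, and nothing in it reproduces the study's values.

```python
import math
import statistics

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical data: 74 samples ordered by size. The statistic falls with
# size in the first half and rises with size in the second half.
sizes = list(range(1, 75))
stat = [abs(n - 37.5) for n in sizes]

r_upper = pearson_r(sizes[:37], stat[:37])   # first 37 samples
r_lower = pearson_r(sizes[37:], stat[37:])   # last 37 samples
r_pooled = pearson_r(sizes, stat)            # all 74 samples together

print(f"upper half r = {r_upper:+.3f}")
print(f"lower half r = {r_lower:+.3f}")
print(f"pooled     r = {r_pooled:+.3f}")
```

The pooled coefficient is essentially zero even though each half shows a clear trend, which is the pattern the text attributes to Table 1.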

Table 4 shows a positive and significant correlation, at the 5% significance level, between SS and the p-value (significance of the difference) (r = .252, p = .032). This implies that the p-value increases as sample size increases. Since the p-value is expected to be greater than the level of significance to confirm that the sample Mean approximates the population Mean, this correlation implies that, as samples grow, the p-value moves towards a value that supports the equation X = x.

In Table 5, the deviation of the sample means from the population mean is greater in the upper half of the data than in the bottom half. Note that the upper half contains the smaller samples whereas the bottom half contains the larger samples. The lesson of Table 5 is that, although some sample means deviate from the population mean in both halves, the deviation is larger in the upper half. Sample means in the bottom half are thus closer to 250.50. Hence, larger samples (in the bottom half) better approximate the population parameter.
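A comparison of this kind can be sketched as follows. The population (integers 1 to 500, whose mean is 250.5), the grid of sample sizes, the seed, and the use of the mean absolute deviation as the summary are assumptions made for illustration; they are not taken from Table 5.

```python
import random
import statistics

POP_MEAN = 250.5
population = list(range(1, 501))   # assumed population with mean 250.5
sizes = list(range(10, 380, 5))    # assumed grid: 74 sizes, smallest first

rng = random.Random(1)
# Absolute deviation of each sample mean from the population mean.
deviations = [
    abs(statistics.fmean(rng.sample(population, n)) - POP_MEAN)
    for n in sizes
]

half = len(sizes) // 2
dev_small = statistics.fmean(deviations[:half])   # smaller samples
dev_large = statistics.fmean(deviations[half:])   # larger samples

print(f"mean |deviation|, smaller samples: {dev_small:.2f}")
print(f"mean |deviation|, larger samples:  {dev_large:.2f}")
```

Under these assumptions the half with the smaller samples should show the larger average deviation, in line with the reading of Table 5 given above.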

Figure 1. Sample Size vs. Mean

Table 6. One Sample Descriptive Statistics

Table 7. Statistics of One-Sample Significance Test

Figure 2. Sample Size vs. Standard Deviation

Figure 3. Sample Size vs. Significance (p-value)

Table 8. Normality of Population and Samples
