Question regarding probability and normal distributions.

Click For Summary
SUMMARY

This discussion centers on the statistical analysis of student ability measurement through multiple tests versus a single test. The example provided illustrates that a student's true ability, represented as a score of 70/100 with a normal distribution (mean = 70, standard deviation = 5), can be better assessed through multiple equally weighted exams. The conversation highlights the importance of understanding the correlation between test scores and suggests that using the Central Limit Theorem can help mitigate day-to-day variations in test performance. Ultimately, the discussion emphasizes the need for a robust statistical model to accurately measure student ability.

PREREQUISITES
  • Understanding of normal distribution and its properties
  • Familiarity with the Central Limit Theorem
  • Knowledge of statistical correlation and dependence
  • Basic concepts of probability theory
NEXT STEPS
  • Explore the implications of the Central Limit Theorem in educational assessments
  • Research methods for calculating the correlation between test scores
  • Learn about advanced statistical models for measuring student ability
  • Investigate the effects of test timing and frequency on performance variability
USEFUL FOR

Educators, statisticians, educational psychologists, and anyone involved in student assessment and measurement of academic ability.

cerealkiller
Messages
7
Reaction score
0
Hello Mathematicians!

I'm doing some work on obtaining true measures of ability for students, and am trying to find a simple mathematical example that would show that a student's true ability is obtained by having a few equally weighted tests rather than one big test.

The example I'm thinking of is something along the lines of:

A student's "true" ability is 70/100, but if they sit an exam there will be a slight error in their ability measurement. Say that their mark will be normally distributed with a mean of 70 and a standard deviation of 5. So there is a 68% chance that their mark is between 65 and 75.

Now rather than sit a single exam, say that they sit 2 exams instead - both of which their mark will come from a normal distribution with mean 70 and standard deviation 5. The exams will be equally weighted as 50% of their total mark.

Now earlier, if they sat a single exam, there would be a 68% chance that their mark would be between 65 and 75. Now that they are sitting 2 exams each weighted at 50%, what would the probability be that their total mark is between 65 and 75?

Also, what about if there were 3 exams of equal weighting, etc.?

Thanks for the help guys.
 
Physics news on Phys.org
So you have two scores T1 and T2, identically distributed, and want to calculate T = .5T1 + .5T2 and say something about its distribution?

The difficulty is that there is no way you can assume the two test scores are independent, since the same person takes the two tests. You'd need to make some assumption about the correlation between the two scores.
 
statdad said:
The difficulty is that there is no way you can assume the two test scores are independent, since the same person takes the two tests.
But it is exactly the dependence on the person taking it that the OP is trying to measure so that he can determine the student's ability. If the tests are spread out over several days, we can assume that the day-to-day variations even out. If the test results are low over several days (perhaps the student is sick for a few weeks), then we may have to conclude that the student is just less capable during that time.

I would say that dividing the test questions into several tests taken on separate days would definitely reduce the effect of day-to-day variations. I think that the central limit theorem could be used to show that the average error rate due to daily variations would approach zero as the testing is spread out over more days.
 
"I would say that dividing the test questions into several tests taken on separate days would definitely reduce the effect of day-to-day variations. I think that the central limit theorem could be used to show that the average error rate due to daily variations would approach zero as the testing is spread out over more days."

I doubt that splitting things into mini test would eliminate the dependence. The problem is that there is not enough information to allow us to model the form the dependence takes.
 

Similar threads

Replies
2
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
Replies
6
Views
2K
Replies
8
Views
3K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 9 ·
Replies
9
Views
3K