Question regarding probability and normal distributions.

Click For Summary

Discussion Overview

The discussion revolves around the measurement of student ability through testing, specifically comparing the effectiveness of multiple smaller tests versus a single larger test. The focus is on the statistical implications of using normally distributed scores and the impact of test independence and correlation on the overall assessment of a student's true ability.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant proposes a scenario where a student's true ability is represented by a normal distribution, suggesting that multiple tests could provide a more accurate measure of ability than a single test.
  • Another participant highlights the need to consider the correlation between test scores, arguing that the assumption of independence may not hold since the same student takes both tests.
  • A different viewpoint emphasizes that the dependence on the individual taking the tests is crucial for measuring ability, suggesting that spreading tests over several days could mitigate day-to-day variations in performance.
  • One participant expresses skepticism about the effectiveness of splitting tests into smaller assessments, noting that without sufficient information, the nature of the dependence between scores cannot be modeled accurately.

Areas of Agreement / Disagreement

Participants express differing views on the independence of test scores and the effectiveness of multiple tests in measuring true ability. There is no consensus on whether splitting tests into smaller assessments would eliminate dependence or improve accuracy.

Contextual Notes

Participants acknowledge the complexities involved in modeling test score dependence and the implications of day-to-day variations, but do not resolve these issues. The discussion remains open-ended regarding the statistical modeling of test scores.

cerealkiller
Messages
7
Reaction score
0
Hello Mathematicians!

I'm doing some work on obtaining true measures of ability for students, and am trying to find a simple mathematical example that would show that a student's true ability is obtained by having a few equally weighted tests rather than one big test.

The example I'm thinking of is something along the lines of:

A student's "true" ability is 70/100, but if they sit an exam there will be a slight error in their ability measurement. Say that their mark will be normally distributed with a mean of 70 and a standard deviation of 5. So there is a 68% chance that their mark is between 65 and 75.

Now rather than sit a single exam, say that they sit 2 exams instead - both of which their mark will come from a normal distribution with mean 70 and standard deviation 5. The exams will be equally weighted as 50% of their total mark.

Now earlier, if they sat a single exam, there would be a 68% chance that their mark would be between 65 and 75. Now that they are sitting 2 exams each weighted at 50%, what would the probability be that their total mark is between 65 and 75?

Also, what about if there were 3 exams of equal weighting, etc.?

Thanks for the help guys.
 
Physics news on Phys.org
So you have two scores T1 and T2, identically distributed, and want to calculate T = .5T1 + .5T2 and say something about its distribution?

The difficulty is that there is no way you can assume the two test scores are independent, since the same person takes the two tests. You'd need to make some assumption about the correlation between the two scores.
 
statdad said:
The difficulty is that there is no way you can assume the two test scores are independent, since the same person takes the two tests.
But it is exactly the dependence on the person taking it that the OP is trying to measure so that he can determine the student's ability. If the tests are spread out over several days, we can assume that the day-to-day variations even out. If the test results are low over several days (perhaps the student is sick for a few weeks), then we may have to conclude that the student is just less capable during that time.

I would say that dividing the test questions into several tests taken on separate days would definitely reduce the effect of day-to-day variations. I think that the central limit theorem could be used to show that the average error rate due to daily variations would approach zero as the testing is spread out over more days.
 
"I would say that dividing the test questions into several tests taken on separate days would definitely reduce the effect of day-to-day variations. I think that the central limit theorem could be used to show that the average error rate due to daily variations would approach zero as the testing is spread out over more days."

I doubt that splitting things into mini test would eliminate the dependence. The problem is that there is not enough information to allow us to model the form the dependence takes.
 

Similar threads

Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
Replies
2
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K