Improbable Grading Results in DOD Academy: 200 Students Receive A

  • Context: Graduate 
  • Thread starter Thread starter promethius
  • Start date Start date
  • Tags Tags
    Statistics
Click For Summary

Discussion Overview

The discussion revolves around the statistical improbability of 200 students receiving an A grade in a series of 38 performance tests at a DOD academy. Participants explore the implications of this grading outcome, questioning the fairness and difficulty of the tests involved.

Discussion Character

  • Debate/contested
  • Statistical reasoning
  • Technical explanation

Main Points Raised

  • One instructor claims that it is statistically improbable for all 200 students to receive an A across 38 tests, given their varying intelligence levels.
  • Some participants suggest that if P is the probability of a student receiving an A on all tests, then the probability of all students achieving this outcome is P^200, requiring P to be as high as 0.996 for it to be plausible.
  • Others argue that the statistical analysis is invalid without knowing the difficulty of the tests and the likelihood of individual students achieving A's.
  • There is a call for more information, specifically the complete set of grades and an approximation of what the typical grade should be based on course difficulty.
  • Some participants assert that the results could indicate either that the tests were very easy or that the grading was not fair, without implying any cheating.
  • Concerns are raised about the representativeness of the sample, suggesting that if the students were not randomly selected, the results may be biased.

Areas of Agreement / Disagreement

Participants generally disagree on the validity of the statistical reasoning presented. While some agree on the improbability of the results, others challenge the assumptions and the need for more data to draw any conclusions.

Contextual Notes

Limitations include the lack of specific data on individual test difficulties and student performance, as well as the absence of a clear methodology for assessing the significance of the grading outcomes.

promethius
Messages
1
Reaction score
0
I am an instructor in a DOD academy. I am trying to show that the way that students are being graded in a particular performance test is a statistical improbability.

I have 200 students, all of which received a final grade of A in a course which consisted of 38 performance tests. These students consisted of 6 different classes which conducted these tests.

the academy works fon a graduate level 10 point grading system, i.e. 100-90 =A, 89-80 = B, 79-70 =C, 69 and below failure.

My logic tells me that it is impossible for 200 human beings with varying intelligence levels, all to receive the grade of A for 38 individual performance tests.

I may not be providing sufficient amount of information, but if anyone out there can help, it would greatly appreciated.

Thanks in advance to all!
 
Physics news on Phys.org
If P is the chance of a student getting A's on all 38 tests, then the probability of your event is P^200. For even odds of every student getting all A's, P would have to be as high as 0.996. Not very likely, unless the tests were all extremely easy.
 
mXSCNT said:
If P is the chance of a student getting A's on all 38 tests, then the probability of your event is P^200. For even odds of every student getting all A's, P would have to be as high as 0.996. Not very likely, unless the tests were all extremely easy.
But that last, "unless the tests were all extremely easy", is what makes such a statistical analysis invalid. You cannot say that the results are a "statistical improbability" without addressing just what it is that is being tested and how likely each individual student is to get an A on each individual test. I've known some P.E. classes where just showing up for the test guarenteed you an A!
 
mXSCNT said:
If P is the chance of a student getting A's on all 38 tests, then the probability of your event is P^200. For even odds of every student getting all A's, P would have to be as high as 0.996. Not very likely, unless the tests were all extremely easy.

that is not good statistical reasoning. we're not trying to find the probablitiy of getting straight A's for an individual. we need an actual test for significance.

promethius is right, we need some more info: First, we need the complete set of numerical grades for each student (this can be anonymous, if desired; we only need the numbers). Second, we need you to give an approximation of what the typical grade for a student should be (based on the difficulty of the course). this doesn't have to be exact, as there can never be an "exact estimate", but give us an idea of what you think the average grade should be.

Provide these two components, then we can get into the specifics.
 
But that last, "unless the tests were all extremely easy", is what makes such a statistical analysis invalid.
Nonsense, I made a correct statement. I did not assert that there was necessarily any cheating going on. I simply said that either the tests were all extremely easy, or the result was very unlikely under the assumption that the grades were fair. One of those things is certainly true.

If you want something more statistical, you can observe that if you estimate the chance of a student getting an A at 0.986 or below, then the null hypothesis that the students' grades were fair is rejected at the 0.05 significance level.
 
mXSCNT said:
Nonsense, I made a correct statement. I did not assert that there was necessarily any cheating going on. I simply said that either the tests were all extremely easy, or the result was very unlikely under the assumption that the grades were fair. One of those things is certainly true.

we are comparing a whole group, not "the probability of a random student getting x number of A's in a row is y".

how do you even know what to reject or not when you don't even have the data?
 
Last edited:
HallsofIvy and aznshark4 are correct in their reasoning.

promethius said:
My logic tells me that it is impossible for 200 human beings with varying intelligence levels, all to receive the grade of A for 38 individual performance tests.


First, just to be a finicky arse, unless those 200 subjects were randomly selected, so there was no significant difference between them, would the experiment be properly conducted. I'm willing to bet that testees in this DOD environment would bias the sample.

Second, we need you to give an approximation of what the typical grade for a student should be (based on the difficulty of the course).

Exactly. There are tests that could evaluate the two samples--explaining those tests and why they are valid to a group of non-statistics students could be harder than the tests themselves, however...
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
Replies
98
Views
9K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 56 ·
2
Replies
56
Views
10K
  • · Replies 3 ·
Replies
3
Views
1K