# Homework Help: Help regarding our school project, a statistical survey on education

1. Jul 20, 2007

### loom91

Hi,

The students of statistics in my high-school have to do a project. The topic we have chosen is Educational Performance in Kolkata Schools and Its Relation to Gender and Gender Interactions. I outline our goals below:

1)Collect data about Madhyamik results (a centralised exam organised by the state education board to pass the 10th grade). We will sample marks from all-boys schools, all-girls schools and co-educational schools. We have selected the schools to be of similar average educational performance. The average socio-economic condition of the students is also approximately the same, hopefully eliminating that variable.

2)We will examine various parameters of the marks, with emphasis on the difference between the boys schools and the girls schools. These include central tendencies, standard deviations, skewness etc. We will also examine the myth that girls are relatively stronger at humanities subjects while boys are stronger at science subjects. We may also examine if the population is heterogeneous (that is, whether 'good students' and 'bad students' form two distinctly distributed populations).

3)We will then analyse the data obtained from the co-educational schools and try to determine if the co-educational environment lessens the difference between boys and girls.

Now, we are all grossly inexperienced and this is the first time we will attempt a statistical study instead of working in out classrooms with provided data. I'm eagerly seeking suggestions from experienced persons about possible pitfalls and how to make our results statistically meaningful.

I'm also looking for specific help on the following topics:

i)What is a suitable measure of whether girls are stronger at some subjects while boys at other subjects? I'm thinking of comparing the percentage of total marks obtained in one subject, standardised against the whole population. For example, consider the variable X = percentage of total marks earned in History+Geography. Next, we define the standardised (wrt the entire population) variate corresponding to X, let it be Z. Now we compute the mean of Z over the girls schools ($E_1(Z)$) and the mean over the boys schools ($E_2(Z)$).

If the first value is larger than the second value (it seems one will have to be positive and the other negative), then we may say that girls prefer humanities more over other subjects than boys. Next we can do the same analysis on the boys vs girls population in coed schools and see if the difference is less. By using the absolute marks instead of expressing it as percentage of total marks, we can also compare the relative performance (as opposed to preference) of boys and girls in humanities. The same can be done for languages and sciences. Is this a statistically sound measure (unlikely, since I just made it up)? What are the alternatives?

ii) What is a good way of identifying whether the population in a school indeed consists of discreet stratas? This could be good students/bad students (there is indication from previous results that this may be the case) or in coed schools boys/girls (very likely the case). In case of coed schools, there may even be four stratas: good boys, good girls, bad boys, bad girls. It will be interesting to study whether bad boys vs girls show more difference than good boys vs girls. All this sounds very pretty, but I don't know how to separate the population into stratas.

iii)Is there some easily available (preferably free) software that will let me do all this analysis (brownie points for fitting probability distributions and graphing)? It would be a nightmare to do this by hand since we usually work with less than 50 data points instead of several hundred.

iv)As it stand right now, we will sample two boys schools, two girls schools and one coed school. Is this enough to be statistically significant? How many data points should we sample from each school? Should this be a constant or proportional to the total number of students?

v)Finally, is the whole proposition so glaringly ridiculous that all serious statisticians will simply laugh at it? I hope not

I hope there are at least a few statisticians around who could help out. We have in all probability bitten off more than we can chew. But we are hoping to do some meaningful work publishable in a journal, so we need all the help we can get. I will also be very grateful if you give me the email of someone who may be able and willing to help. We will be marked in our school-finishing (and career determining) central exams, so this is very important to our whole class. Thanks a lot.

Molu

2. Jul 20, 2007

### cristo

Staff Emeritus
I'm not sure I've completely understood your post, but you seem to be firstly saying that you are undertaking some school project, and then going on to ask for email addresses of professionals who can help? Surely this defeats the point of it being a school project?

If I were you, I'd forget thinking about publishing your work, and concentrate on getting a good grade for your work. After all, the criteria for publishing in a journal and a highschool assignment are completely different.

I'm not sure that you should just look at means here. I would do the standard things and look at medians, ranges, quartiles, etc. to compare girls v. boys in each different scenario.

3. Jul 20, 2007

### loom91

What part don't you understand? Please tell me so I can explain. I'm looking for help because the project is to carry out the study and analyse the data, not to invent statistical measures. We just want to know the techniques, so we can do a meaningful and statistically sound analysis.

As for publication, we could actually get full marks in the exam by doing far, far less. We could have simply computed the correlation between physics and mathematics marks of a few students from our own school. That's easier, smaller and properly done gets full marks. But our headmaster said that instead of doing the same thing every year, we should do something more ambitious, like examine the impact of gender on education. That's when we came up with this idea. Since we are doing it, we might as well do it properly and make the results of some value. A psychologist who has published some work on this very topic told me that she will consider getting our results published if we do good work. In any case, even if publication is not an option we should still do our best. That includes using the best techniques of analysis available.

Of course we will be comparing the standard measures you mention. But in addition we're looking for a measure to decide whether the myth that girls prefer humanities subjects (History, Geography) over science subjects is true. I came up with a measure that should be able to measure whether humanities subjects contribute more to a girls total marks than they do to a boys total marks. Quite possibly the measure I came up with is inadequate, so I want to know what are the standard measures used to do such studies?

This is the joint work of a team of around 25 students, not me alone.

Molu

4. Jul 22, 2007

### loom91

Some help?

5. Jul 24, 2007

### loom91

Anyone to help out? We could do with some.