Simple demographics calculation

  • Context: High School 
  • Thread starter Thread starter DaveC426913
  • Start date Start date
  • Tags Tags
    Calculation
Click For Summary

Discussion Overview

The discussion revolves around how to adjust demographic statistics from a user questionnaire to reflect gender representation within specific categories. Participants explore methods for expressing gender bias numerically, considering both overall demographics and category-specific data.

Discussion Character

  • Exploratory
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant presents demographic data from a questionnaire, noting a higher percentage of female users overall but a male-dominated category within the data.
  • Another participant simplifies the problem with hypothetical numbers to illustrate the concept of gender representation in categories.
  • A method is proposed to express relative gender bias using probabilities based on the number of males and females in a category compared to overall numbers.
  • Concerns are raised about the interpretation of the data, with one participant suggesting that the survey results may misrepresent the gender distribution.
  • A later reply calculates adjusted percentages for gender representation in a specific category, arriving at a conclusion of 56% male and 44% female based on the proposed method.

Areas of Agreement / Disagreement

Participants express differing views on how to represent the data and the implications of the gender distribution. While some methods are proposed, there is no consensus on the best approach to express the demographic adjustments.

Contextual Notes

Participants discuss various methods for calculating and expressing gender bias, but the assumptions and limitations of these methods are not fully resolved. The discussion includes hypothetical scenarios that may not directly apply to the original data.

DaveC426913
Gold Member
2025 Award
Messages
24,488
Reaction score
8,752
Hi. I'm reporting some stats on my website, and I want to adjust my stats by the demographics. Been a long time since HS math... and I never studied stats anyway.

The following reads more complicated than it really is.


Of the 4095 users that filled out my questionnaire, 3255(79.5%) are female, 772(19%) are male and 68(1.5%) did not specify.

My users lump themselves into 1 of 14 categories (the categories are unimportant). We need look at only one category for now.

91 users that specified a gender were lumped into category #1. Of those 91 users, 70 were female and 21 were male. or 77% females and 23% males. That's more males than average. And that is the interesting piece of information I want to capture here: that Category 1 is a male-dominated category, despite the sheer number of female users that are in it. (I'll do the same for the other 13 categories.)

What I'm not sure about is how I demonstrate this difference in a graph with numbers. What numbers do I end up with after I've compensated for the gender bias in the above example?

I think I should end up with numbers like this - males: 54%, females:46% or thereabouts. I'm not sure how to get here from there.
 
Last edited:
Physics news on Phys.org
Wow, not one taker?
 
Let me see if I can simplify.

I have 1000 users: 900 female and 100 male.

Of ten categories, Category 1 has 80 females and 20 males.

If Cat 1 were average, there would be 10 males, but instead there are 20. Clearly, Category 1 is a predominantly male category.

How do I express this numerically? As a percent? A ratio?



Before you answer, can I use the same method of expression if the numbers were more realistic? eg:
Overall: Total: 1023; 833F, 380M
Cat 1: Total: 386; 253F, 143M
 
One candidate method for how to express the relative gender bias would be to consider the probability a person is in a category given that they are male, and the probability given that they are female. To use the numbers in your op, that would be 21/772 and 70/3255 (2.72% and 2.15%) respectively.

If you want to know what the male and female percentages for that category would be if there were equal numbers of males and females completing the questionnaire, find 100/(2.72+2.15). Then multiply that by 2.72 for males and 2.15 for females.
 
Why don't you just say you have a survey of women? Don't try to be so tricky with the numbers, and tell the truth about what you found.
 
Paula said:
Why don't you just say you have a survey of women? Don't try to be so tricky with the numbers, and tell the truth about what you found.

?? Are you implying something about my motives? :mad: [Edit] Oh I see, you didn't bother to read the first post. :rolleyes: :rolleyes: Holy Jeez, way to go off half-cocked.

1] I don't have a survey of women. I have a survey where 3 women for every 1 man participated. My second post is exaggerating and simplifying the numbers so I can make an algorithm.

2] If category 1 was soccer-watchers, I wish to show that, within my sample, soccer-watchers were predominantly male, whereas category 2 (tennis) was predominantly female.
 
Last edited:
Nimz said:
One candidate method for how to express the relative gender bias would be to consider the probability a person is in a category given that they are male, and the probability given that they are female. To use the numbers in your op, that would be 21/772 and 70/3255 (2.72% and 2.15%) respectively.
Ah! So to get numbers that add up to 100...

2.15x2.72=4.87 (total for that category)
100/4.87=20.53
20.53x2.15=44.14
20.53x2.72=55.85

So. For category 1: 56% were male, 44% were female.

Perfect!
 

Similar threads

  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 12 ·
Replies
12
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 13 ·
Replies
13
Views
5K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 45 ·
2
Replies
45
Views
5K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 4 ·
Replies
4
Views
16K