"The distribution of heights is not Gaussian"

AI Thread Summary
The discussion centers on the statistical distribution of human heights, prompted by a 2008 Physics Forums thread about a child molester avoiding prison due to his short stature. One participant argues that the height distribution is Gaussian between 5'5" and 6'3", a range close to the 5'10" average for American men, while another claims it is not, calling it almost Boltzmannian instead. The conversation notes that while the distribution may approximate a Gaussian curve near the mean, it deviates at the extremes. The dialogue also touches on the history of statistical studies of human traits, referencing Quetelet's work on the normal distribution. Ultimately, the discussion underscores the nuances of applying statistical models to real-world data.
bluemoonKY
I was browsing old threads at Physics Forums, and I came across some information in a thread from 2008 that caught my interest. The thread is titled "Child molester avoids prison because he is short." PF member stickythighs wrote the following: "Since the average American man is 5'10", there are about equal numbers of men in Florida 5'5" and shorter as there are men 6'3" and taller." In post #25 of the thread, PF member Gokul43201 responded: "Not true. The distribution of heights is not Gaussian. It's almost Boltzmannian, and 5 inches is way bigger than the standard deviation - so a Gaussian approximation could be quite off when you go that far away from the mean. And it is..."

The distribution of human heights is the classic example that statistics textbooks and other textbooks use to illustrate a Gaussian distribution. By the way, a Gaussian distribution = a normal distribution. I admit that the distribution of human heights is not Gaussian at the tails. In reality, there are far more people at 5+ standard deviations both above and below the mean than a perfectly Gaussian distribution of human heights would predict. In other words, a perfectly Gaussian distribution would show fewer people at 5 SD from the mean than there actually are in real life.

However, in the example that stickythighs and Gokul were discussing, the comparison was between male heights of 5'5" and 6'3". Human height distribution IS Gaussian when you are as close to the mean as 5'5" and 6'3". Therefore, why did Gokul deny that the human height distribution is Gaussian in the 5'5"-6'3" range?
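To make the symmetry claim concrete, here is a minimal sketch of what a purely Gaussian model predicts for the two tails. The mean of 70 inches (5'10") comes from the quoted post; the standard deviation of 3 inches is an assumption chosen only for illustration and is not taken from the thread.

```python
from statistics import NormalDist

MEAN_IN = 70.0  # 5'10", the average quoted in the original thread
SD_IN = 3.0     # ASSUMED standard deviation, for illustration only

heights = NormalDist(mu=MEAN_IN, sigma=SD_IN)

# Fraction of men at or below 5'5" (65 in) and at or above 6'3" (75 in)
p_short = heights.cdf(65.0)
p_tall = 1.0 - heights.cdf(75.0)

print(f"P(height <= 5'5\") = {p_short:.4f}")
print(f"P(height >= 6'3\") = {p_tall:.4f}")
```

Under a Gaussian the two fractions are equal by symmetry, which is exactly the step that the quoted reply disputes. Whether real height data behave this way at 5 inches from the mean is the open question of the thread.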

Here is a link to the thread that I am referencing: https://www.physicsforums.com/threads/child-molester-avoids-prison-because-he-is-short.249825/page-2

Why did Gokul say that the distribution of human heights is almost Boltzmannian? Clearly it's not.

Note to moderators: The thread that I am referencing is about a child molester avoiding prison because he is short. Its main topic is NOT whether the distribution of human height is Gaussian. The correct etiquette and protocol for a digression into another area is to create a new thread on the digression, not to hijack the previous thread. There is no thread that I am aware of specifically about whether the distribution of human heights is Gaussian. Therefore, I should not be breaking any rules by creating this thread. It's a new topic.
 
Have a look at the graph here.
https://www.khanacademy.org/science.../a/what-is-the-maxwell-boltzmann-distribution

This is the Maxwell-Boltzmann distribution. It has a hard lower limit (zero) and a long tail that runs out along the x axis toward large values.

So, with normal human adults there is a minimum height, and a maximum that lies farther from the mean than the minimum does. I'm excluding dwarfism and gigantism due to abnormalities. The result is skewed: the graph is not symmetric about the mean.

What Gokul43201 (no longer active on the forums) said was that the data were a poor fit to a Gaussian curve, and close to (but not really) an M-B distribution.
His link to what he cites as a model of the distribution is broken.

How you find a model to fit an existing distribution is interesting. @Dale works with this kind of thing. Maybe he can help clarify what you do with 'almost-fits' situations.

There is also this: Limpert, E; Stahel, W; Abbt, M (2001). "Lognormal distributions across the sciences: keys and clues". BioScience. 51 (5): 341–352
which I cannot get to show in a link. It says that a good human height distribution model is lognormal.
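As a rough illustration of what a lognormal model would change, the sketch below builds a lognormal with the same mean and standard deviation as a Gaussian and compares the two tails. The mean of 70 inches is from the original thread; the standard deviation of 3 inches is an assumption, and none of these numbers are the parameters fitted in the Limpert et al. paper.

```python
import math
from statistics import NormalDist

MEAN_IN = 70.0  # 5'10", from the thread
SD_IN = 3.0     # ASSUMED, for illustration only

# Lognormal parameters that reproduce the same mean and SD:
#   sigma^2 = ln(1 + (sd/mean)^2),  mu = ln(mean) - sigma^2 / 2
sigma2 = math.log(1.0 + (SD_IN / MEAN_IN) ** 2)
log_sigma = math.sqrt(sigma2)
log_mu = math.log(MEAN_IN) - sigma2 / 2.0

# ln(height) is normal under the lognormal model
log_heights = NormalDist(mu=log_mu, sigma=log_sigma)

p_short = log_heights.cdf(math.log(65.0))        # P(height <= 5'5")
p_tall = 1.0 - log_heights.cdf(math.log(75.0))   # P(height >= 6'3")

print(f"lognormal P(height <= 5'5\") = {p_short:.4f}")
print(f"lognormal P(height >= 6'3\") = {p_tall:.4f}")
print(f"tall/short ratio            = {p_tall / p_short:.2f}")
```

With these assumed numbers the upper tail comes out somewhat heavier than the lower tail, so the "equal numbers" statement no longer holds exactly, although the two fractions stay the same order of magnitude.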

Also, it is not valid to claim that one part of the data follows distribution A and another part follows distribution B, and then apply statistical methods to each part as if they were separate.
Since I cannot get all the facts, I cannot give you a good answer.
 
bluemoonKY said:
The distribution of human heights is the classic example that statistics textbooks and other textbooks use to show a Gaussian Distribution.

You can credit Lambert Adolphe Jacques Quetelet with bringing statistical studies into the humanities. His conclusion was that human traits are normally distributed around those of the average man. That assumption has been followed ever since in many areas.

https://en.wikipedia.org/wiki/Adolphe_Quetelet
In his 1835 text on social physics, in which he presented his theory of human variance around the average, with human traits being distributed according to a normal curve, he proposed that normal variation provided a basis for the idea that populations produce sufficient variation for artificial or natural selection to operate.[7]

In terms of influence over later public health agendas, one of Quetelet's lasting legacies was the establishment of a simple measure for classifying people's weight relative to an ideal for their height. His proposal, the body mass index (or Quetelet index), has endured with minor variations to the present day.[8] Anthropometric data is used in modern applications and referenced in the development of every consumer-based product

You may also want to read this.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2831262/
Our data are based on complete enumerations, not samples. The number of young men whose heights were tabulated in the Torre reports rose from about 250,000 individuals for the cohorts born before 1860 to over half a million for those born after 1905. Overall, the heights of over 21 million individuals were tabulated in these records.
See figure 3 for the tabulated distribution of 20-year-old males (1900): raw data, adjusted data, and fitted normal distribution.

For other populations, the height curve could well not follow a normal distribution.
e.g. age: take the whole population from age 0 to old age (60, 70, ...). What is that going to look like?
e.g. sex: males and females follow two different curves. Bring them together and one gets a flat-ish top.
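The flat-ish top can be sketched with a 50/50 mixture of two normal curves. All numbers here are hypothetical: a male mean of 70 inches, a female mean of 64.5 inches, and a common standard deviation of 3 inches are assumptions for illustration, not data from the linked paper.

```python
from statistics import NormalDist

# HYPOTHETICAL parameters (inches), chosen only for illustration
men = NormalDist(mu=70.0, sigma=3.0)
women = NormalDist(mu=64.5, sigma=3.0)

def mixture_pdf(x):
    """Density of a population made of half men, half women."""
    return 0.5 * men.pdf(x) + 0.5 * women.pdf(x)

# A single normal with the same overall mean and variance as the mixture
mix_mean = 0.5 * (men.mean + women.mean)
mix_var = 0.5 * (men.variance + women.variance) + 0.25 * (men.mean - women.mean) ** 2
matched = NormalDist(mu=mix_mean, sigma=mix_var ** 0.5)

# Scan 55 in to 80 in for the highest point of the mixture density
grid = [55.0 + 0.05 * i for i in range(501)]
mix_peak = max(mixture_pdf(x) for x in grid)
matched_peak = matched.pdf(mix_mean)

print(f"peak of mixture density:        {mix_peak:.4f}")
print(f"peak of matched single normal:  {matched_peak:.4f}")
```

With these assumed values the mixture peaks lower than the single normal that has the same mean and variance, which is what a flattened top looks like. With a larger gap between the two means the mixture would become visibly two-humped instead.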
 
bluemoonKY said:
Human height distribution IS Gaussian when you are so close to the mean as 5'5" and 6'3".
If you have a large enough sample then even the tiniest departures from normality become significant. With N on the order of a billion I am sure that it is not normal.

The question isn't really whether or not something is normal, just whether or not the approximation is close enough that you can use the nice simplifying assumption that normality provides.
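The point about sample size can be sketched with a small simulation. The data below are drawn from a slightly skewed lognormal with made-up parameters (median 70 inches, log-scale sigma 0.043), and the check used is a simple skewness z-score, which is only one of many possible normality tests; both choices are assumptions made for this example.

```python
import math
import random

def skewness_z(sample):
    """Sample skewness divided by its approximate standard error under normality."""
    n = len(sample)
    mean = sum(sample) / n
    m2 = sum((x - mean) ** 2 for x in sample) / n
    m3 = sum((x - mean) ** 3 for x in sample) / n
    g1 = m3 / m2 ** 1.5
    return g1 / math.sqrt(6.0 / n)  # SE of skewness is roughly sqrt(6/n)

rng = random.Random(0)                 # fixed seed for repeatability
MU, SIGMA = math.log(70.0), 0.043      # HYPOTHETICAL lognormal parameters

z_by_n = {}
for n in (100, 10_000, 200_000):
    sample = [rng.lognormvariate(MU, SIGMA) for _ in range(n)]
    z_by_n[n] = skewness_z(sample)
    print(f"N = {n:>7}: skewness z-score = {z_by_n[n]:.2f}")
```

The underlying skew is the same at every N, but the z-score grows roughly like the square root of N, so a departure that is invisible in a small sample becomes overwhelmingly significant in a large one. That says nothing about whether the normal approximation is good enough for a given purpose.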
 
Closed threads should not be re-opened without moderator approval. I think this has been answered.
 