Can I use my siblings' heights to predict my child's adult height?

  • Context: Undergrad 
  • Thread starter Thread starter moonman239
  • Start date Start date
  • Tags Tags
    Height Probability
Click For Summary
SUMMARY

This discussion centers on predicting a child's adult height using family height data. It is established that surveying siblings is preferable to surveying younger relatives, as siblings provide a more accurate representation of genetic height potential. The conversation emphasizes the importance of sample size, noting that classical statistical methods are not suitable for small samples. Instead, Bayesian statistics should be employed to account for non-linear growth patterns and to incorporate prior information effectively.

PREREQUISITES
  • Understanding of Bayesian statistics
  • Familiarity with non-linear growth models
  • Knowledge of classical statistical methods and their limitations
  • Basic concepts of regression analysis
NEXT STEPS
  • Research Bayesian statistics for small sample sizes
  • Study non-linear growth modeling techniques
  • Learn about regression coefficients in predictive modeling
  • Explore the implications of sampling distributions in height prediction
USEFUL FOR

Parents, geneticists, statisticians, and anyone interested in understanding the complexities of predicting physical traits based on familial data.

moonman239
Messages
276
Reaction score
0
I have two questions.

Suppose I am a parent and want to predict how tall my child will be as an adult. I want to survey family members from both sides.

Should I survey my brothers and sisters, or should I survey my 20-year-old nephews and nieces? I think I should survey my brothers and sisters. And they should be about the same age.

Also, the distribution of those heights will always approximate a normal distribution. That's what I learned in biology yesterday. How do I use this fact to predict the adult
 
Physics news on Phys.org
moonman239 said:
Also, the distribution of those heights will always approximate a normal distribution.

Perhaps, but not if there are only a few samples.
 
moonman239 said:
I have two questions.

Suppose I am a parent and want to predict how tall my child will be as an adult. I want to survey family members from both sides.

Should I survey my brothers and sisters, or should I survey my 20-year-old nephews and nieces? I think I should survey my brothers and sisters. And they should be about the same age.

Also, the distribution of those heights will always approximate a normal distribution. That's what I learned in biology yesterday. How do I use this fact to predict the adult

For small sample sizes, it is not wise to use classical statistical methods.

The classical methods are asymptotic and they rely on having a large enough sample size.

Apart from this you need to understand a little bit about the underlying process. Your brothers and sisters data might better represent the outcome than your nieces. If biology has results that say that your nieces may not be a good representation for what you are looking for, it might be more damaging using that data than not using it all.

Also with doing things like trying to predict outcomes, you would use a simple linear model.

One thing you need to be aware of is that growth is a highly non-linear process. By this I mean that we don't grow at a constant rate: there are periods where our growth is sudden and there are periods where our growth is somewhat negligible.

If you want to fit some kind of model it would have to take this into account, and this kind of model would require a little bit of advanced mathematics to transform your data correctly so that you a) get a transformed linear model that makes sense and b) can transform it back to what it originally represents to get your predicted values.

If you have small sample sizes, you should probably use Bayesian statistics. What this does is basically use conditional probability and what you do is use a distribution to represent prior information.

In conclusion, there are a lot of things to consider, even though the problem seems relatively simple. The non-linearity factor, combined with small sample sizes, combined with a bad understanding of the process involved are things that will give you results that may not be useful.
 
chiro said:
For small sample sizes, it is not wise to use classical statistical methods.

The classical methods are asymptotic and they rely on having a large enough sample size

If you have small sample sizes, you should probably use Bayesian statistics.

Ignoring the other factors such as any genetic disorders that may affect my children's height, can I not use the Student's t-distribution?
 
moonman239 said:
Ignoring the other factors such as any genetic disorders that may affect my children's height, can I not use the Student's t-distribution?

Since you want to make predictions, chances are you want to have some kind of model that has regression coeffecients.

Since your model is not a simple model (because of things like growth spurts), you can't just use the data to get a simple linear or even non-linear model.

This creates a bit of difficulty if you want a model that is reasonable for this kind of problem.
 
chiro said:
Since you want to make predictions, chances are you want to have some kind of model that has regression coeffecients.

Since your model is not a simple model (because of things like growth spurts), you can't just use the data to get a simple linear or even non-linear model.

This creates a bit of difficulty if you want a model that is reasonable for this kind of problem.

Huh. I thought it always approximated the normal distribution. I guess empirical probability would be my best friend here. If I had like 30 or so siblings, the height distribution might come close enough to being normally distributed.
 
moonman239 said:
Huh. I thought it always approximated the normal distribution. I guess empirical probability would be my best friend here. If I had like 30 or so siblings, the height distribution might come close enough to being normally distributed.

The fact that you're sampling from a distribution of siblings means it will be difficult to defend the independence assumption for a variable like height.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
970
  • · Replies 3 ·
Replies
3
Views
6K
  • · Replies 5 ·
Replies
5
Views
2K