Genetic Variance Between Pops as compared Within Pops

  • Context: Graduate 
  • Thread starter Thread starter Simfish
  • Start date Start date
  • Tags Tags
    Variance
Click For Summary
SUMMARY

The discussion centers on the concept that approximately 85% of genetic variation in humans is attributed to intra-group differences rather than inter-group differences. A statistical model is presented, utilizing a regression equation to analyze the variation among groups defined by genetic indices. The model indicates that the R² statistic is expected to be 0.15, suggesting a poor fit for the data. This highlights the predominance of intra-group genetic variance in the human genome.

PREREQUISITES
  • Understanding of genetic variance concepts
  • Familiarity with regression analysis
  • Knowledge of R² statistics in statistical modeling
  • Basic comprehension of group definitions in statistical contexts
NEXT STEPS
  • Study the implications of intra-group versus inter-group genetic variance
  • Learn about regression analysis techniques in genetics
  • Explore advanced statistical modeling methods for genetic data
  • Investigate the significance of R² in evaluating model fit
USEFUL FOR

Geneticists, statisticians, researchers in human genetics, and anyone interested in understanding the statistical analysis of genetic variation.

Simfish
Gold Member
Messages
811
Reaction score
2
As it often said, the majority of the variation in the human genome is due to intra-group variation, not inter-group variation (I think .85 is the number due to intra-group variation). I know what this intuitively means, but can anyone explain to me what it rigorously means? (in statistical terms?)
 
Physics news on Phys.org
I don't know how group is defined, but it doesn't matter as long as YOU know.

Suppose you have a numeric index of human genome code for each individual i in group g. Let that index be Y: H --> R. That is, for genome h in H, Y(h) is a real number. Each group is a subset of H. Subscript i indexes the h's in group g; i = 1, ..., Ig where Ig is the number of h's in group g. There are k groups.

A regression equation of the following form is estimated:

yi,j = a + b1d1 + ... + bkdk + ui,j

where the dependent variable is Y(hi,j) = yi,j and the independent variables are the d's; u is the error term. For each g, dg = 1 if j = g (that is, hi,j is in group g), dg = 0 otherwise. The R^2 statistic of the estimated equation is expected to be 0.15 (a poor fit).
 
Last edited:

Similar threads

  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 13 ·
Replies
13
Views
4K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 29 ·
Replies
29
Views
4K