When are 2 models comparable using an F Test?

  • Thread starter: David Laz
  • Tags: Models, Test
David Laz
Say I'm given a bunch of models fitted to the same set of data. How does one determine whether a valid comparison can be made between any two of them using an F test?

Is it that the estimation space of one has to be a subset of the other? Is there any easier, more practical way of determining this?

Thanks
 
Is it that the estimation space of one has to be a subset of the other?
No, not in the sense that one model must use a subset of the other's regressors. For example, in a regression equation you might want to test whether b_i = b_j. The relevant test statistic has an F distribution, even though the hypothesis does not drop any regressor from the model.

http://en.wikipedia.org/wiki/F_test
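To make the b_i = b_j example concrete, here is a minimal sketch assuming a linear model with an intercept and just two regressors. The symbols x_i, x_j, RSS_U, RSS_R, n, p and q are illustrative and do not come from the thread.

```latex
\begin{align*}
\text{Unrestricted:}\quad & y = b_0 + b_i x_i + b_j x_j + \varepsilon \\
\text{Restricted } (b_i = b_j = b):\quad & y = b_0 + b\,(x_i + x_j) + \varepsilon \\
F &= \frac{(\mathrm{RSS}_R - \mathrm{RSS}_U)/q}{\mathrm{RSS}_U/(n - p)}, \qquad q = 1,\ p = 3
\end{align*}
```

Here RSS_R and RSS_U are the residual sums of squares of the restricted and unrestricted fits, q is the number of restrictions, p is the number of parameters in the unrestricted model and n is the number of observations. No regressor is dropped, yet the restricted model is still a special case of the unrestricted one.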
 
I think I know what you're getting at, but here's an example of the sort of question I need help with. Maybe you could explain or show me.

Here's a question from an old exam:

The following table gives the yields from a field experiment on two varieties of wheat, Hard and Common, with four equally spaced levels of applied fertilizer. The plots were allocated at random to the various treatment combinations. Initially it was planned to have four replicates at each combination, but errors in applying the fertilizer reduced the final sample size.


http://img530.imageshack.us/img530/2558/statny0.jpg
(table of the data, probably not needed to answer this question)

In what follows, Var.f refers to variety treated as a factor with two levels (1 = hard, 2 = common), and Fert.f refers to fertilizer treated as a factor with four levels (1, 2, 3, 4).

Since the fertilizer levels are numeric, it is possible to use the actual amount of fertilizer as a variable (denoted by x, taking values 1, 2, 3, 4).

http://img174.imageshack.us/img174/8470/stat2nc9.jpg
(table of different models and their associated deviances and degrees of freedom)

Here a*b means a + b + a:b, where a:b is the interaction term.
Among all these models, which ones cannot be validly compared using an F test?
 
To calculate the F statistic, you need a restricted model and an unrestricted model. The restricted model is a special case of the unrestricted model: but for the restriction(s) being applied, it would be identical to the unrestricted model. Put differently, starting from the unrestricted model, you should be able to arrive at the restricted model by imposing one or more linear restrictions on the parameters that the model estimates.

Example: a Var.f + b Fert.f can be obtained from a Var.f + b Fert.f + c Var.f:Fert.f by imposing the linear restriction c = 0. (Each of a, b, c is an estimated parameter that explains, say, "agricultural yield" as a function of the variables Var.f, Fert.f and Var.f:Fert.f, respectively. Strictly, because Fert.f is a factor with four levels, b and c each stand for a set of three coefficients under the usual dummy coding, so c = 0 is several restrictions tested jointly.)
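As a practical check, the "can I get there by linear restrictions" condition can be tested numerically: the restricted model is nested if every column of its design matrix lies in the column space of the unrestricted model's design matrix. Below is a minimal Python sketch of that idea, not a definitive implementation. It assumes every one of the 2 x 4 variety/fertilizer cells was observed at least once, uses one row per cell, and all function and variable names are hypothetical. The numbers passed to `f_statistic` are made up for illustration and are not taken from the exam table; the function also assumes a normal linear model, so that the deviance is the residual sum of squares.

```python
from fractions import Fraction
from itertools import product


def rank(rows):
    """Exact rank of a matrix (list of rows) by Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0  # number of pivots found so far
    for c in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def is_nested(small, big):
    """True if every column of `small` lies in the column space of `big`."""
    combined = [b + s for b, s in zip(big, small)]
    return rank(combined) == rank(big)


def f_statistic(rss_restricted, df_restricted, rss_full, df_full):
    """F statistic from residual sums of squares and residual df."""
    q = df_restricted - df_full  # number of restrictions
    if q <= 0:
        raise ValueError("restricted model must have more residual df")
    return ((rss_restricted - rss_full) / q) / (rss_full / df_full)


# One row per (variety, fertilizer amount) cell; variety coded 0 = hard, 1 = common.
CELLS = list(product((0, 1), (1, 2, 3, 4)))


def design(*columns):
    """Build a design matrix from column functions of (variety, amount)."""
    return [[col(v, x) for col in columns] for v, x in CELLS]


def intercept(v, x):
    return 1


def variety(v, x):
    return v


def dose(v, x):
    return x


def variety_dose(v, x):
    return v * x


def fert_level(k):
    """Dummy column for fertilizer level k (level 1 is the baseline)."""
    return lambda v, x: int(x == k)


additive_factor = design(intercept, variety,
                         fert_level(2), fert_level(3), fert_level(4))  # Var.f + Fert.f
additive_linear = design(intercept, variety, dose)                     # Var.f + x
interaction_linear = design(intercept, variety, dose, variety_dose)    # Var.f * x

if __name__ == "__main__":
    print(is_nested(additive_linear, additive_factor))     # True
    print(is_nested(additive_factor, interaction_linear))  # False
    print(is_nested(interaction_linear, additive_factor))  # False
    # Hypothetical deviances and residual df, purely for illustration.
    print(f"{f_statistic(120.0, 20, 100.0, 18):.3f}")
```

In this sketch Var.f + x is nested in Var.f + Fert.f, because x is a linear combination of the intercept and the fertilizer dummies, so that pair can be compared. Var.f + Fert.f and Var.f * x are not nested in either direction, so an F test between them would not be valid.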
 