emmasaunders12 said:
Any idea how to proceed here?
Use simulation.
For simulation you need stochastic models, not mere curve fits. Each model, should specify a method for making a deterministic prediction, (such as Y = AX) but it also must specify a model for how the observed data arises in a stochastic manner (such as Y = AX + B* err(k}, where err(k) is an independent random draw at each time k from normal distribution with mean 0 and variance 1.)
I think your x-data is a time series of vectors. You need to generate representative examples of the x-data by simulation or have such examples from actual observations (i.e. one "example" is an entire time series of vectors). So you might need a stochastic model for the x-data.
I am assuming your predictive models give the predicted y-value as a function of the observed x-values , not as a function of the underlying "true" x-values. Of course a model may use the observed x-values to predict the "true" x-values and then make it's prediction based on those estimates.
Once you have the capability to do simulations, you can investigate various statistics by the Monte-Carlo method.
-------------
For example:
Let model_X be the stochastic model for generating the x-data.
Create a Mont-Carlo simulation involving two (possibly identical models) model_A and model_B as follows. One replication of the simulation is:
1) Generate the X-data using model_X
2) Generate the Y-data using the stochastic model associated with model_A
3) Generate the predicted Y-data using the deterministic model associated with model_A
4) Compute RSS_A = the sum of the squared residuals between the Y-data of step 2 and the predicted Y-values of step 3.
( I'm assuming that when using the F-test, your intent was to define the "residual" between two vectors as the euclidean distance between them. Whether this is wise depends on details of the real world problem.)
5) Generate the Y_data using the stochastic model associated with model_B
6) Generate the the predicted Y-data using the deterministic model associated with model_B
7) Compute RSS_B = the sum of the squares of the residuals between the Y_data from step 5 and the predictions of step 6.
8) Compute G = (RSS_A - RSS_B) / RSS_B
G is an obvious imitation of the F-statistic. We don't know that G has the same distribution as any F-statistic, so we shouldn't call it one.
We can set model_A = model_B = your model2 and use the Monte-Carlo simulation to estimate the distribution of G. (When the stochastic model associated with model 2 is applied to the same X-data twice, it probably won't produce the same residuals due to the stochastic terms. Hence the value of G will vary on different replications.)
Take the "null hypothesis" to be that model1 is the same as model2 (as far as producing residuals goes). Compute the single numerical value G_obs = (RSS_1 - RSS_2)/RSS_2 by applying the two models to the actually observed X-data. Use the distribution of G to compute how likely it is to get a value of G equal or greater than G_obs. Then "accept" or "reject" the null hypothesis based on how this probability compares to whatever "significance level" you have chosen.