Time Series: Residuals of ARMA model

In summary, the residuals of an ARMA model are calculated by estimating the parameters (phi and theta) and using them to compute the residual "a_t hat", based on the definition of residual which involves the white noise terms at-1, at-2, ..., at-q. The initial conditions for the noise terms and X values may need to be specified in order to compute the residuals numerically.
  • #1
kingwinner
1,270
0
Time Series: "Residuals" of ARMA model

stat7.JPG


To check whether the white noise {at} are uncorrelated, we usually look at the residuals (which are sample estimates of the white noise {at}) and residual plots. But I just don't understand the meaning of "residuals" in the context of ARMA model...

In the above definition, the residual "at hat" is in terms of the white noise terms
at-1, at-2, ..., at-q. But we know that the white noise terms are unobservable (residuals are observable, white noise terms are not), and there is no way we can know the exact values of at-1, at-2, ..., at-q, right? Now if we don't know everything on the right hand side, how can we calculate the residuals "at hat"? I just don't understand how residuals of ARMA model can be calculated based on this definition.

I tried searching the internet, but couldn't find much.
Hopefully someone can explain. Thank you!
 
Physics news on Phys.org
  • #2


It would be easier to comment on material you quoted if we clarify how "residual" is defined in that material.

A problem in reading material on time series is that you must figure out what notation is being used for indexed quantities. For example, if we have a quantity denoted as [itex] q_i [/itex] , are we to regard it as a random variable? Or is it a particular numerical value, which may have come from one realization of a random variable?

The subject of ARMA models isn't fresh in my mind. The general pattern that I recall is this:

Let [itex] x_ i [/itex] denote the numerical values of a specific set of time series data. The model for the data expresses each value as a linear combination of random variables. You can't deduce any numerical values for the random variables by looking at an equation for a particular [itex] x_i [/itex] (for example, [itex] x_{20} [/itex]). However, you can make progress if you look at equations that involve higher order differences of the [itex] x_i [/itex]. Define
[tex] \triangle x_i = x_{i+1} - x_i [/tex]
[tex] \triangle^2 x_i= \triangle ( x_{i+1} - x_i ) = x_{i+2} - x_{i+1} - (x_{i+1} - x_i) [/tex]
etc.

If you form equations that have a higher order difference on the left hand side, then many of the variables in the right hand side may cancel out. You may be able to solve for particular numerical values of these variables or the numerical values of particular functions of them (for example, enough to give you an estimate of their sample variance).

So when you say something is "unobservable", in an ARMA model, what do you mean? Do you mean that we can't compute numerical values for it when we are given numerical values of the time series data? Are you saying we can't compute it even if we use tricks involving higher order differences?
 
  • #3


I don't think the tricks involving higher order differences would solve the problem.

I believe the white noise {a_t} are unobservable (this term is used in all of my textbooks) because the true value of the parameters, phi's and theta's, are unknown. We can never calculate the exact value of {a_t}. We can only calculate the (observable) residuals "a_t hat", but based on the above definition of residual, to calculate a_t hat, we need to know the values of the (unobservable) white noise terms
at-1, at-2, ..., at-q which seem to be going in circles because they are defining the residuals in terms of the white noises? What sense does it make?

So how can we actually compute the values of "residuals" of ARMA model?

Thanks!
 
Last edited:
  • #4


If you take the view that the parameters of your model are unknown, I don't think you can calculate residual errors between the data and the predicted values. This is the usual situation, isn't it? For example, if you take the view that data is generated by a linear regression model, but that the slope and intercept of the line are unknown, then you can't calculate the residual errors.
 
  • #5


The true values of the parameters phi's and theta's are unkown, and we can only estimate them, hence we have the "phi hat" and "theta hat". (aside: this is also the key to explaining the important difference between the terms residual and error in regression) Since estimates of the parameters are used, we don't get the white noise a_t, but only the residual "a_t hat", an estimate of the white noise.

But my problem is not with estimating the parameters (phi and theta). When you look at the definition of "residual" above, they are defining the residual in terms of the white noises
at-1, at-2, ..., at-q which doesn't make any sense to me. On the right hand side of the definition, everything is known or can be calculated except for the white noise terms. But since we don't know the values of the white noise terms, there is no way we can calculate any residual, so I'm really confused as to how residuals of ARMA model are calculated in practice.
 
  • #6


OK, to summarize your objection to the definition of residuals, you object to the use of the hat on them since you think that they cannot be numerically computed.

As I recall, you have to stipulate some initial conditions as part of the parameters of the model. For example, to compute [itex] X_0 [/itex], we need "fictitious" noise terms
[itex] a_{-1}, a_{-2}, a_{-q} [/itex] and X values with negative indices [itex] X_{-1}, X_{-2},..X_{-p} [/itex]. It wouldn't surprise me if all these are set to zero, but you'll have to consult your materials to see. With that initial data, you can compute the residuals numerically.
 

1. What is a time series?

A time series is a sequence of data points collected at regular intervals over time. It is used to analyze past trends, forecast future values, and identify patterns or relationships within the data.

2. What are residuals in an ARMA model?

Residuals in an ARMA (Autoregressive Moving Average) model are the differences between the actual values of a time series and the predicted values from the model. They represent the unexplained variation in the data that is not accounted for by the model.

3. Why are residuals important in a time series analysis?

Residuals are important in a time series analysis because they allow us to evaluate the performance of the ARMA model. If the residuals are random and have no discernible pattern, it indicates that the model is a good fit for the data. However, if there is a pattern or trend in the residuals, it suggests that the model is not capturing all the information in the data and needs to be improved.

4. How do you interpret the residuals in an ARMA model?

The residuals in an ARMA model should be normally distributed with a mean of 0 and constant variance. If the residuals do not follow this pattern, it indicates that the model is not a good fit for the data. Additionally, a plot of the residuals should not show any trends or patterns, as this would suggest that the model is not capturing all the information in the data.

5. What can cause non-random residuals in an ARMA model?

Non-random residuals in an ARMA model can be caused by several factors, including: omitted variables, incorrect model specification, non-stationarity in the data, or autocorrelation. It is important to diagnose the cause of non-random residuals in order to improve the model and make more accurate predictions.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
8
Views
449
  • Set Theory, Logic, Probability, Statistics
Replies
14
Views
240
  • Set Theory, Logic, Probability, Statistics
Replies
8
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
23
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
472
  • Set Theory, Logic, Probability, Statistics
Replies
10
Views
934
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
4K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
4K
Back
Top