Akaike information small sample AICc

  • Context: Graduate 
  • Thread starter Thread starter mertcan
  • Start date Start date
  • Tags Tags
    Information
Click For Summary

Discussion Overview

The discussion revolves around the derivation and implications of the corrected Akaike Information Criterion (AICc) in relation to the standard Akaike Information Criterion (AIC). Participants explore the mathematical formulation of AICc, particularly the additional correction term, and the conditions under which AICc is preferred over AIC, especially in small sample sizes.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant seeks clarification on the derivation of the correction term in AICc, specifically the term $$2K*(K+1)/(n-K-1)$$, and how it relates to the standard AIC formula.
  • Another participant notes that the definitions of AIC and AICc are often presented as separate formulas, raising questions about a common underlying quantity that both aim to approximate.
  • There is a mention of a specific threshold (n/k < 40) for when to use AICc, with one participant expressing curiosity about the assumptions leading to this threshold.
  • Another participant acknowledges uncertainty regarding the origin of the number 40 and references literature that discusses model selection criteria but does not clarify this specific threshold.
  • Participants express a willingness to explore derivations and definitions further, contingent on interest from others in the discussion.

Areas of Agreement / Disagreement

Participants express uncertainty regarding the derivation of the correction term in AICc and the significance of the threshold n/k < 40. No consensus is reached on these points, and multiple viewpoints are presented without resolution.

Contextual Notes

Limitations include a lack of clarity on the assumptions leading to the threshold n/k < 40 and the absence of definitive references that explain the derivation of the AICc correction term.

mertcan
Messages
343
Reaction score
6
hi, initially I am aware that AICc value is $$ -2(*log-likelihood)+2K+2K*(K+1)/(n-K-1)$$ where n is sample size and K is number of model parameters. But I really do not know how last term of right hand side is added, also AIC value is $$ -2*(log-likelihood)+2K$$ , so AICc has some correction in addition to AIC. In short my question is what is the derivation of correction in AICc $$(2K*(K+1)/(n-K-1) )$$ ??
 
Physics news on Phys.org
Unfortunately, in searching the web, we find that the usual approach is just to define AIC by a formula and to define AICc by a different formula. However, the terminology "correction" suggests that both formulae are trying to compute a common quantity, whose definition is unstated. If we only consider history as the authority on definitions, we would have to read the original papers that defined the AIC and the AICc to see if the people who proposed the AIC and AICc defined a common quantity that these formulae are supposed to approximate.

If we go beyond history to seek a respectable definition for the AIC, the section "Model Selection Criterion" on page 7 of the presentation http://www4.ncsu.edu/~shu3/Presentation/AIC.pdf, defines a quantity that is to be maximized. The particular formulae used to estimate that quantity could be different for different types of models and situations (e.g. linear models and large samples vs linear model and small samples ). If we define the AIC abstractly as a quantity proportional to:

##E_y E_x [\log(g(x| \hat{\theta}(y)))]##

then, in different situations, the AIC can be given by different formulae.

I don't know what level of abstraction you are comfortable with. One can probably understand formulae for the AIC and AICc by considering specific situations. - but I won't try to figure this out myself unless someone else is really interested in participating!
 
Stephen Tashi said:
Unfortunately, in searching the web, we find that the usual approach is just to define AIC by a formula and to define AICc by a different formula. However, the terminology "correction" suggests that both formulae are trying to compute a common quantity, whose definition is unstated. If we only consider history as the authority on definitions, we would have to read the original papers that defined the AIC and the AICc to see if the people who proposed the AIC and AICc defined a common quantity that these formulae are supposed to approximate.

If we go beyond history to seek a respectable definition for the AIC, the section "Model Selection Criterion" on page 7 of the presentation http://www4.ncsu.edu/~shu3/Presentation/AIC.pdf, defines a quantity that is to be maximized. The particular formulae used to estimate that quantity could be different for different types of models and situations (e.g. linear models and large samples vs linear model and small samples ). If we define the AIC abstractly as a quantity proportional to:

##E_y E_x [\log(g(x| \hat{\theta}(y)))]##

then, in different situations, the AIC can be given by different formulae.

I don't know what level of abstraction you are comfortable with. One can probably understand formulae for the AIC and AICc by considering specific situations. - but I won't try to figure this out myself unless someone else is really interested in participating!
First of all thanks for your return, but I would like to express that I know how to derive AIC value without the correction, but when sample size is small relative to number of parameters (if n/k<40, by the way k is number of parameters n is sample size) it is said that we should use correction of AIC which means AICc. I really wonder why 40 takes place, what kind of assumptions in AIC definition create 40 or why n/k<40 exists? So, could you help me about which assumptions may result in n/k<40 in small sample size case of AIC value?
 
I myself don't know where the number 40 comes from.

The articles I've found that bother to footnote the recommendation n/k < 40 cite Burnham LS, Anderson DR. Model Selection and Inference: A Practical Information-Theoretic Approach. 2. Springer-Verlag; New York: 2002. I don't have a copy of that book.

We could try to follow the derivation given in http://myweb.uiowa.edu/cavaaugh/doc/pub/aicaicc.pdf, starting on page 3. However, I don't see the number 40 mentioned in that document.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
Replies
1
Views
4K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K