Do higher order moments mean more attention to local features?

Wenlong
Dear all,

Sorry to post this question in this section again.

I am currently looking into a few statistical analysis algorithms. I noticed that they use moments or cumulants of different orders to analyse the data. I guess this is because the algorithms focus on different aspects of the data itself.

So far as I know, the 1st moment (mean) and the 2nd moment (variance) describe the data as a whole (its location and dispersion), the 3rd moment (skewness) looks into the tail area of the distribution, and the 4th moment (kurtosis) concentrates on the peak.
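
For example, here is a quick numerical check of my understanding (just a rough sketch in Python with NumPy/SciPy and an arbitrary simulated sample):

Code:
import numpy as np
from scipy import stats

# Arbitrary heavy-tailed sample (Student's t) purely for illustration.
rng = np.random.default_rng(0)
x = rng.standard_t(df=5, size=100_000)

print("mean (1st moment):", np.mean(x))
print("variance (2nd central moment):", np.var(x))
print("skewness (3rd standardised moment):", stats.skew(x))
print("excess kurtosis (4th standardised moment):", stats.kurtosis(x))
# A heavy-tailed sample gives excess kurtosis > 0; a normal sample gives roughly 0.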

Can I then deduce that higher moments mean the algorithm pays more attention to local statistical properties?

Can anyone give me an explicit answer to help me out of this headache? Or can you recommend some books or papers? I would greatly appreciate your help.

Best wishes
Wenlong
 
The odd and even moments tend to behave differently. Odd moments will naturally tell you things about lopsidedness (because an odd power of a negative number is negative). The mean is a measure of lopsidedness compared with a distribution more evenly placed about 0. Even moments treat both sides equally, so say more about spread.
Higher order moments put a heavier weight on the outliers. A distribution with a sharp peak and long tails will have a higher kurtosis than one with the same variance but a broader centre that then falls off quickly.
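
As a quick numerical illustration of that last point (a minimal sketch in Python, comparing a Laplace distribution with a uniform distribution, both scaled to unit variance):

Code:
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n = 200_000

# Laplace: sharp peak and long tails; scale 1/sqrt(2) gives unit variance.
laplace = rng.laplace(loc=0.0, scale=1.0 / np.sqrt(2), size=n)

# Uniform: broad centre that then cuts off abruptly; half-width sqrt(3) gives unit variance.
uniform = rng.uniform(low=-np.sqrt(3), high=np.sqrt(3), size=n)

print("variances:", np.var(laplace), np.var(uniform))        # both close to 1
print("excess kurtosis, Laplace:", stats.kurtosis(laplace))  # about +3
print("excess kurtosis, uniform:", stats.kurtosis(uniform))  # about -1.2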
 
haruspex said:
The odd and even moments tend to behave differently. Odd moments will naturally tell you things about lopsidedness (because an odd power of a negative number is negative). The mean is a measure of lopsidedness compared with a distribution more evenly placed about 0. Even moments treat both sides equally, so say more about spread.
Higher order moments put a heavier weight on the outliers. A distribution with a sharp peak and long tails will have a higher kurtosis than one with the same variance but a broader centre that then falls off quickly.

Hi, Haruspex

Thank you very much for your reply. It helps a lot.

Then may I ask a further question based on this? Take PCA and ICA (independent component analysis) for example: PCA computes principal components from the covariance matrix (2nd order moments), while ICA computes independent components by maximising negentropy (approximated by kurtosis or other higher order moments).

Comparing the principal components and the independent components of the same set of observations, I find that the independent components represent local features better, while the principal components represent global trends better.

Is this because of the different orders of moments they use, or is it just a coincidence?

Many thanks in advance.

Best wishes
Wenlong

BTW, how can I reply to a respondent directly in this forum?
 
Wenlong said:
Dear all,

Sorry to post this question in this section again.

I am currently looking into a few statistical analysis algorithms. I noticed that they use moments or cumulants of different orders to analyse the data. I guess this is because the algorithms focus on different aspects of the data itself.

So far as I know, the 1st moment (mean) and the 2nd moment (variance) describe the data as a whole (its location and dispersion), the 3rd moment (skewness) looks into the tail area of the distribution, and the 4th moment (kurtosis) concentrates on the peak.

Can I then deduce that higher moments mean the algorithm pays more attention to local statistical properties?

Can anyone give me an explicit answer to help me out of this headache? Or can you recommend some books or papers? I would greatly appreciate your help.

Best wishes
Wenlong

Hey Wenlong.

Are you aware of the relationship between the moments and the characteristic function of a distribution, and of how the Fourier and inverse Fourier transform are interpreted with respect to frequency information?

This will help you understand the relationship between the various moments (raw moments, not central moments) and the frequency content of the PDF itself.
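
In symbols (assuming the relevant moments exist, and up to the usual sign convention for the Fourier transform):

$$\varphi_X(t) = \mathbb{E}\left[e^{itX}\right] = \int_{-\infty}^{\infty} e^{itx} f_X(x)\,dx, \qquad \mathbb{E}[X^n] = (-i)^n \left.\frac{d^n}{dt^n}\varphi_X(t)\right|_{t=0},$$

so the characteristic function is essentially the Fourier transform of the PDF, and the raw moments are its derivatives at the origin.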
 
Wenlong said:
Take PCA and ICA (independent component analysis) for example: PCA computes principal components from the covariance matrix (2nd order moments), while ICA computes independent components by maximising negentropy (approximated by kurtosis or other higher order moments).

Comparing the principal components and the independent components of the same set of observations, I find that the independent components represent local features better, while the principal components represent global trends better.

Is this because of the different orders of moments they use, or is it just a coincidence?
You've gone beyond my limits of expertise with that one.
As far as I've been able to discern:
- PCA is often used as a preliminary (whitening) step for ICA anyway;
- ICA requires non-Gaussianity in (all but one of) the sources, whereas PCA does not;
- ICA doesn't rank the components.
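
For what it's worth, here is a minimal numerical sketch of that whitening-then-ICA pipeline (made-up mixing matrix and sources, plain NumPy, and a single kurtosis-based FastICA-style unit rather than a full ICA implementation):

Code:
import numpy as np

rng = np.random.default_rng(2)
n = 50_000

# Two made-up non-Gaussian sources, mixed linearly.
s = np.vstack([rng.laplace(size=n), rng.uniform(-1, 1, size=n)])
A = np.array([[1.0, 0.5],
              [0.3, 1.0]])
x = A @ s                                      # observed mixtures, shape (2, n)

# --- PCA / whitening: uses only the covariance (2nd-order statistics) ---
x = x - x.mean(axis=1, keepdims=True)
eigval, eigvec = np.linalg.eigh(np.cov(x))
z = np.diag(eigval ** -0.5) @ eigvec.T @ x     # whitened data: identity covariance

# --- One ICA direction: FastICA fixed-point step with g(u) = u^3 ---
# This maximises |kurtosis| of w^T z, i.e. it relies on 4th-order statistics.
w = rng.normal(size=2)
w /= np.linalg.norm(w)
for _ in range(200):
    u = w @ z
    w_new = (z * u ** 3).mean(axis=1) - 3.0 * w  # E[z g(u)] - E[g'(u)] w, with E[u^2] = 1
    w_new /= np.linalg.norm(w_new)
    converged = abs(abs(w_new @ w) - 1.0) < 1e-9
    w = w_new
    if converged:
        break

print("recovered direction in whitened space:", w)

The whitening step depends only on second-order statistics, so it can only determine the components up to a rotation; the kurtosis-based iteration then picks the particular rotation in which the projections look maximally non-Gaussian, which is exactly where the higher order moments come in.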
 