Markov Chain as a function of dimensions

Click For Summary
SUMMARY

This discussion centers on the behavior of a Markov chain of order 50 created using R, which demonstrates clustering and flattening as dimensions increase. The user questions whether the observed phenomena are due to data spreading in higher dimensions or the strength of correlation in one coordinate maintaining cluster integrity. The conversation references the work of statisticians Gareth Roberts and Jeff Rosenthal, specifically their paper on limit theorems for Markov Chains in MCMC algorithms, which provides insights into the convergence and mixing properties of these chains.

PREREQUISITES
  • Understanding of Markov Chains and their properties
  • Familiarity with R programming for statistical analysis
  • Knowledge of Euclidean metrics in higher dimensions
  • Basic concepts of MCMC (Markov Chain Monte Carlo) algorithms
NEXT STEPS
  • Study the paper by Gareth Roberts and Jeff Rosenthal on limit theorems for Markov Chains
  • Explore advanced R packages for visualizing high-dimensional data
  • Learn about convergence and mixing times in Markov Chains
  • Investigate the implications of dimensionality on clustering in statistical models
USEFUL FOR

Statisticians, data scientists, and researchers interested in Markov Chains, MCMC algorithms, and high-dimensional data analysis.

FallenApple
Messages
564
Reaction score
61


Here is an animation I created in R.

I built this Markov chain of order 50 by correlating the information in one of the coordinates while randomly varying the rest. Is there an explanation for the clustering and flattening out over increasing dimensions of the vector space? Is it due to the fact that data becomes spread out over larger dimensions?

But that doesn't explain why the clusters themselves do not spread out or why other clusters condense. I've done this for much larger dimensions and it seems to reach a steady state.

The plot is of the incremental changes in a Euclidean metric vs the input, so I don't know if viewing the data as extremely spread out in higher dimensional space would translate to this plot.

Do this mean that the correlation that I induced in that coordinate is strong enough such that it keeps the cluster together regardless of how high the dimension is?
 
Last edited:
Physics news on Phys.org
I'm not sure if this would answer your question, but statisticians Gareth Roberts (of Lancaster University, later University of Warwick, UK) and Jeff Rosenthal (of the University of Toronto, and a former professor of mine when I was in grad school) wrote a paper summarizing limit theorems for Markov Chains in the context of MCMC algorithms.

https://arxiv.org/abs/math/0404033

I believe the contents of the paper will explain the specific convergence of Markov Chains and the properties of "mixing" in Markov Chains (aka the time when Markov Chains are "close" to its steady state distribution).
 
  • Like
Likes   Reactions: FallenApple

Similar threads

Replies
9
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 10 ·
Replies
10
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 31 ·
2
Replies
31
Views
5K
  • · Replies 7 ·
Replies
7
Views
3K
Replies
2
Views
2K
  • · Replies 25 ·
Replies
25
Views
6K
  • · Replies 1 ·
Replies
1
Views
3K
Replies
13
Views
6K