Continous limit of a multivariate normal distribution

In summary, the conversation is discussing a set of random variables that follow a multivariate normal distribution, with a covariance matrix and vector of mean values constructed numerically from quantum optics. The goal is to calculate the Fisher information for estimating a parameter, but taking the limit as the variables approach a continuous distribution leads to issues with the covariance matrix becoming singular. The participants discuss possible solutions, including using a Gaussian process and estimating the autocovariance function from data. However, the process in question is not stationary, so alternative methods must be explored.
  • #1
QuantizedFun
3
0
Hello everyone,

I am currently considering a set of random variables, [tex] \vec{x} = [x_1,x_2,...x_N][/tex] which are know to follow a multivariate normal distribution,
[tex] P(\vec{x}) \propto \mathrm{exp}(-\frac{1}{2}(\vec{x}-\vec{\mu})^\mathrm{T}\Sigma^{-1}(\vec{x}-\vec{\mu}))[/tex]
The covariance matrix Σ and the vector of mean values μ are constructed numerically from quantum optics, and I the purpose is to calculate the Fisher information of estimating a parameter θ,
[tex] \mathcal{I}(\theta) = \frac{\partial \vec{\mu}^\mathrm{T}}{\partial \theta}\Sigma^{-1} \frac{\partial \vec{\mu}}{\partial \theta}[/tex]

Now, I wish to take the limit as the set of variables approach a continuous distribution,
[tex] x_i \rightarrow x(t), \quad \quad \Sigma_{i,j} \rightarrow \Sigma(t,t') [/tex]
In this case Σ-1 is ill-defined and numerically the calculations break down because Σ-1 becomes singular when close columns become (nearly) identical in this limit.

I would be very grateful for any suggestions to solve a problem like this.

Is there perhaps a "standard" way to take this continuum limit of a multivariate normal distribution or maybe just of the inverse matrix?
 
Last edited:
Physics news on Phys.org
  • #2
What you are looking for is a Gaussian process (https://en.wikipedia.org/wiki/Gaussian_process), which includes countably and uncountably infinite collections of Gaussian random variables. As your variables are suitably indexed (by time), you just need to define the mean as a function of time and the covariance as a function of a pair of time points. I'm not sure how you would get the continuum limit of your covariance matrix however, as that should depend on its specific form.
 
  • #3
madness said:
I'm not sure how you would get the continuum limit of your covariance matrix however, as that should depend on its specific form.

Are we looking for an "autocovariance function"? If so, there should be literature on how to estimate it from data.
 
  • #4
Stephen Tashi said:
Are we looking for an "autocovariance function"? If so, there should be literature on how to estimate it from data.

In the continuous case? That would at least require the specification of some kind of functional form a priori, and then the fitting of parameters.
 
  • #5
madness said:
What you are looking for is a Gaussian process (https://en.wikipedia.org/wiki/Gaussian_process), which includes countably and uncountably infinite collections of Gaussian random variables. As your variables are suitably indexed (by time), you just need to define the mean as a function of time and the covariance as a function of a pair of time points. I'm not sure how you would get the continuum limit of your covariance matrix however, as that should depend on its specific form.

Thank you, the process is indeed a Gaussian process. I have defined my mean μ(t) and the covariance function Σ(t,t') as functions of the continuous time variables t and t'.

This, however, does allow me to construct a probability density P(x(t)) for the continuous time series x(t) (which would be needed to obtain the Fisher information from the definition).

After you made me aware that I am dealing with a Gaussian process I, however, was able to specify my search.
I turns out that for such processes, the mean and covariance function can be considered in frequency space to yield approximate formulas for the Fisher information if the process is stationary (Σ(t,t')=Σ(t-t'))
see http://www.ese.wustl.edu/~nehorai/paper/papersadd/ieeetass90-2.pdf

The problem is that my process is not stationary, so I somehow need to extend these results or find another way around the problem.
 
  • #6
QuantizedFun said:
The problem is that my process is not stationary, so I somehow need to extend these results or find another way around the problem.

That's an interesting problem and I don't want to see this thread die out. You might not appreciate an intuitive, uninformed discussion - but I can't resist.

If I understood whether your covariance function implies that you have an "ordinary" Gaussian process, I would understand whether the paper http://bpuig.perso.univ-pau.fr/articles/PuigPoirion.pdf would help. It's about simulation.

In terms of generalities, your problem involves the concept of integration of a functional over all possible paths. As a generality, its not clear that such an integral exists and if it exists, it may not be unique. From a simplistic point of view, the general pattern for an integral over all possible paths would be to define it as a limit of a sums. Each sum would we a sum of terms and each term would have the form [itex] f(s_i) p(s_i) [/itex] where [itex] f(s_i) [/itex] is a number that "characterizes" some property of all the paths in a set [itex] s_i [/itex] of paths and [itex] p(s_i) [/itex] gives the probability that the realized trajectory of the stochastic process falls in the set [itex] s_i [/itex]. We'd like the sets [itex] s_i [/itex] to partition the space of trajectories.

If a trajectory is simulated as a finite set of points then the finite vector of points can be taken to represent the set [itex] s_i [/itex] defined as the set of all trajectories that pass through those points. The crucial question is whether one may assign a number [itex] f(s_i) [/itex] to that set that "characterizes" the entire set of values [itex] \{ f(v): v \in s_i \} [/itex].
 
Last edited by a moderator:
  • #7
Stephen Tashi said:
If I understood whether your covariance function implies that you have an "ordinary" Gaussian process, I would understand whether the paper http://bpuig.perso.univ-pau.fr/articles/PuigPoirion.pdf would help. It's about simulation.

My covariance function looks like this:

untitled.jpg


A simpler example, where the problem of an ill-defined inverse arises, is

[tex] \Sigma(t,t') = \exp(-(t-t')^2) [/tex]

If one could understand why this fails in contrast to e.g.

[tex] \Sigma(t,t') = \exp(-|t-t'|)[/tex] or [tex] \Sigma(t,t') = 1-|t-t'| [/tex]
where the inverse exists, a solution might be possible to find.These processes are of course stationary, but I believe the root of my problem is the same.
 
Last edited by a moderator:

1. What is the definition of "continuous limit" in the context of a multivariate normal distribution?

The continuous limit of a multivariate normal distribution refers to the theoretical concept of an infinite number of random variables approaching a normal distribution as they become increasingly correlated with each other.

2. How is the continuous limit of a multivariate normal distribution related to the Central Limit Theorem?

The continuous limit of a multivariate normal distribution is a generalization of the Central Limit Theorem, which states that as the sample size increases, the distribution of sample means will approach a normal distribution. The continuous limit extends this concept to multiple variables with increasing correlation.

3. What does the covariance matrix represent in the continuous limit of a multivariate normal distribution?

The covariance matrix in the continuous limit of a multivariate normal distribution represents the strength and direction of the relationships between the variables. It shows how much the variables change together, with positive values indicating a positive relationship and negative values indicating a negative relationship.

4. How does the continuous limit of a multivariate normal distribution differ from a standard multivariate normal distribution?

The continuous limit of a multivariate normal distribution differs from a standard multivariate normal distribution in that it allows for an infinite number of variables with increasing correlation. This means that the variables are not necessarily independent, as they are in a standard multivariate normal distribution.

5. What are some real-world applications of the continuous limit of a multivariate normal distribution?

The continuous limit of a multivariate normal distribution is commonly used in finance, economics, and other fields to model complex systems with multiple variables that are highly correlated. It is also used in statistical analysis to test for normality and to estimate confidence intervals for multivariate data.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
8
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
958
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
918
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
Back
Top