Variation Distance: Explaining Finite Cases

  • Thread starter boboYO
  • Start date
  • Tags
    Variation
In summary, the conversation discusses the total variation distance of probability measures and its equivalence for the finite case. The maximum variation and the sum of variations are shown to be equivalent, but there is a counterexample that raises questions about the definition of equivalence. It is suggested that the two metrics may define the same topology on the set of probability measures.
  • #1
boboYO
106
0
I was doing some reading and I came across this:

http://en.wikipedia.org/wiki/Total_variation_distance_of_probability_measures

So apparently for the finite case,

[tex]\max_{x} ( \left| P(x) - Q(x) \right|)\quad \mbox{ is equivalent to}\quad \frac{1}{2} \sum_x {\left| P(x)-Q(x)\right|}[/tex]


but isn't this is a counterexample?
Code:
x         1        2        3        4
P(x)    0.25     0.25     0.25      0.25

Q(x)    0.10     0.20     0.35      0.35

|P-Q|   0.15     0.05     0.10      0.10

sum(|P-Q|)/2= 0.2

max(|P-Q|)=0.15



So I was thinking, maybe they meant 'equivalent' in a different sense? Could somebody please explain?
 
Mathematics news on Phys.org
  • #2
I believe they might mean that those two metrics define the same topology on the set of probability measures.
 
  • #3
Rochfor1 is correct. "Equivalent" simply means that they will give the same results in any probability measures, not that they are equal.
 
  • #4
Thanks guys.
 

Related to Variation Distance: Explaining Finite Cases

1. What is variation distance?

Variation distance is a measure of how different two probability distributions are. It calculates the absolute difference between the probabilities of events occurring in the two distributions. It is commonly used in statistics and probability theory to compare the similarity of different data sets.

2. How is variation distance calculated?

Variation distance is calculated by taking the absolute difference between the probabilities of each event in the two distributions and then summing all of those differences. The resulting value is a measure of how different the two distributions are, with a higher value indicating a greater difference.

3. Can variation distance be used for non-probabilistic data?

No, variation distance is specifically designed to measure the difference between two probability distributions. It cannot be used for non-probabilistic data as it requires the calculation of probabilities for events.

4. What is the significance of using finite cases in variation distance?

Finite cases refer to situations where the number of events or outcomes is limited and known. In variation distance, using finite cases allows for a more precise calculation of the difference between distributions, as it takes into account all possible outcomes rather than just a sample.

5. How is variation distance used in real-world applications?

Variation distance is commonly used in fields such as statistics, computer science, and biology to compare and analyze data sets. It is often used in machine learning algorithms to measure the distance between data points and make predictions. It is also used in biological studies to compare genetic variations between species or populations.

Similar threads

Replies
1
Views
2K
Replies
4
Views
2K
Replies
1
Views
2K
  • Mechanics
Replies
9
Views
1K
  • Linear and Abstract Algebra
Replies
1
Views
834
Replies
0
Views
477
  • High Energy, Nuclear, Particle Physics
Replies
6
Views
2K
  • Atomic and Condensed Matter
Replies
3
Views
996
Replies
5
Views
2K
Replies
1
Views
816
Back
Top