Discussion Overview
The discussion revolves around the entropy of a histogram when using different bin sizes, particularly in the context of approximating an unknown probability distribution in a dynamical system. Participants explore how the choice of bin size affects the calculated entropy and seek an expression or correction for this relationship.
Discussion Character
- Exploratory
- Technical explanation
- Debate/contested
- Mathematical reasoning
Main Points Raised
- Some participants propose that as the number of bins increases, entropy will also increase, but they seek a relationship that accounts for this effect.
- One participant clarifies that they are approximating an unknown probability distribution using a histogram and are interested in determining the number of bins needed to effectively decrease entropy.
- Another participant questions the criteria for "sufficiently" decreasing entropy, suggesting that continuous distributions have a fixed entropy that should not be influenced by bin size.
- One participant discusses the implications of using a Markov process and how discretizing the state space could lead to sparser transition probabilities, thereby affecting entropy.
- Another participant raises concerns about the clarity of the probability distributions involved, questioning how to define and vary the distributions to analyze entropy correctly.
- Some participants note that the relationship between bin size and entropy may depend on the underlying distribution, with uniform distributions showing a more pronounced increase in entropy with more bins.
Areas of Agreement / Disagreement
Participants express differing views on the relationship between bin size and entropy, with no consensus on how to define or calculate the entropy in this context. The discussion remains unresolved regarding the optimal approach to analyzing entropy with varying bin sizes.
Contextual Notes
Limitations include the ambiguity in defining the probability distributions involved and the assumptions about how entropy behaves with different bin sizes. The discussion also highlights the complexity of applying concepts from continuous distributions to discrete approximations.