It's becoming clearer now. Wikipedia has an explanation that frames entropy as the unpredictability of a string of questions, measured per question. Their formulas share the same relationships between the ideas, but I guess there isn't as much of a relationship between what they stand for as I thought there would be.
The first paragraph describes entropy as average unpredictability: a function of how much remains unknown after some number of questions have been answered, out of the full range the problem allows. The second paragraph describes it as that average unpredictability diminishing as more and more of the subsequent questions get answered within the framework of the problem. So what I'll pull from this is that Shannon entropy is named after thermodynamic entropy because the two ideas share the same form, and obviously not because their individual constituent ideas match. Maybe you could describe it in terms of thermodynamic entropy: the number of unknown micro-states, relative to what is known, diminishes within a particular system as you start to answer what each individual micro-state is, given the total range of possible values... you know, this reminds me of when I learned about degrees of freedom in statistics.
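To make the "bits per question" picture concrete, here's a minimal Python sketch (my own illustration, not from the article) that computes Shannon entropy as the average number of bits, i.e. ideal yes/no questions, needed per outcome:

import math

def shannon_entropy(probs):
    # Average unpredictability in bits per outcome:
    # H = -sum(p * log2(p)) over outcomes with nonzero probability.
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(shannon_entropy([0.5, 0.5]))    # fair coin: 1.0 bit, one question
print(shannon_entropy([0.25] * 4))    # four equal outcomes: 2.0 bits, two questions
print(shannon_entropy([0.99, 0.01]))  # heavily biased coin: ~0.08 bits, almost predictable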
...
wikipedia:
"Now consider the example of a coin toss. Assuming the probability of heads is the same as the probability of tails, then the entropy of the coin toss is as high as it could be. This is because there is no way to predict the outcome of the coin toss ahead of time: if we have to choose, the best we can do is predict that the coin will come up heads, and this prediction will be correct with probability 1/2. Such a coin toss has one bit of entropy since there are two possible outcomes that occur with equal probability, and learning the actual outcome contains one bit of information. In contrast, a coin toss using a coin that has two heads and no tails has zero entropy since the coin will always come up heads, and the outcome can be predicted perfectly. Analogously, one binary-outcome with equiprobable values has a Shannon entropy of {\displaystyle \log _{2}2=1}
bit. Similarly, one
trit with equiprobable values contains {\displaystyle \log _{2}3}
(about 1.58496) bits of information because it can have one of three values.
English text, treated as a string of characters, has fairly low entropy, i.e., is fairly predictable. Even if we do not know exactly what is going to come next, we can be fairly certain that, for example, 'e' will be far more common than 'z', that the combination 'qu' will be much more common than any other combination with a 'q' in it, and that the combination 'th' will be more common than 'z', 'q', or 'qu'. After the first few letters one can often guess the rest of the word. English text has between 0.6 and 1.3 bits of entropy per character of the message."
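The numbers in that quote are easy to check with the same formula. Here's a quick sketch (mine, not Wikipedia's); note the English estimate at the end only uses single-character frequencies of one short sample string, ignoring the context effects ('qu', 'th', guessing the rest of the word) the article mentions, so it lands well above the 0.6 to 1.3 bits per character figure:

import math
from collections import Counter

def shannon_entropy(probs):
    # Same formula as the earlier sketch: H = -sum(p * log2(p)).
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(shannon_entropy([0.5, 0.5]))       # fair coin: 1.0 bit
print(shannon_entropy([1.0, 0.0]))       # two-headed coin: 0.0 bits
print(shannon_entropy([1/3, 1/3, 1/3]))  # trit: ~1.58496 bits, i.e. log2(3)

# Crude per-character estimate from single-letter counts of a tiny sample.
sample = "the quick brown fox jumps over the lazy dog"
counts = Counter(sample)
total = sum(counts.values())
print(shannon_entropy([c / total for c in counts.values()]))  # roughly 4+ bits/char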