Andy Resnick said:
I'm not an expert in information theory, but IIRC, a string of identical bits has *zero* information (and zero entropy), because of the way information is encoded in a signal. See the way entropy of a signal is defined here:
http://en.wikipedia.org/wiki/Information_theory
"The entropy, H, of a discrete random variable X is a measure of the amount of uncertainty associated with the value of X."
According to that link, entropy is a function of your expectations. If your options are 1 and 0 and you don't know which you're going to get, you have maximum entropy. The values that you actually get, 00000, 01101, etc, don't (typically) change the entropy level.
If you know that the sum of your bits is odd, you have 5 total bits, and you know what the first 4 are, then you can predict the final bit - there is no entropy or uncertainty associated with it. It is only when your bits are correlated like this that the actual series of bits has anything to do with total entropy.
This type of entropy doesn't have anything to do with physical E=mc^2 energy though. If it did, gaining knowledge about what's on my flash drive would change its mass. If my drive has all 0s and I expect a random sequence of 1s and 0s, it is at maximum entropy. If my drive has all 0s and I expect it to have all 0s then it has no entropy. You don't need a physical change to change your information entropy, so it can't be related to mass.
There's no inherent "amount of information" in a string of bits, no matter what they are. It all depends on what you are expecting - what algorithm you are using to encode or decode your bits.
The mass would change if the drive physically has more electrons on it when storing 1s or 0s and your density of 1s and 0s changes. That's the only way I can think of it changing though.