Absolute compression = true randomness?

In summary: The decimal representation of 1/3 takes an infinite number of digits, that's true, but it is not an infinite number.When it comes to computer generated random codes, you need to be pragmatic.
  • #1
serp777
117
6
So I've read from various places that maximal compression produces a truly random output. But suppose you were going to apply compression to a truly random series of numbers with something like LZW. Well wouldn't it be the case that since randomness often produces sequences which do not appear to be random, that therefore compression would work at some point in a random sequence of numbers? In the case of finite sequences it means that the efficiency of LZW here would be random since a true random number generator could produce a compressible sequence of numbers.

Thus true randomness does not mean absolute compression. In an infinite sequence of random numbers you would expect it to contain many sequences which don't actually seem random like how randomness could produce the sequence 111111111111 purely by chance. Its similar to how in PI you can find sequences of numbers which don't appear to be random, but in the context of a larger set of numbers the distribution between all different numbers is the same.

So clearly this is compressible. This implies that an infinite sequence of numbers is infinitely compressible in theory right? So really absolute compression != true randomness because it only works in some cases where a finite sequence of numbers was random and has no sequence or pattern of numbers. From my understanding of randomness, it would be impossible, in an infinite sequence of numbers, to find no compressible patterns at any point. I don't have a proof for this but if anyone knows of one or can show me how I'm wrong i'd love to hear it. Thanks.
 
Technology news on Phys.org
  • #2
Infinitely compressible would mean that every set of data could be reduced to single number.
So every number should uncompress to give a unique set of data.
That doesn't makes sense (to me),.
What unique set of data is represented by for example the number 3?
 
  • #3
rootone said:
Infinitely compressible would mean that every set of data could be reduced to single number.
So every number should uncompress to give a unique set of data.
That doesn't makes sense (to me),.
What unique set of data is represented by for example the number 3?

An infinite number is one that goes on forever and so there are an infinite number of compressible sequences inside a completely random number. 3 obviously doesn't represent infinity so that point is moot. So compression could run an infinite number of times especially considering compression produces a new number each time it runs so it spawns off an infinite number of new numbers.

Infinity simply doesn't work that way. Consider the infinite hotel with infinite buses bringing people. Just because the hotel is infinite and takes on an infinite number of people doesn't mean that there are no buses left--there are still an infinite number of buses and an infinite number of rooms.
 
  • #4
serp777 said:
An infinite number is one that goes on forever
No. An infinite number is one that is larger than any finite number. By your definition, 1/3 = 0.3333... would be an infinite number -- it is not. The decimal representation of 1/3 takes an infinite number of digits, that's true, but it is not an infinite number.
serp777 said:
and so there are an infinite number of compressible sequences inside a completely random number. 3 obviously doesn't represent infinity so that point is moot. So compression could run an infinite number of times especially considering compression produces a new number each time it runs so it spawns off an infinite number of new numbers.
This makes no sense to me.
serp777 said:
Infinity simply doesn't work that way. Consider the infinite hotel with infinite buses bringing people. Just because the hotel is infinite and takes on an infinite number of people doesn't mean that there are no buses left--there are still an infinite number of buses and an infinite number of rooms.
 
Last edited:
  • #5
serp777 said:
In an infinite sequence of random numbers you would expect it to contain many sequences which don't actually seem random like how randomness could produce the sequence 111111111111 purely by chance.
You won't get 111111111111 very often. So how are you going to compress it? What you gain for an occasional 111111111111, you will loose elsewhere.
When it comes to computer generated random codes, you need to be pragmatic. For example, if the purpose is cryptography, you are looking for something with even distribution (all codes equally possible) that cannot be reproduced. You might do some sanity checks on it - just to make sure your random bit generator hasn't broken but that's it. For a Monte Carlo analysis, you may want a reproducible series that doesn't have patterns that match with your problem set. For a white noise generator, you would be looking for good frequency distribution and no repetition.
 
  • #6
Mark44 said:
No. An infinite number is one that is larger than any finite number. By your definition, 1/3 = 0.3333... would be an infinite number -- it is not. The decimal representation of 1/3 takes an infinite number of digits, that's true, but it is not an infinite number.
This makes no sense to me.

Its infinite because it has an infinite number of decimal places. I didn't say infinite in magnitude i just said infinite.That second part refers to an infinite number like pi for instance which has random decimal numbers. The idea is that contained in something like pi is all possible patterns and sequences and so there are an infinite number of compressions that could take place.
 
  • #7
serp777 said:
Its infinite because it has an infinite number of decimal places. I didn't say infinite in magnitude i just said infinite.
Your use of "infinite number" is nonstandard, and one that most mathematicians would disagree with you on.
serp777 said:
That second part refers to an infinite number like pi for instance which has random decimal numbers. The idea is that contained in something like pi is all possible patterns and sequences and so there are an infinite number of compressions that could take place.
I don't see this going anywhere, and it seems more like philosophy than mathematics, so I am closing this thread.
 

1. What is absolute compression?

Absolute compression is a data compression technique that aims to achieve the smallest possible file size by removing all unnecessary information from a file. This can include removing redundancy, irrelevant data, and other forms of data that do not contribute to the overall meaning or purpose of the file.

2. How does absolute compression work?

Absolute compression works by analyzing the data in a file and identifying patterns or redundancies. It then removes these patterns and replaces them with shorter codes or symbols, resulting in a smaller file size. The compressed file can then be decompressed to its original form using a decompression algorithm.

3. What is true randomness?

True randomness refers to a process or event that is completely unpredictable and has no discernible pattern. This can occur in natural phenomena, such as weather patterns or radioactive decay, or through random number generators in computer programs.

4. How is absolute compression related to true randomness?

Absolute compression and true randomness are related in the sense that absolute compression aims to remove all patterns and redundancies from a file, resulting in a file that appears to be random. However, this is not true randomness as the file still follows a specific algorithm for compression and can be decompressed to its original form.

5. Is absolute compression better than other compression techniques?

This depends on the specific use case and the type of data being compressed. Absolute compression can achieve smaller file sizes compared to other compression techniques, but it may also result in a loss of some data or information. Other compression techniques, such as lossless compression, may be more suitable for certain types of data where preserving all information is necessary.

Similar threads

  • Programming and Computer Science
Replies
1
Views
625
Replies
25
Views
3K
  • Programming and Computer Science
Replies
13
Views
3K
  • Programming and Computer Science
Replies
1
Views
1K
  • Programming and Computer Science
Replies
10
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
10
Views
910
  • Thermodynamics
Replies
20
Views
1K
Back
Top