gop
				
				
			 
			
	
	
	
		
	
	
			
		
		
			
			
				
- 55
- 0
Hi
The source coding theorem says that one needs at least N*H bits to encode a message of length N and entropy H. This supposedly is the theoretical limit of data compression.
But is it? Or does it only apply to situations where only the frequency (probability) of a given symbol is known?
For example, If I know that the symbol x is always followed by the symbol y (or that this happens with a very high probability) couldn't I use this to construct a compression algorithm that needs fewer bits than N*H?
thx
				
			The source coding theorem says that one needs at least N*H bits to encode a message of length N and entropy H. This supposedly is the theoretical limit of data compression.
But is it? Or does it only apply to situations where only the frequency (probability) of a given symbol is known?
For example, If I know that the symbol x is always followed by the symbol y (or that this happens with a very high probability) couldn't I use this to construct a compression algorithm that needs fewer bits than N*H?
thx
 
 
		 
 
		 
 
		