- 10,901
- 3,782
A while ago I did some posts on the modern way audio is recorded, mastered and distributed.
I have been investigating this further and am writing this post on what I found.
These days, high-quality recordings are often recorded in one-bit DSD, which you can look into (a link I provide later has details). However, DSD is hard to use when creating masters. So, a format where Audio Engineers have an overkill amount of leeway in creating masters, called DXD, was devised (352.8/24, ie 352.8 kHz sampling at 24 bits). Some high-quality producers release their recordings in DXD. I have one, and it sounds glorious. However, have a look at:
What About DXD? Surprise!
CD quality 44.1/16 is good enough if 16 bits are used. But certainly not the full DXD; it is all noise above 50 kHz. Knowing this, some DAC manufacturers have a 50 kHz filter in their DACS.
However, is 16 bits enough? To answer that, we need to look into dithering:
24/192 Music Downloads
So, for distribution, 16 bits are more than good enough. 88.2/16 is likely all that is ever needed; even audiophile nuts do not need 24 bits. Most of the time, 44.1 sampling is enough.
There is a sneaky way to process the DXD file so that only the audio, not the noise, is distributed. It is called lossyWAV:
lossyWAV - Hydrogenaudio Knowledgebase
It is a form of adaptive dither that allows FLAC compression to operate much more effectively.
There is the issue with the ultrasonic noise in DXD being larger than 16 bits, so it is not removed when truncated to 16 bits. The methods of the following article can fix this, as well as explain DSD:
Fundamental Principles Behind the Sigma-Delta ADC Topology: Part 1
It is then easy for a program to determine the minimum sampling rate necessary to prevent aliasing (distortion that occurs if there is any content above half the sampling frequency) and decimate (that is, just throw away unnecessary samples and still have a sampling rate above half the maximum frequency) the recording to that minimum. It will usually be just 44.1 sampling, but a higher sampling rate may occasionally be required.
This can be done by upsampling to 10xDSD. A little math shows it is an exact multiple of all the common sampling frequencies. This decreases noise, plus decimation is trivial.
Also, using lossyWAV and FLAC, files that are close in size to lossy audio files are all that is needed for very transparent audio.
I have been investigating this further and am writing this post on what I found.
These days, high-quality recordings are often recorded in one-bit DSD, which you can look into (a link I provide later has details). However, DSD is hard to use when creating masters. So, a format where Audio Engineers have an overkill amount of leeway in creating masters, called DXD, was devised (352.8/24, ie 352.8 kHz sampling at 24 bits). Some high-quality producers release their recordings in DXD. I have one, and it sounds glorious. However, have a look at:
What About DXD? Surprise!
CD quality 44.1/16 is good enough if 16 bits are used. But certainly not the full DXD; it is all noise above 50 kHz. Knowing this, some DAC manufacturers have a 50 kHz filter in their DACS.
However, is 16 bits enough? To answer that, we need to look into dithering:
24/192 Music Downloads
So, for distribution, 16 bits are more than good enough. 88.2/16 is likely all that is ever needed; even audiophile nuts do not need 24 bits. Most of the time, 44.1 sampling is enough.
There is a sneaky way to process the DXD file so that only the audio, not the noise, is distributed. It is called lossyWAV:
lossyWAV - Hydrogenaudio Knowledgebase
It is a form of adaptive dither that allows FLAC compression to operate much more effectively.
There is the issue with the ultrasonic noise in DXD being larger than 16 bits, so it is not removed when truncated to 16 bits. The methods of the following article can fix this, as well as explain DSD:
Fundamental Principles Behind the Sigma-Delta ADC Topology: Part 1
It is then easy for a program to determine the minimum sampling rate necessary to prevent aliasing (distortion that occurs if there is any content above half the sampling frequency) and decimate (that is, just throw away unnecessary samples and still have a sampling rate above half the maximum frequency) the recording to that minimum. It will usually be just 44.1 sampling, but a higher sampling rate may occasionally be required.
This can be done by upsampling to 10xDSD. A little math shows it is an exact multiple of all the common sampling frequencies. This decreases noise, plus decimation is trivial.
Also, using lossyWAV and FLAC, files that are close in size to lossy audio files are all that is needed for very transparent audio.
Last edited: