| New Reply |
How does Nyquist's theory apply to digital recording of opera music? |
Share Thread | Thread Tools |
| Mar6-12, 07:02 AM | #18 |
|
|
How does Nyquist's theory apply to digital recording of opera music?
This whole business is a massive mixture of theory and practice. If we're talking about the needs of high quality Opera recordings then the 'sampling end' of things is relevant - along with Nyquist matters. If we're squeezing all our 'easy listening' music onto a ipod then the MP coding will do enough damage to cover any ADC inadequacies.
|
| Mar6-12, 07:49 AM | #19 |
|
|
We need to make a choice as to what we want to achieve. Here's a possible one: we try to reproduce what one particular person in the audience would have heard (maybe sitting at "the best seat in the house"). We use a dummy head setup with two microphones. Even the very best microphone doesn't have a flat response curve, and the frequency response usually drops sharply above 20 kHz. Most frequency response charts for professional microphones don't go any further than 20 kHz. Now is a good time to ask the question: what use is sampling the sound at 192 kHz or 384 kHz? The input to the sound card is not the original sound: all the information the sound card is getting is coming from the output of the microphone. If the sound card registers information in frequencies above 100 kHz, it is due to artefacts created by the microphone and has no relation to the original sound. Now that we have a digital version of the recorded sound, we need a way to get it into the ears of the hearer: an amplifier and loudspeakers or headphones. Our ideal is to get the air next to the listeners ears to vibrate in exactly the same way as the air vibrated around the dummy head as we were recording. Suffice it to say that no headphone or loudspeaker can create a pressure wave in the air that has the exact form of the one that is fed to it. So "reconstruct to exact original" is a myth. We can make something that approaches the original. Maybe we can do this well enough that a blindfolded person siting on a chair in a concert hall couldn't tell if the sound they are hearing is coming from a live orchestra or a recording of the same orchestra. That's what really matters: even if some instruments are producing sounds in the ultrasound range, nobody can hear them so they could add nothing to the quality of the recording. To go back to the Nyquist-Shannon theory, this is what it states: "If a function x(t) contains no frequencies higher than B hertz, it is completely determined by giving its ordinates at a series of points spaced 1/(2B) seconds apart." Note that the wave is completely determined when sampled at twice the highest frequency. Sampling at a higher frequency simply doesn't add any more information (here's an article that goes into some detail about this). If the listeners cannot hear frequencies above 20 kHz, there's no sense in sampling at more than twice this frequency. 48 kHz is easily sufficient, not just for the "low quality mp3", but also for a high end opera recording. |
| Mar6-12, 08:49 AM | #20 |
|
Verrrry interesting, folks ....
Thanks to all. Probably the ear wouldn't distinguish between a 20 khz sine vs square wave, http://hyperphysics.phy-astr.gsu.edu.../place.html#c1 and if it won't, that's that in my book. Further that's something i could try if can find where i packed my old headphones three moves ago.. 48K vs 192K could become another "Tubes vs Transistors" debate. old jim |
| Mar6-12, 08:49 AM | #21 |
|
|
First, source signal must not contain any frequencies above f/2, otherwise you'll get aliasing. So you have to use low-pass filter. Perfect low-pass aka brick-wall filters are in the same category as massless springs and absolute rigid bodies, i.e. do not exist. Real-world filters are a lot more sloppy. Plus this filter has to be before ADC in analog domain, and these days everything analog is fiddly and expensive and everything digital is dirt cheap and abundant. Second, you need to use sinc interpolation to re-create the original function, which is again fiddly and expensive etc etc. Raising sampling frequency is the easy way out. As sophiecentaur said, this is a mixture of theory and practice. And marketing. Surely 192Khz sounds 4 times better than 48 :) |
| Mar6-12, 11:19 AM | #22 |
|
|
|
| Mar6-12, 01:31 PM | #23 |
|
|
Goodness, a lot of interesting theories about how recording engineers do their jobs.
I just had to register to clear a few things up. I am not a physicist, but a classical recording engineer. A few things: 1) Sampling rates are usually between 44.1 and 192kHz. 2) If there is a high channel count, 44.1 or 48, is probably the most common. 3) DXD formats like 394kHz are not common and are used primarily for PCM to DSD format conversion, as most Digital Audio Workstations cannot work with DSD formats. 4) High quality converters are coveted for their clean analog circuity and dynamic range, not the sample rate. 5) Most propaganda about the superiority of high sample rates sounding better are sales pitches and have no basis in reality. In truth, some inferior converters will sound better at 2x sample rates (88.2 and 96kHz) because of the stability of the clocking and performance of the converter, not because of the higher frequencies. 6) Forget any wave other than sin waves. Those are artificial waves that are not generated by any known acoustic instrument. We are talking about opera right? 7) Any limitation of the Nyquist theorem is due to the need for analog circuity in audio recording. 16bit audio works alright, but the theoretical limit of 144dB of dynamic range for 24 PCM recording has yet to be achieved due to analog limitations. Therefore a converter cannot output exactly what it records and any frequency rate, even though the converter is trying to produce an identical waveform from the original, (post decimation filter), analog signal. |
| Mar6-12, 04:48 PM | #24 |
|
|
|
| Mar6-12, 04:55 PM | #25 |
|
|
even though Nyquist-Shannon didn't say this back in 1945 or whatever year it was, it has since been commonly expressed in the literature that you must sample at a frequency greater than 2B, if B is the highest possible frequency. |
| Mar6-12, 05:13 PM | #26 |
|
|
okay, for any [itex] -\pi/2 < \theta < +\pi/2 [/itex], the following sampled at 2f will result in the same samples of +1, -1, +1, -1... [tex] x(t) = \frac{1}{\cos(\theta)} \cos(2 \pi f t + \theta ) [/tex] [tex] x(nT) = (-1)^n [/tex] [tex] T \equiv \frac{1}{2f} [/tex] So how are you going to reconstruct the highest component if the samples don't give you any hint as to what [itex]\theta[/itex] is? |
| Mar6-12, 05:25 PM | #27 |
|
|
|
| Mar6-12, 05:39 PM | #28 |
|
|
|
| Mar6-12, 05:55 PM | #29 |
|
|
To be fair to them, they never built one which would have made it clear to them by producing a string of equal value pulses.
It's easy to be wise after the event ;-) |
| Mar7-12, 07:32 AM | #30 |
|
|
great discussion!!!
A practical insight into the above 'sampling' discussions can be found at this wiki article: http://en.wikipedia.org/wiki/T-carrier [T carrier is an older version of what is usually referred to as DS-1 today.] QUOTE]A more detailed understanding of how the rate of 1.544 Mbit/s was divided into channels is as follows. (This explanation glosses over T1 voice communications, and deals mainly with the numbers involved.) Given that the telephone system nominal voiceband (including guardband) is 4,000 Hz, the required digital sampling rate is 8,000 Hz (see Nyquist rate). Since each T1 frame contains 1 byte of voice data for each of the 24 channels, that system needs then 8,000 frames per second to maintain those 24 simultaneous voice channels. Because each frame of a T1 is 193 bits in length (24 channels × 8 bits per channel + 1 framing bit = 193 bits), 8,000 frames per second is multiplied by 193 bits to yield a transfer rate of 1.544 Mbit/s (8,000 × 193 = 1,544,000).[/QUOTE] A voice conversation, or music on a telephone line, gives you a bit of a subjective feel for what 'voice communications' sounds like when sampled at twice the nominal 4,000 hz analog bandwidth the former AT&T alloted for voice telephone calls here in the US. [As I recall, that analog bandwidth was based on subjective tests of human hearing by Bell Labs and what was needed to communicate human emotion and nuance and tones in typical voice conversations.] That digital sample rate resulted in subjectively 'higher quality' digital voice communications because background noise was noticeably reduced. One could tell early on if a conversation was on digital or analog facilities.... [These digital coding schemes required precise clock timing throughout the country via distribution of timing signals from the National Institute of Standards and Technology (NIST), formerly the National Bureau of Standards (NBS). That's not a problem in local audio systems. Loss or impairment of that timing signal was a cause for near 'panic' in testrooms around the country!!!!] But after reading the above posts I AM wondering if any error correction schemes are employed in modern digital audio systems???? [The digital scheme employed by AT&T utilized AMI which was replaced aby B8ZS error correction schemes.] And for those interested in some of the effects of analog filters, http://en.wikipedia.org/wiki/Bandwid...nal_processing) |
| Mar9-12, 10:53 AM | #31 |
|
|
The low pass filter in the case of 44 KHz or even 88 KHz and 96 KHz needs to be very, very steep. That introduces some severe phase distortion that varies sharply with frequency. Golden eared audiophiles with high quality analog equipment can definitely hear that, especially with sounds that are percussive such as triangles and cymbals. It's not just the individual sounds from each instrument or voice that matter but the spatial and timing relationships between them and sharp filters do some strange things with those. (I have experience as both a physicist and professional recording and production engineer) The higher sample rates reduce the requirements for the filter and the result should be much reduced phase distortion in the higher audio frequencies. One of the smarter things that can be done is to upsample the signal to 192 KHz or 384 KHz and then apply the filter though that's not quite as refined as having the original signal there already. |
| Mar9-12, 11:06 AM | #32 |
|
|
Howdy folks.
First post here. I should qualify it by saying that I am a recording engineer, not a scientist. From a practical standpoint, sampling rate depends as much on delivery format and track count as anything else when it comes to recording audio. Most adult humans can't hear much above 16 kHz, but we go with the assumption that the average human can hear up to 20 kHz. Can you perceive anything above that? Hard to say, but most evidence points to the fact that higher sampling rates sound better, because the slope of the anti-aliasing filter can be relaxed so that there are no ripples or resonances in the audio band, and the cutoff frequency can be above 20 kHz, rather than in the audio band. 44.1 kHz and 48 kHz were decided upon 30 years ago, since the first commercially available digital audio recorders were adapted video decks, and we seem to be stuck on multiples of these for PCM encoding. These sampling rates fulfilled two criteria: 1 - they were fast enough to recreate almost the entire bandwidth of human hearing (with rather sharply sloped anti-aliasing filters); 2 - they could function with the frame rates of the video recorders that were being used; A lot of classical recordings, if they are using PCM, will use a sample rate of 96 kHz. Few (if any) of my colleagues are specifying 192 kHz or above. If track counts are high, chances are it will be recorded at 44.1 kHz for CD release, sometimes 48 kHz. Nyquist/Shannon works. If you have listened to a CD in the past 30 years and heard all of the instruments, then you can be a witness. Are there some overtones or ultrasonic components missing? Maybe. But it must be said that most microphones and audio processors don't even pass audio signal much beyond 20 kHz. Even the best of the best might have a 50 kHz bandwidth at most. Your speakers sure don't reproduce anything that high. Which leaves only the quality of the ADC and DAC. High-quality analog components; stable, low-jitter clocks, temperature stability, and such all make far more a difference in the quality of reproduction than the sampling rate. I'll take a Genex or Mytek converter at 44.1 over a "soundcard" at 384 kHz any day, and if you heard the difference, you would too. |
| Mar9-12, 03:57 PM | #33 |
|
|
|
| Mar9-12, 05:09 PM | #34 |
|
|
Although his argument is counter to the way many folks on this side of the pond seem to think. For classical recordings, the lower track counts, the desire to maintain the utmost fidelity, and the reluctance to do much in the way of digital processing often leads folks to go with the 88 or 96 k sampling frequencies, while the pop and rock productions often go with the lower sampling rates because of the need for extreme amounts of processing on many tracks. |
| New Reply |
| Thread Tools | |
Similar Threads for: How does Nyquist's theory apply to digital recording of opera music?
|
||||
| Thread | Forum | Replies | ||
| What maths does Digital-Signal Processing use? Does it apply to other areas? | Electrical Engineering | 2 | ||
| Music Theory | General Math | 11 | ||
| Apply kinetic theory to solids and liquids? | Classical Physics | 2 | ||
| Indie music riding the digital surge | Computing & Technology | 0 | ||