Audio Analysis/Synthesis, DSP/FFT/Spectrogram

  • Thread starter Thread starter neurocomp2003
  • Start date Start date
  • Tags Tags
    Audio
AI Thread Summary
The discussion focuses on understanding digital audio for brain modeling research, specifically regarding microphone input and speaker output. Key questions include the format of raw sound buffers, their compatibility with FFT, and the interpretation of spectrogram axes. The WAV format is chosen for its lossless quality and compatibility across systems. Resources are shared to help clarify audio file formats and the distinction between file formats and codecs. The conversation highlights the need for foundational knowledge in DSP to effectively analyze audio.
neurocomp2003
Messages
1,359
Reaction score
4
Hi, I'm trying to understand digital audio on my own for brain modelling research (i'm trying to wire up input to neuralnet, which I've done with a vision component using a camera).
[edited...] i forgot to mention that the ultimate goal is to have microphone input and speaker output.

I've made a similar inquiry in the computer->software forum but was focused on software rather than the dsp part, and with no reply. In this post i'd rather focus on the dsp/audio part. I use the Windows OS

Questions :
[Q1] What is the format of raw sound/wave buffer (that can be played by soundcard) and is it platform and hardware(soundcard) dependent? In image buffers you have some order of RGBA or some other colour scheme. Is this the same for the soudn buffer some sequence of [t,Hz,Phase,Amplitude]? Is this format PCM? and links to references would help.

[Q2] Is the raw sound buffer the same input that should go into a FFT

[Q3] What is displayed in the 3rd axis of spectrogram [t vs Hz vs ###]? Is it the PSD of FFT and is the PSD just log(sqrt(||(Re,Im)||)) or is it just the Re component or somthing to do with amplitude??

[Q4] Is it better to perform audio analysis with FFT or wavelet [ spectrogram vs scaleogram? I came up on a link that says wavelets are better for both Hz and time resolution.

THanks for the time, I'm really new to DSP.

Oh ya, any links to newsgroups where i can ask these questions would also help.

Most of my google searches come up with how to play sound, and with window functions, sample rates and other parameters to wave file. But doesn't actually say what goes into a sound buffer.
 
Last edited:
Engineering news on Phys.org
To Neuro,

Interesting subject.
I have chosen the WAV format for my work.
Note:
(1) it is lossless
(2) it is simple (and ugh! max size).
(3) works between my Windows and Linux systems.

Check these URLs for a starter.

WIKI
Audio File format Definition
http://en.wikipedia.org/wiki/Audio_file_format
"It is important to distinguish between a file format and a codec."

NCH
Audio File Formats Types:
http://www.nch.com.au/acm/formats.html
Note: raw header-less PCM is a file-format frequently called "WAV".

http://www.digitaltips.org/audio/audio101.asp

http://wiki.audacityteam.org/index.php?title=WAV
Note: Little Endian vs. Big Endian transmission methods.

http://edutechwiki.unige.ch/en/Digital_audio

Hope that helps get you started.

glene77is, Memphis, TN
 
Last edited by a moderator:
Hi all, I have a question. So from the derivation of the Isentropic process relationship PV^gamma = constant, there is a step dW = PdV, which can only be said for quasi-equilibrium (or reversible) processes. As such I believe PV^gamma = constant (and the family of equations) should not be applicable to just adiabatic processes? Ie, it should be applicable only for adiabatic + reversible = isentropic processes? However, I've seen couple of online notes/books, and...
Thread 'How can I find the cleanout for my building drain?'
I am a long distance truck driver, but I recently completed a plumbing program with Stratford Career Institute. In the chapter of my textbook Repairing DWV Systems, the author says that if there is a clog in the building drain, one can clear out the clog by using a snake augur or maybe some other type of tool into the cleanout for the building drain. The author said that the cleanout for the building drain is usually near the stack. I live in a duplex townhouse. Just out of curiosity, I...
I have an engine that uses a dry sump oiling system. The oil collection pan has three AN fittings to use for scavenging. Two of the fittings are approximately on the same level, the third is about 1/2 to 3/4 inch higher than the other two. The system ran for years with no problem using a three stage pump (one pressure and two scavenge stages). The two scavenge stages were connected at times to any two of the three AN fittings on the tank. Recently I tried an upgrade to a four stage pump...
Back
Top