Audio Analysis/Synthesis, DSP/FFT/Spectrogram

neurocomp2003 · Mar 23, 2010

Hi, I'm trying to understand digital audio on my own for brain-modelling research (I'm trying to wire up audio input to a neural net, which I've already done for a vision component using a camera).
[edited...] I forgot to mention that the ultimate goal is to have microphone input and speaker output.

I made a similar inquiry in the computer->software forum, but it focused on software rather than the DSP part and got no reply. In this post I'd rather focus on the DSP/audio part. I use Windows.

Questions :
[Q1] What is the format of a raw sound/wave buffer (one that can be played by a sound card), and is it platform and hardware (sound card) dependent? Image buffers hold pixels in some order of RGBA or another colour scheme. Is it the same for a sound buffer, i.e. some sequence of [t, Hz, phase, amplitude]? Is this format PCM? Any links to references would help.

[Q2] Is the raw sound buffer the same input that should go into an FFT?

[Q3] What is displayed on the 3rd axis of a spectrogram [t vs Hz vs ###]? Is it the PSD of the FFT, and is the PSD just log(sqrt(||(Re,Im)||)), or is it just the Re component, or something to do with amplitude?

[Q4] Is it better to perform audio analysis with an FFT or with wavelets (spectrogram vs scaleogram)? I came across a link that says wavelets are better for both Hz and time resolution.

Thanks for your time, I'm really new to DSP.

Oh yeah, any links to newsgroups where I can ask these questions would also help.

Most of my Google searches turn up how to play sound, plus window functions, sample rates and other WAV file parameters, but nothing that actually says what goes into a sound buffer.

glene77is · Mar 23, 2010

To Neuro,

Interesting subject.
I have chosen the WAV format for my work.
Note:
(1) it is lossless
(2) it is simple (and, ugh, uncompressed, so the files are as large as they get).
(3) works between my Windows and Linux systems.
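
To make the "simple" part concrete, here is a minimal Python sketch (standard library only) that builds a short PCM buffer, wraps it in a WAV container in memory, and reads it back. The sample rate, tone frequency and function names are made up for illustration. The point it tries to show is that a PCM buffer is just a sequence of amplitude samples taken at a fixed rate: time is implied by the sample index, and no Hz or phase values are stored.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 8000   # samples per second (assumed for this example)
FREQ_HZ = 440.0      # test tone frequency (assumed)
N_SAMPLES = 800      # 0.1 s of audio at 8000 samples/s


def make_sine_pcm16(n=N_SAMPLES, rate=SAMPLE_RATE, freq=FREQ_HZ):
    """Return n signed 16-bit amplitude samples of a half-scale sine tone."""
    return [int(round(0.5 * 32767 * math.sin(2 * math.pi * freq * i / rate)))
            for i in range(n)]


def write_wav_bytes(samples, rate=SAMPLE_RATE):
    """Pack samples as little-endian int16 and wrap them in a WAV container."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)    # mono
        w.setsampwidth(2)    # 2 bytes = 16 bits per sample
        w.setframerate(rate)
        w.writeframes(struct.pack("<%dh" % len(samples), *samples))
    return buf.getvalue()


def read_wav_bytes(data):
    """Return (sample_rate, samples) from 16-bit mono WAV bytes."""
    with wave.open(io.BytesIO(data), "rb") as w:
        raw = w.readframes(w.getnframes())
        count = len(raw) // 2
        return w.getframerate(), list(struct.unpack("<%dh" % count, raw))


if __name__ == "__main__":
    pcm = make_sine_pcm16()
    wav = write_wav_bytes(pcm)
    rate, decoded = read_wav_bytes(wav)
    print("header starts with:", wav[:4], wav[8:12])
    print("sample rate:", rate, "first samples:", decoded[:5])
```

The list returned by read_wav_bytes is the kind of amplitude-versus-time sequence one would normally hand to an FFT, one block of samples at a time.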

Check these URLs for a starter.

WIKI
Audio File format Definition
http://en.wikipedia.org/wiki/Audio_file_format
"It is important to distinguish between a file format and a codec."

NCH
Audio File Formats Types:
http://www.nch.com.au/acm/formats.html
Note: a "WAV" file usually holds uncompressed PCM samples behind a short RIFF header; header-less PCM is normally just called "raw".

http://www.digitaltips.org/audio/audio101.asp

http://wiki.audacityteam.org/index.php?title=WAV
Note: Little Endian vs. Big Endian byte order (WAV stores multi-byte samples little-endian).
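
As a small illustration of why the byte order matters, the Python sketch below packs one 16-bit sample both ways and then decodes the little-endian bytes with the wrong byte order. The sample value 1000 is arbitrary and chosen only for the example.

```python
import struct

sample = 1000  # arbitrary 16-bit sample value, 0x03E8

le = struct.pack("<h", sample)  # little-endian: low byte first
be = struct.pack(">h", sample)  # big-endian: high byte first

right = struct.unpack("<h", le)[0]  # decoded with the matching byte order
wrong = struct.unpack(">h", le)[0]  # same bytes read as big-endian

print(le, be, right, wrong)
```

Reading a buffer with the wrong byte order does not raise an error; it just yields different numbers, which usually sounds like loud noise on playback.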

http://edutechwiki.unige.ch/en/Digital_audio

Hope that helps get you started.

glene77is, Memphis, TN
