Dismiss Notice
Join Physics Forums Today!
The friendliest, high quality science and math community on the planet! Everyone who loves science is here!

BrainModel: Audio Analysis/Synthesis Software, FFT/Spectograms, Mic/Spkr

  1. Mar 22, 2010 #1
    Not sure if this belongs in this forum, its subforums or engineering.
    Thanks for reading, the post is long. Any suggestions would be grateful.
    Application Field: Brain Modelling
    Topic: Audio Analysis then Audio Synthesis
    OS: Microsoft Windows,
    IDE: MSVC (unless the package is for cygwin that is workable in MSVC)
    Language: C/C++ [HPC/cluster reasons]
    [Q1]: What packages to use to make a spectrogram? And am i on the right track?
    [Q2]: is spectrogram or scaleogram better for the task below?
    [Q3]: is onboard soundcard capable of the task below? [card: sigmatel HD audio codec]
    [Q4]: what format is the input sound signal to FFT algorithm?
    [Q5]: if the FFT algorithm outputs (Re,Im) pairs, does a spectrogram use the Im values.
    Desire: [Phase 1] I need to embed a spectrogram[t v Hz v Amp?] from realtime sound(microphone) or loaded file(wav or mp3) into a C/C++ GUI application(currently wx) to perform audio analysis whereby the spectrogram or some arrangement of the data will be later fed into a neural network to mimic hair cells (which includes vision). [Phase 2] Once phase 1 is setup, I then need to perform audio synthesis via phonemes and basic music oral sounds to mimic the vocal tract or some higher level area.

    OR perhaps i should reverse the tasks.
    What I know:
    -my sound card is sigma tel high definition audio codec
    -the spectrogram process: get sound buffer--> feed to FFT/wavelet --> feed to GUI
    -how to load/play/save wav/mp3 sounds via packages (openal, directx/midi, RT, portaudio,SDL)

    What I've Tried:
    - [still working through] marsyas library , sndobj library but both give fuzzy output signal via my soundcard, and neither seems to access the hardware simultaneously for both input(mic) and output(spk)
    - i've looked at CLAM++ but sadly i require a MSVC compatible library because i rely on MSVC IDE
    What i'd prefer not to use:
    -speech specific software like SAPI or the one from CMU, though i probably need too.
    -synthesis specific software like STK.
    What i'm looking for in the packages
    -open source?
    - I'd rather use a package capable of both analysis and synthesis which includes some FFT and midi capability. Though presumably one can get the FFT elsewhere but it seems its included in linux APIs. Hopefully the API is easy to use with pseudo code looking like

    [] GetMic(...)/LoadSoundFile(...)
    [] GetSoundBuffer(...)
    [] CallingFFT(...)

    - also being able to insert frequencies with duration and timbre would be useful if the package doesn't have midi.

    -pointing me to a newsgroup/mailgroup would also help
    thank you again for reading,
    Last edited: Mar 22, 2010
  2. jcsd
Share this great discussion with others via Reddit, Google+, Twitter, or Facebook

Can you offer guidance or do you also need help?
Draft saved Draft deleted