Audio Reconstruction from Image: Curve Recognition Software?

  • Thread starter Thread starter brian0918
  • Start date Start date
  • Tags Tags
    Curve
AI Thread Summary
Software to convert an image of sound waves back into audio is not widely available, but the process is feasible using current technology. The first step involves extracting data from the image, which can be done by analyzing the pixel columns to identify the white pixels that represent the wave's curve. This data can then be translated into audio by understanding how sound is recorded, typically through sampling at high rates, such as 44.1 kHz for .wav files. Each sample corresponds to a value captured by a microphone at specific intervals. For practical implementation, tools like Mathematica can be utilized to generate sound from graphs, potentially using Fourier series to create a loopable audio clip from the extracted wave data. Overall, while no dedicated software exists for this specific task, the combination of image processing and audio generation techniques makes it an interesting project.
brian0918
Messages
44
Reaction score
0
I'm looking for software that can turn an http://people.deas.harvard.edu/~jones/cscie129/nu_lectures/lecture2/snd_vis/waves.jpg back into the audio. Has anyone heard of something like this?

If not, might there be a way I could do it using current software, such as some sort of curve fitting software combined with some sort of audio generation software?

Thanks.
 
Last edited by a moderator:
Computer science news on Phys.org
I recall seeing a article somewhere about someone retrieving the audio from a picture of a groove in a vinyl record.

I don't see this as physics, more of a tecnnology thing.
 
It's part of physics research, though, and most of physics research involves technology.
 
Sounds doable. There's two obvious encessary steps: reading the data from the curves, outputting the audio. The first is not so hard, depending on how the input is described, if it is a black and white image as the one you linked then i would read each column of pixels, in search of the white pixel, the location of the white pixel giving the value (y-coordinate) of the curve at that column (x-coordinate). That's one approach.
For converting the data into audio you'd have to do some research as to how audio is recorded. basically you want to map a sound wave into a sequence of values. Generally speaking audio is recorded by sampling every so often. You have some device, like a microphone, which is sensitive to sound waves. This device encodes its interaction with any sound wave at time t into a value. So if a microphone is exposed to a sound lasting for 1 second, then, if you sample every 1/10th of a second, the microphone will generate 10 interrupts to the computer, in each interrupt providing the value it read at time t. This is they key point, you need to know how the microphone encodes a sample into data, once you know that you can use the image of the wave to produce your samples.
Notice that the sampling rate is usually pretty high. For .wav files the sample rate is 44100Mhz, so 44100 frames per second. Each frame usually having two samples (stereo), or one sample (mono).
This sounds like an interesting project. It would be cool to make a program that converts a curve into the respective sound.
 
well, I know Mathematica can turn a graph into a sound. If you could somehow make a Fourier series of those waves (since they seem reasonably periodic), then you could graph it in mathematica and output a loopable audio clip.
 
In my discussions elsewhere, I've noticed a lot of disagreement regarding AI. A question that comes up is, "Is AI hype?" Unfortunately, when this question is asked, the one asking, as far as I can tell, may mean one of three things which can lead to lots of confusion. I'll list them out now for clarity. 1. Can AI do everything a human can do and how close are we to that? 2. Are corporations and governments using the promise of AI to gain more power for themselves? 3. Are AI and transhumans...
Thread 'ChatGPT Examples, Good and Bad'
I've been experimenting with ChatGPT. Some results are good, some very very bad. I think examples can help expose the properties of this AI. Maybe you can post some of your favorite examples and tell us what they reveal about the properties of this AI. (I had problems with copy/paste of text and formatting, so I'm posting my examples as screen shots. That is a promising start. :smile: But then I provided values V=1, R1=1, R2=2, R3=3 and asked for the value of I. At first, it said...
Back
Top