Understanding the Basics of FFT for Image Processing

btb4198 · Jun 14, 2022

if you watch this Video :
The Julia Programming Language
at time marker 28:00, You will see that Grant takes an FFT of an Image.
In order to do a FFT, you need to know the sampling rate, but what is the sampling rate of one image?
And what if your image is not size 2^N ? Does the program just pad with zero? but that would add noise right ?
He conveniently did not explain any of this, I am very disappointed in MIT Right now.

Baluncore · Jun 14, 2022

btb4198 said:

In order to do a FFT, you need to know the sampling rate, but what is the sampling rate of one image?

When you take the FT of an image, you treat the pixels as individual samples.
You are computing spatial frequencies, not temporal frequencies.

btb4198 said:

And what if your image is not size 2^N ? Does the program just pad with zero? but that would add noise right ?

You could pad with zeros or wrap the image around, before multiplying by a window function. The window function removes the high frequency noise due to the edge of the image.
There are also FFT algorithms that factorise the pixel dimensions, then do different radix FFTs for those factors. But they usually take longer to generate the result than an image cut down to 2ⁿ, or padded out to 2ⁿ.

btb4198 · Jun 14, 2022

Baluncore said:

When you take the FT of an image, you treat the pixels as individual samples.
You are computing spatial frequencies, not temporal frequencies.You could pad with zeros or wrap the image around, before multiplying by a window function. The window function removes the high frequency noise due to the edge of the image.
There are also FFT algorithms that factorise the pixel dimensions, then do different radix FFTs for those factors. But they usually take longer to generate the result than an image cut down to 2ⁿ, or padded out to 2ⁿ.

Baluncore thanks.
question, so the sampling frequency Fs would be 1hz then ?

berkeman · Jun 14, 2022

btb4198 said:

so the sampling frequency Fs would be 1hz then ?

No, one pixel. What is the image being projected on? You can use units of pixel size in mm or um or whatever the relevant spatial sampling rate is.

Mark44 · Jun 14, 2022

btb4198 said:

You will see that Grant takes an FFT of an Image.

No, Grant is doing a Fourier Transform, not a Fast Fourier Transform (FFT).

Mark44 · Jun 14, 2022

btb4198 said:

And what if your image is not size 2^N ?

I don't believe that any of the images were of size 2^N pixels. The image of the cat was 500 X 399 pixels.

btb4198 said:

He conveniently did not explain any of this, I am very disappointed in MIT Right now.

This was a 30+ minute lecture. You can't expect him to put in all the details in such a short timeframe.

btb4198 · Jun 14, 2022

btb4198 said:

Baluncore thanks.
question, so the sampling frequency Fs would be 1hz then ?

Balucore another question,

A FFT is done on a 1D-array, like a graph or a Spectrum.
So would convert the 2-D image into a 1D-array by doing :
(this is pseudocode)
image[0 ,0] = image [0] -> image[width, height] = image[size] where size is = to width * height

and then run FFT get corresponding frequencies and then place them back into the image in there correct location with respect to original indices ?

I guess that is what Julia is doing...

Baluncore said:

You could pad with zeros or wrap the image around, before multiplying by a window function. The window function removes the high frequency noise due to the edge of the image.

When you say this, do you mean, if Size is not equate to 2^N you add image[0,0] to image [size +1] untel image.lenght == 2^N?
or did I miss understand that ?

btb4198 · Jun 14, 2022

berkeman said:

No, one pixel. What is the image being projected on? You can use units of pixel size in mm or um or whatever the relevant spatial sampling rate is.

I believe Grant is just using pixels. Fs has to be in Hz right ? and 1/1 = 1Hz. I am not saying this right ?
Fs
fs = 1/T Hz

berkeman · Jun 14, 2022

btb4198 said:

Fs has to be in Hz right ?

No! Fourier transforms apply to either time domain or spatial domain samples.

btb4198 · Jun 14, 2022

berkeman said:

No! Fourier transforms apply to either time domain or spatial domain samples.

Sorry, I did not know

btb4198 · Jun 14, 2022

Mark44 said:

I don't believe that any of the images were of size 2^N pixels. The image of the cat was 500 X 399 pixels.

This was a 30+ minute lecture. You can't expect him to put in all the details in such a short timeframe.

Oh, I have been trying to watch all the videos in this playlist and so far they never explain this.

Baluncore · Jun 14, 2022

btb4198 said:

A FFT is done on a 1D-array, like a graph or a Spectrum.
So would convert the 2-D image into a 1D-array by doing :

The 2D transform is done by, say;
Replace each row of pixels with its FFT spatial frequency coefficients.
Then replace each column of coefficients with its FFT.
You then have 2D spatial frequencies.
Multiply the 2D spatial freq image by a coefficient mask to convolve or filter the picture.
Inverse transform the columns, then the rows.
Look at the modified image.

Understanding the Basics of FFT for Image Processing

1. What is FFT and how does it work?

2. How is FFT applied to images?

3. What are the benefits of taking the FFT of an image?

4. Can you explain the steps involved in taking the FFT of an image?

5. Are there any limitations to taking the FFT of an image?

Similar threads

Hot Threads

Recent Insights