How can we represent an image using basis images?

  • Context: Undergrad 
  • Thread starter Thread starter ramdas
  • Start date Start date
  • Tags Tags
    Basis Image Images
Click For Summary

Discussion Overview

The discussion revolves around the representation of images using basis images, specifically through the lens of Fourier transformation. Participants explore concepts such as orthogonality and the nature of basis images, while seeking clarification and examples to better understand these terms in the context of image processing.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • Some participants mention that Fourier transformation allows for the decomposition of images into orthogonal basis images, which can then be reconstructed.
  • One participant describes orthogonal basis images as 2-D periodic functions that can be represented by a Fourier series, with terms involving sine functions and frequencies.
  • Another participant explains that a set of functions can reproduce any image, noting that while an infinite set is ideal, a truncated set is often used for practical approximations.
  • It is suggested that orthogonality means that the basis functions describe unique aspects of the image without overlap in information.
  • A comparison is made between audio signals and image representation, highlighting the transformation of brightness in images to sinusoidal variations using Fourier transformation.
  • Concerns are raised about the assumptions made in Fourier transformation, particularly regarding the periodicity of brightness patterns in images and the potential artifacts in dynamic images.
  • Digital signal processing is discussed, with mention of alternative functions that can be used to minimize perceptible distortions in image representation.

Areas of Agreement / Disagreement

Participants express varying levels of understanding and interpretation of the concepts involved, with no consensus reached on the definitions or implications of orthogonality and basis images. Multiple competing views and explanations remain present throughout the discussion.

Contextual Notes

Participants highlight limitations related to assumptions about periodicity in images and the challenges of using discrete Fourier transforms for dynamic images, but these aspects remain unresolved.

ramdas
Messages
78
Reaction score
0
I have read that using Fourier transformation we can decompose any arbitrary image into othogonal basis images and reconstruct it back.

But i don't understand terms like "othogonal " and "basis image".

So can anybody shower their ideas on the above terms with example ??
 
Physics news on Phys.org
ramdas said:
I have read that using Fourier transformation we can decompose any arbitrary image into othogonal basis images and reconstruct it back.

But i don't understand terms like "othogonal " and "basis image".

So can anybody shower their ideas on the above terms with example ??

If I understand you correctly, the 'orthogonal basis images' are 2-D periodic functions: colloquially, 2-D functions can be decomposed into a discrete or continuous Fourier series, each term being A_mn*sin(nx)*sin(my), where n and m are frequencies and A_mn is the amplitude of that function. Most tutorials present 1-D versions for clarity.
 
See also this video at 3:30

https://www.youtube.com/watch?v=mEN7DTdHbAU
 
In a very simplified sense, we define a set of functions that together can be used to reproduce any image. Realistically, this requires an infinite set so we truncate the number of functions to get a good approximation. These functions are the bases and are like interpolating functions. Being orthogonal means that they more or less describe unique aspects of the image from each other. That is, there is no "overlap" in the information in one basis with all the others.

So we have a set of interpolating functions that efficiently describe any image we may have to a good approximation. Then we only need to know the amplitudes of these functions to reconstruct an image.

More to it than this but that's a very basic explanation. You can also note that the Cartesian vectors are a vector basis of the Cartesian space. We have three bases, x, y, and z vectors. They are orthogonal as the dot products between them are all zero. If we want to describe the location of any point, then we simply state the vector coefficients for the three bases.
 
It is easier (just more familiar, I think) to think back to the way that temporal variations of an audio signal (what you see on an oscilloscope - which shows things in two dimensions - volts and time) being transformable into a set of sinusoidal tones (frequency domain). That's in one dimension. An image can have its brightness (in two dimensions) represented by a three dimensional surface.

To get a representation of this image in terms of sinusoidal variations of brightness and distance (Fourier transformation gives spatial frequency domain) you have to make an assumption and that is that the pattern of brightness over the image repeats itself for ever in every direction (like wallpaper). This is the same assumption as is used when the FFT is used for Audio (etc.) work. This is a Discrete Fourier Transform (DFT) which contains discrete frequency components.

With a static TV picture, a DFT can be used over the whole picture or over square sub-sections of the picture (blocks). For a moving picture, subsequent frames are not the same and doing a simple DFT will produce artefacts - jerkiness and smearing.

Digital signal processing, using sinusoidal basis function is inconveniently long winded and it is normal to use different functions, which end up requiring lower transmission bit rates. The 'Raised Cosine' function can be used and there are some very quick algorithms for doing this. This is the basis for JPEG processing. (Other functions are available, as they say) The secret is to choose a set of functions that produce the least perceptible distortions and to reduce the errors (visible boundaries) moving from one block to the next. when you are using as few components as possible.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 4 ·
Replies
4
Views
11K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 6 ·
Replies
6
Views
7K
  • · Replies 2 ·
Replies
2
Views
6K
Replies
7
Views
2K
  • · Replies 226 ·
8
Replies
226
Views
17K