How does our brain create depth perception from two slightly different images?

  • Thread starter Thread starter Graeme M
  • Start date Start date
  • Tags Tags
    Image
AI Thread Summary
The discussion explores the mechanics of depth perception and binocular rivalry, emphasizing how the brain processes images from two eyes to create a unified perception of depth. When both eyes receive mismatched images, binocular rivalry occurs, leading to alternating perceptions. However, when intentionally crossing the eyes, distinct images can be seen simultaneously, suggesting that the brain can still match and process these images without them fading into one another. The conversation highlights the complexity of visual processing in the brain, particularly involving various cortical areas and ocular dominance columns, which contribute to how depth and 3D images are perceived. The phenomenon of creating a new 3D image from separated representations, such as overlapping letters on keys, illustrates the brain's ability to manipulate visual information creatively. Overall, the mechanics behind these processes remain partially understood, with ongoing questions about how different visual areas contribute to conscious awareness and perception.
Graeme M
Messages
325
Reaction score
31
When we see things, we typically see one image which appears to have depth (if we have two functional eyes of course). I understand this is because the images from each eye are matched, and the slightly different perspective caused by the horizontal displacement of the eyes is used to generate the perception of depth. It seems that when images from both eyes do not match, some kind of contention process causes the separate images to alternate in conscious perception (binocular rivalry).

That seems clear enough.

If however I deliberately cross my eyes, I now see two distinct representations of the scene, for example, a cube on my desk can be made to appear as two distinct cubes. These remain within perception without alternating which shows that the binocular process must still be able to match both images (although I can with a little effort cause one or the other, or both, to fade from perception, but that does require effort - typically, both are distinctly visible).

But what on Earth is going on that I see two images? There is after all no "screen" in my head against which the images are "projected". I can understand that a single conscious scene is generated perceptually via some sort of fusing of the two images. But what are the mechanics behind disassociating the fused image?
 
Biology news on Phys.org
When I look at a steropair without a viewer, I first see the two separate images, then I see a third in between which is 3D (or in this case, the perceptual illusion of 3D).
I can then focus in the 3D image and ignore the others.

I can't say what is exactly going on, but it should be noted that there are a several of different cortical visual areas in the brain (several in the cortex, but also in the optic tectum (an important midbrain visual relay area, called the lateral geniculate in humans)). The wikipedia article cites a study of V2 (the second cortial visual area) showing responses to features of binocular disparity.
Which of these can be brought to conscious awareness is not clear to me. Any of them may be the "screen" the consciousness views at different times.
Within the overall visual map in the main visual field are neighboring "ocular dominance columns" which receive input from one eye or the other from the same area of the visual field. This would be input from the two eyes looking at the same external object.
 
Yes, you can do the same with a computer keyboard. By crossing your eyes, you can cause the two images to converge in a 3rd space and generate a new "3D" image of the now separated images (if that makes sense). I think this works because of the similarity of the object representation - the repeating block pattern of the keys means you can combine the disassociated images of the key blocks because they are broadly the same ie you've disassociated the previously fused single representation of the keys, and created a new fused image from those now separated images, or something like that - I have no idea of the mechanics of that! What's cool about that is that the new 3D image of a single key has on its surface different letters, for example you can combine the X and C keys into a new 3D object with the letters X and C overlaid, and bingo, you can somehow "focus" on either the X or C and the other fades out...
 
Looks like I messed up one of my links.
This:
BillTre said:
several of different cortical visual areas
should be this:
several of different cortical visual areas
This has the discussion of the binocular disparity sensitive area.
 
https://www.nhs.uk/mental-health/conditions/body-dysmorphia/ Most people have some mild apprehension about their body, such as one thinks their nose is too big, hair too straight or curvy. At the extreme, cases such as this, are difficult to completely understand. https://www.msn.com/en-ca/health/other/why-would-someone-want-to-amputate-healthy-limbs/ar-AA1MrQK7?ocid=msedgntp&cvid=68ce4014b1fe4953b0b4bd22ef471ab9&ei=78 they feel like they're an amputee in the body of a regular person "For...
Thread 'Did they discover another descendant of homo erectus?'
The study provides critical new insights into the African Humid Period, a time between 14,500 and 5,000 years ago when the Sahara desert was a green savanna, rich in water bodies that facilitated human habitation and the spread of pastoralism. Later aridification turned this region into the world's largest desert. Due to the extreme aridity of the region today, DNA preservation is poor, making this pioneering ancient DNA study all the more significant. Genomic analyses reveal that the...
Back
Top