Creating 3d coordinates from stereoscopic images

Click For Summary

Discussion Overview

The discussion revolves around the process of converting 2D coordinates from stereoscopic images into 3D coordinates (x, y, z). Participants explore the relationship between depth, disparity, and the geometry of the camera setup, focusing on the mathematical formulation needed for this conversion.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Exploratory

Main Points Raised

  • One participant describes a project involving a stereoscopic camera setup to capture depth and 2D coordinates, seeking a formula for converting these into 3D coordinates.
  • Another participant questions whether depth corresponds to the z-coordinate and requests clarification on the imaging context, suggesting the inclusion of illustrations.
  • A participant summarizes the problem, outlining the configuration of two cameras and the need to find the 3D position of a point object based on its 2D coordinates in both images.
  • Links to external resources are provided, which may assist in understanding the triangulation process for determining coordinates from camera positions.
  • One participant mentions the distance from the pinhole to the image plane and expresses a desire for a solution that accommodates varying distances.
  • Another participant elaborates on the relationships between the x-coordinate, z-coordinate, and the focal length, suggesting a formula for the x-coordinate based on the disparity and encouraging further exploration of the y-coordinate.

Areas of Agreement / Disagreement

Participants express varying levels of understanding and interpretation of the problem, with no consensus reached on the exact formulas or methods to be used for the conversion to 3D coordinates. Multiple competing views and approaches remain present in the discussion.

Contextual Notes

Participants mention the need for calibration of the focal length and the camera separation, indicating that the discussion may depend on specific definitions and assumptions regarding the camera setup and measurements.

steven77723
Messages
4
Reaction score
0
Hi, I'm working on a project that would take a 3d image using stereocopic camera and would record the depth and the 2d (x1,y1) , (x2,y2) coordinates of a single point in the image. The depth is found using the focal point, disparity, and the distance between the difference the camera sees on the point.

I was wondering if there is a formula that would convert this point into a (x,y,z) coordinate. I'm stuck because even if the physical location of the point may be different, the camera would reply with a different depth, but same x,y values.

Thanks,
 
Engineering news on Phys.org
isn't the depth the z coordinate? Could you be more specific about what you are imaging, and maybe include some kind of illustration?
 
Is this a correct description of the problem ?

You have two identical cameras. The pinholes of those cameras are positioned on the x-axis, the first at the origin, looking along the z-axis, with the second camera, at x = s, with camera 2 optical axis parallel to that of camera 1.

The images from the two cameras each have a cardinal reference point, defined by the intersection of the image and the optical axis. A point object, visible on both images is located relative to the reference point; on image #1 at (x1, y1), and on image #2 at (x2, y2).

Find the position of the point object(x, y, z), given s, and the (x1, y1) and (x2, y2) offsets.

Do you have the distance of the pinhole from the image plane?
 
Baluncore, Thats exactly how I should've worded it.
The distance from the pinhole to image plane is roughly about 50cm, but I want it to work for any value.
 
steven77723 said:
The distance from the pinhole to image plane is roughly about 50cm, but I want it to work for any value.

The x-coordinate is a function of s.

The z-coordinate is a function of the image plane to pinhole distance.
With a lens, that is the focal length only when focussed at infinity.
Let's be slack and just call it f.

So you must be able to somehow measure or calibrate f, against your camera separation, s.

Draw a view of the object and image placement in the x-z plane and you will see it is ratiometric.
x = s * x1 / (x1+x2), I will let you sort out the sign of the x values.
z = f * ?, and the y = ? values.

See the post#4 references from blainiac.
 
  • Like
Likes   Reactions: 1 person

Similar threads

Replies
7
Views
3K
Replies
17
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
Replies
7
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K