Computer Vision - Ground plane position based on image point

In summary: H \\ x_{im}(\sin\beta\cos\alpha + \cos\beta\sin\alpha) + y_{im}(\cos\beta\cos\alpha - \sin\beta\sin\alpha) + f \cos\alpha &= H \\ x_{im}\sin(\beta+\alpha) + y_{im}\cos(\beta+\alpha) + f \cos\alpha &= H \end{align}Finally, we can solve for X, Y, and
  • #1
jenny_shoars
21
0
When given an image of a scene of something like a hallway or road (looking down this hallway or road) a vanishing point can be determined. Also, the points on the image must lie on a certain line in the real world environment given by:

[itex] x_{im} = f\frac{X}{Z} [/itex] and [itex] y_{im} = f\frac{Y}{Z} [/itex]

where [itex]f[/itex] is the focal length of the camera, [itex](x_{im},y_{im})[/itex] is the image point and [itex](X, Y, Z)[/itex] is the world frame point. This is illustrated below.
img4.gif

I've found that if the camera optical axis is in the same direction as the hallway/road (i.e. the optical center goes to the vanishing point) and the camera is just a certain height above the ground plane, then it's simple to get the position of a given image point on the ground plane. That being, if [itex]H[/itex] is the height, then:
[itex]Y=-H[/itex], [itex]Z=-H[/itex], [itex]X=-frac{x_{im}H}{y_{im}}[/itex]
This all works fine, but I'm getting a little lost when adding in yaw and pitch to the camera reference frame (no roll). After some work I've found that the ground plane can be represented in the camera reference frame by:

[itex]X sin(\alpha) sin(\beta) + Z sin(\alpha) cos(\beta) + Y cos(\alpha) = H[/itex]

And this is where I'm stuck. I'm not sure how to usefully use the above equation with the first two equations to get [itex]X[/itex], [itex]Y[/itex], and [itex]Z[/itex] from [itex]x_{im}[/itex] and [itex]y_{im}[/itex]. Any suggestions? Thank you much!
 
Physics news on Phys.org
  • #2

Thank you for sharing your findings on determining the position of an image point on a ground plane using a camera and vanishing point. I would like to offer some suggestions on how to approach the equation you have mentioned.

Firstly, it is important to understand the different variables involved in the equation. The focal length of the camera, f, is a fixed value that determines the perspective of the image. The image point, (x_{im}, y_{im}), represents the coordinates of the point on the image where the vanishing point is located. The world frame point, (X, Y, Z), represents the real-world coordinates of the point on the ground plane.

Next, to incorporate yaw and pitch into the equation, we need to consider the rotation of the camera. Yaw refers to the rotation of the camera around the vertical axis, while pitch refers to the rotation around the horizontal axis. These rotations can be represented by the angles \alpha and \beta, respectively.

To incorporate these rotations into the equation, we can use the rotation matrix to transform the coordinates from the camera reference frame to the world frame. This transformation can be represented by the following equation:

\begin{pmatrix} X \\ Y \\ Z \end{pmatrix} = \begin{pmatrix} \cos\alpha & 0 & \sin\alpha \\ 0 & 1 & 0 \\ -\sin\alpha & 0 & \cos\alpha \end{pmatrix} \begin{pmatrix} \cos\beta & -\sin\beta & 0 \\ \sin\beta & \cos\beta & 0 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} x_{im} \\ y_{im} \\ f \end{pmatrix}

Now, we can substitute the values of X, Y, and Z from the transformed coordinates into the equation for the ground plane:

\begin{align} X \sin\alpha \sin\beta + Z \sin\alpha \cos\beta + Y \cos\alpha &= H \\ (\cos\alpha x_{im} + \sin\alpha y_{im}) \sin\beta + (-\sin\alpha x_{im} + \cos\alpha y_{im}) \cos\beta + f \cos\alpha &= H \\ x_{im} \sin\beta \
 

1. What is computer vision?

Computer vision is a field of study that focuses on developing algorithms and techniques for machines to interpret and understand visual information from digital images or videos. It involves using various methods to extract meaningful data and insights from images to assist in tasks such as object recognition, motion analysis, and scene reconstruction.

2. How does computer vision determine ground plane position based on image point?

Computer vision uses a combination of techniques such as feature detection, camera calibration, and geometric transformations to determine the ground plane position based on an image point. This involves identifying and extracting features from the image, estimating the camera's intrinsic and extrinsic parameters, and using geometric transformations to map the image point to its corresponding position on the ground plane.

3. What are some applications of ground plane position estimation in computer vision?

Ground plane position estimation has various applications in computer vision, including augmented reality, autonomous navigation, and object tracking. By accurately determining the position of objects in the real world, computer vision systems can overlay virtual objects onto a live video feed, guide autonomous vehicles, and track the movements of people or objects within a scene.

4. What are some challenges in ground plane position estimation using computer vision?

One of the main challenges in ground plane position estimation is dealing with perspective distortion, which occurs when objects appear smaller or larger depending on their distance from the camera. This can affect the accuracy of measurements and require advanced techniques to correct for the distortion. Additionally, changes in lighting, occlusions, and noise in the image can also affect the accuracy of ground plane position estimation.

5. How is ground plane position estimation used in 3D reconstruction?

In 3D reconstruction, ground plane position estimation is used to accurately map 2D images onto a 3D coordinate system. By determining the ground plane position of features or objects in multiple images, computer vision systems can reconstruct a 3D model of the scene. This is useful in applications such as virtual reality, urban planning, and forensics.

Similar threads

Replies
2
Views
759
Replies
4
Views
743
Replies
3
Views
402
  • MATLAB, Maple, Mathematica, LaTeX
Replies
1
Views
594
  • Calculus and Beyond Homework Help
Replies
18
Views
1K
Replies
18
Views
2K
  • Advanced Physics Homework Help
Replies
6
Views
799
  • Engineering and Comp Sci Homework Help
Replies
1
Views
2K
  • Electromagnetism
Replies
4
Views
804
  • Classical Physics
Replies
3
Views
1K
Back
Top