Issue with perspective projection?

Huyanyinglei · Oct 20, 2019

Summary: Perspective projection is often referred to when talking about camera models

I have the following problem. Perspective projection is often referred to when talking about camera models(https://en.wikipedia.org/wiki/3D_projection#Perspective_projection). I don’t think I understand it very well though this concept is taught when I was at junior high or even primary school. I think programmers with a computer vision background may be familiar with it.

I read quite some “tutorials” on perspective projection, also computer vision textbooks like “Computer Vision: Algorithms and Applications”, “Multiple View Geometry in Computer Vision:Second Edition”. But the representation conventions seems to be quite a lot. It’s a little bit unfriendly to beginners. Just to get a feeling, here’s one of them:

May I ask if there’s some good, easy-to-read and self-contained articles that can help beginners like me understand perspective projection? It may explains quite clearly the physical meaning of the parameters of it, especially the “scale factor”.

Any ideas? Thanx in advance.

.Scott · Nov 1, 2019

I ran into the perspective projection early in my career as a Software Engineer and found it to be no problem.
1) translate your space so that the focal point of the camera is at (0,0,0). This is simply subtracting the coordinates of your focal point from all the points that will be projected.
2) rotate you space so that the camera is looking down the Z axis in the -Z direction, and the X and Y axis are as you want them in the projection.
3) Eliminate anything with a non-negative Z (they are in back of the camera.
4) Transform: X=-x/z, Y=-y/z

That's it.

Of course, you may want to project more than just points - such as conic sections. But if you're struggling with that wiki article, start by practicing with with points.

Stephen Tashi · Nov 2, 2019

.Scott said:

That's it.

From the OP's links, the problem the OP asks about an inverse problem to finding screen coordinates. It has to do with finding the 3-D coordinates of the camera from information about the object being viewed and the object's screen coordinates.

.Scott · Nov 3, 2019

Stephen Tashi said:

From the OP's links, the problem the OP asks about an inverse problem to finding screen coordinates. It has to do with finding the 3-D coordinates of the camera from information about the object being viewed and the object's screen coordinates.

I have also run into that kind of problem, but the solution depends on the specifics. If the points on an aerial photograph are associated with Lat/Long/altitude values, then the photographic transform can be worked out using simple linear arithmetic. If there are "too many" points, then a least squares best fit can be determined. In practice, I have never gone from the transformation parameters to the actual camera position and orientation, but I don't see any problem in doing that.
The specific of the answer obviously depends on the specifics of the problem. If the OP wants more suggestions, I will keep an eye on this thread for a few days.

Stephen Tashi · Nov 3, 2019

.Scott said:

If the OP wants more suggestions, I will keep an eye on this thread for a few days.

In case you know the "lambda twist algorithm" the OP asks about it another thread. https://www.physicsforums.com/threads/issue-with-one-p-n-p-method.979183/

http://openaccess.thecvf.com/conten...l_Persson_Lambda_Twist_An_ECCV_2018_paper.pdf

.Scott · Nov 3, 2019

As I said, it's very application-specific. I never attempted to solve it "cold" given only three points. I required that the user specify four points - and preferably 5 or 6. That way I could validate the input. It was too easy for an analyst to misidentify a landmark on either the map or the film.
When I say "cold", sometimes there was other information I had that I could use to determine the mapping. Cold was when that other information was not available.

Issue with perspective projection?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Who May Find This Useful

Similar threads

Undergrad The vector to which a dual vector corresponds

Graduate Confusion about the Moyal-Weyl twist

Undergrad 2 interpretations of bra-ket expression: equal, & isomorphic, but...

Undergrad Spinor calculus

Undergrad Matrix representation of rank-2 spinors

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect