i
i
i
i
i
i
i
i
8.3. A Brief Look at Some Advanced Ideas in Computer Vision 199
noise in the image and errors in matching the correspondences. Resolving the diffi-
culty is a major problem in computer vision, one that is the subject of current and
ongoing research.
Intuitively, it would seem evident that if the effect of noise and errors are
to be minimized then the more points taken into account, the more accurately
the coefficients of the camera matrix could be determined. This is equally
true for the coefficients of a homography between planes in P
2
and for the
techniques that use two and three views to recover 3D structure. A number
of very useful algorithms have been developed that can make the best use of
the point correspondences in determining P etc. These are generally referred
to as optimization algorithms, but many experts who specialize in computer
vision prefer the term estimation algorithms.
8.3.4 Estimation
So, the practicalities of determining the parameters of the camera matrix are
confounded by the presence of noise and error. This also holds true for per-
spective correction and all the other procedures that are used in computer
vision. It is a major topic in itself to devise, characterize and study algorithms
that work robustly. By robustly, we mean algorithms that are not too sensi-
tive to changes in noise patterns or the number of points used and can make
an intelligent guess at what points are obviously in gross error and should be
ignored (outliers).
We will only consider here the briefest of outlines of a couple of algo-
rithms that might be employed in these circumstances. The direct linear
transform (DLT) [7, pp. 88–93] algorithm is the simplest; it makes no at-
tempt to account for outliers. The random sample consensus (RANSAC) [5]
and the least median squares (LMS) algorithms use statistical and iterative
procedures to identify and then ignore outliers.
Since all estimation algorithms are optimization algorithms, they attempt
to minimize a cost function. These tend to be geometric, statistical or al-
gebraic. To illustrate the process, we consider a simple form of the DLT
algorithm to solve the problem of removing perspective distortion. (Using
four corresponding points (in image and projection), we obtained eight linear
inhomogeneous equations (see Equation (8.4)), from which then the coeffi-
cients of H can be estimated.)
If we can identify more than four matching point correspondences, the
question is then how to use them. This is where the DLT algorithm comes
in. To see how to apply it, consider the form of the matrix in Equation (8.4)