3.2 Structure from Motion

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Given two corresponding images taken with different lighting conditions, a unique solution can be obtained for the surface orientation at each point.

Example 3.4 Illustration of photometric stereo.

Two images of a sphere, taken with different lighting conditions (the same source has been put in two different position), are shown in Figure 3.12(a) and Figure 3.12(b), respectively. The surface orientation determined by photometric stereo is shown in Figure 3.12(c) by using an orientation-vector representation (the vector is represented by a line segment). It can be seen that the surface orientation is perpendicular to the paper in places closest to the center of the sphere, while the surface orientation is parallel to the paper in places closest to the edges of the sphere.

The reflectance properties of a surface at different places can be different. A simple example is that the irradiance is the product of a reflectance factor (between 0 and 1) and some functions of orientation. Suppose that a surface, which is like a Lambertian surface except that it does not reflect the entire incident light, is considered. Its brightness is represented by ρ cos θi, where ρ is the reflectance factor. To recover the reflectance factor and the gradient (p, q), three images are required.

Introducing the unit vectors in the directions of three source positions, it has

$S_{j} = \frac{{[- p_{j} - q_{j} 1]}^{T}}{\sqrt{1 + p_{j}^{2} + q_{j}^{2}}} j = 1, 2, 3 (3.33)$ $S_{j} = \frac{{[- p_{j} - q_{j} 1]}^{T}}{\sqrt{1 + p_{j}^{2} + q_{j}^{2}}} j = 1, 2, 3 (3.33)$

Figure 3.12: Computing the surface orientation by photometric stereo.

Then the irradiance is

$E_{j} = ρ (S_{j} • N) j = 1, 2, 3 (3.34)$ $E_{j} = ρ (S_{j} • N) j = 1, 2, 3 (3.34)$

where

$N = \frac{{[- p - q 1]}^{T}}{\sqrt{1 + p^{2} + q^{2}}} (3.35)$ $N = \frac{{[- p - q 1]}^{T}}{\sqrt{1 + p^{2} + q^{2}}} (3.35)$

is the unit surface normal. For unit vectors N and ρ, three equations can be obtained

$E_{1} = ρ (S_{1} • N) E_{2} = ρ (S_{2} • N) E_{3} = ρ (S_{3} • N) (3.36)$ $E_{1} = ρ (S_{1} • N) E_{2} = ρ (S_{2} • N) E_{3} = ρ (S_{3} • N) (3.36)$

Combining these equations gives

$E = ρ S • N (3.37)$ $E = ρ S • N (3.37)$

where the rows of the matrix S are the source directions S1, S2, S3, and the components of the vector E are the three brightness measurements. Suppose that S is non-singular. It can be shown that

$ρ N = S^{- 1} • E = \frac{1}{[S_{1} • (S_{2} \times S_{3})]} [E_{1} (S_{2} \times S_{3}) + E_{2} (S_{3} \times S_{2}) + E_{3} (S_{1} \times S_{2})] (3.38)$ $ρ N = S^{- 1} • E = \frac{1}{[S_{1} • (S_{2} \times S_{3})]} [E_{1} (S_{2} \times S_{3}) + E_{2} (S_{3} \times S_{2}) + E_{3} (S_{1} \times S_{2})] (3.38)$

The direction of the surface normal is the result of multiplying a constant with a linear combination of the three vectors, each of which is perpendicular to the directions of the two light sources. If multiplying the brightness obtained when the third source is used with each of these vectors, the reflectance factor ρ will be recovered by finding the magnitude of the resulting vector.

Example 3.5 Recover the reflectance factor by three images.

Suppose that a source has been put in three places (–3.4, –0.8, –1.0), (0.0, 0.0, –1.0), (–4.7, –3.9, –1.0), and then three images are captured. Three equations can be established and the surface orientation and reflectance factor can be determined. Figure 3.13(a) shows three reflectance property curves. It is seen from Figure 3.13(b) that when ρ = 0.8, three curves are joined at p = –1.0, q = –1.0.

Figure 3.13: Recover the reflectance factor by three images.

3.2Structure from Motion

The structure of a scene consisting of a number of objects or an object consisting of several parts can be recovered from the motion of objects in a scene or the different motion of object parts.

3.2.1Optical Flow and Motion Field

Motion can be represented by a motion field, which is composed of motion vectors of the image points.

Example 3.6 Computation of a motion field.

Suppose that in a particular moment, one point Pi in an image is mapped from a point Po at the object surface, as shown in Figure 3.14. These two points are connected by the projection equation. Let Po have a velocity Vo relative to the camera. It induces a motion Vi in the corresponding Pi. The two velocities are

$V_{o} = \frac{d r_{o}}{d t} V_{1} = \frac{d r_{1}}{d t} (3.39)$ $V_{o} = \frac{d r_{o}}{d t} V_{1} = \frac{d r_{1}}{d t} (3.39)$

where ro and ri are related by

$\frac{1}{λ} r_{1} = \frac{1}{r_{o} • z} r_{o} (3.40)$ $\frac{1}{λ} r_{1} = \frac{1}{r_{o} • z} r_{o} (3.40)$

where λ is the focal length of the camera and z is the distance (vector) from the center of the camera to the object.

According to visual psychology, when there is relative motion between an observer and an object, optical patterns on the surface of object provide the motion and structure information of the object to the observer. Optical flow is the apparent motion of the optical patterns. Three factors are important in optical flow: Motion or velocity field, which is the necessary condition of optical flow; optical patterns, which can carry information; and a projection from the scene to the image, which make it observable.

Figure 3.14: Object point and image point are connected by a projection equation.

Optical flow and motion fields are closely related but different. When there is optical flow, certain motion must exist. However, when there is some motion, the optical flow may not appear.

Example 3.7 The difference between the motion field and optical flow.

Figure 3.15(a) shows a sphere with the homogeneous reflectance property rotating in front of a camera. The light source is fixed. There are brightness variations in different parts of the surface image; however, the brightness variations are not changed with the rotation of sphere, so the image is not changed in gray-level values with time. In this case, though the motion field is not zero, the optical flow is zero everywhere. In Figure 3.15(b), a fixed sphere is illuminated by moving the light source. The gray-level values in different locations of the image change as the illumination changes according to the source motion. In this case, the optical flow is not zero but the motion field of sphere is zero everywhere. This motion is called the apparent motion.

3.2.2Solution to Optical Flow Constraint Equation

The optical flow constraint equation is also called the image flow constraint equation. Denote f(x, y, t) as the gray-level value of an image point (x, y) at time t, u(x, y) and v(x, y) as the horizontal and vertical velocities of the image point (x, y), respectively. The optical flow constraint equation can be written as

$f_{x} u + f_{y} v + f_{t} = 0 (3.41)$ $f_{x} u + f_{y} v + f_{t} = 0 (3.41)$

where fx, fy, ft are the gray-level gradients along X, Y, T directions, respectively.

Figure 3.15: The difference between a motion field and the optical flow.

Figure 3.16: Values of u and v satisfying eq. (3.41) lie on a line.

In addition, eq. (3.41) can also be written as

$(f_{x}, f_{y}) • (u, v) = - f_{t} (3.42)$ $(f_{x}, f_{y}) • (u, v) = - f_{t} (3.42)$

3.2.2.1Computation of Optical Flow: Rigid Body

Equation (3.41) is a linear constraint equation about velocity components u and v. In the velocity space, as shown in Figure 3.16, values of u and v satisfying eq. (3.41) lie on a line,

$u_{0} = - f_{t} / f_{x} v_{0} = - f_{t} / f_{y} θ = arctan (f_{x} / f_{y}) (3.43)$ $u_{0} = - f_{t} / f_{x} v_{0} = - f_{t} / f_{y} θ = arctan (f_{x} / f_{y}) (3.43)$

All points on this line are the solutions of the optical flow constraint equation. In other words, one equation cannot uniquely determine the two variables u and v.

To determine uniquely the two variables u and v, another constraint should be introduced. When dealing with rigid bodies, it is implied that all neighboring points have the same optical velocity. In other words, the change rate of the optical flow is zero, which are given by

${(\nabla u)}^{2} = {(\frac{\partial u}{\partial x} + \frac{\partial u}{\partial y})}^{2} = 0 (3.44)$ ${(\nabla u)}^{2} = {(\frac{\partial u}{\partial x} + \frac{\partial u}{\partial y})}^{2} = 0 (3.44)$

${(\nabla v)}^{2} = {(\frac{\partial v}{\partial x} + \frac{\partial v}{\partial y})}^{2} = 0 (3.45)$ ${(\nabla v)}^{2} = {(\frac{\partial v}{\partial x} + \frac{\partial v}{\partial y})}^{2} = 0 (3.45)$

Combining these two equations with the optical flow constraint equation, the optical flow can be calculated by solving a minimum problem as

$ε (x, y) = \sum_{x} \sum_{y} {{(f_{x} u + f_{y} v + f_{t})}^{2} + λ^{2} [{(\nabla u)}^{2} + {(\nabla v)}^{2}]} (3.46)$ $ε (x, y) = \sum_{x} \sum_{y} {{(f_{x} u + f_{y} v + f_{t})}^{2} + λ^{2} [{(\nabla u)}^{2} + {(\nabla v)}^{2}]} (3.46)$

In eq. (3.46), the value of λ depends on the noise in the image. When the noise is strong, λ must be larger. To minimize the error in eq. (3.46), calculating the derivatives of u and v, and letting the results equal to zero, yields

$f_{x}^{2} u + f_{x} f_{y} v = - λ^{2} \nabla u - f_{x} f_{t} (3.47)$ $f_{x}^{2} u + f_{x} f_{y} v = - λ^{2} \nabla u - f_{x} f_{t} (3.47)$

$f_{x}^{2} v + f_{x} f_{y} u = - λ^{2} \nabla v - f_{y} f_{t} (3.48)$ $f_{x}^{2} v + f_{x} f_{y} u = - λ^{2} \nabla v - f_{y} f_{t} (3.48)$

The above two equations are also called Euler equations. Let $\bar{u} and \bar{v}$ $\bar{u} and \bar{v}$ be the average values in the neighborhood of u and v, and let $\nabla u = u - \bar{u} and \nabla v = v - \bar{v} .$ $\nabla u = u - \bar{u} and \nabla v = v - \bar{v} .$ Equation (3.47) and eq. (3.48) become

$(f_{x}^{2} + λ^{2}) u + f_{x} f_{y} v = λ^{2} \bar{u} - f_{x} f_{t} (3.49)$ $(f_{x}^{2} + λ^{2}) u + f_{x} f_{y} v = λ^{2} \bar{u} - f_{x} f_{t} (3.49)$

$(f_{y}^{2} + λ^{2}) v + f_{x} f_{y} u = λ^{2} \bar{v} - f_{y} f_{t} . (3.50)$ $(f_{y}^{2} + λ^{2}) v + f_{x} f_{y} u = λ^{2} \bar{v} - f_{y} f_{t} . (3.50)$

The solutions for eq. (3.49) and eq. (3.50) are

$u = \bar{u} - \frac{f_{x} [f_{x} \bar{u} + f_{y} \bar{v} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} (3.51)$ $u = \bar{u} - \frac{f_{x} [f_{x} \bar{u} + f_{y} \bar{v} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} (3.51)$

$v = \bar{v} - \frac{f_{y} [f_{x} \bar{u} + f_{y} \bar{v} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} . (3.52)$ $v = \bar{v} - \frac{f_{y} [f_{x} \bar{u} + f_{y} \bar{v} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} . (3.52)$

Equation (3.51) and eq. (3.52) provide the basis for solving u(x, y) and v(x, y) with an iterative method. In practice, the following relaxation iterative equations are used

$u^{(n + 1)} = {\bar{u}}^{(n)} - \frac{f_{x} [f_{x} {\bar{u}}^{(n)} + f_{y} {\bar{v}}^{(n)} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} (3.53)$ $u^{(n + 1)} = {\bar{u}}^{(n)} - \frac{f_{x} [f_{x} {\bar{u}}^{(n)} + f_{y} {\bar{v}}^{(n)} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} (3.53)$

$v^{(n + 1)} = {\bar{v}}^{(n)} - \frac{f_{y} [f_{x} {\bar{u}}^{(n)} + f_{y} {\bar{v}}^{(n)} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} . (3.54)$ $v^{(n + 1)} = {\bar{v}}^{(n)} - \frac{f_{y} [f_{x} {\bar{u}}^{(n)} + f_{y} {\bar{v}}^{(n)} + f_{t}]}{λ^{2} + f_{x}^{2} + f_{y}^{2}} . (3.54)$

The initial values can be u(0) = 0, v(0) = 0 (a line passing through the origin). Equations (3.53) and (3.54) have a simple geometric explanation: The iterative value at a new point (u, v) equals to the difference of the average value in the neighborhood of this point and an adjust value that is in the direction of the brightness gradient, as shown in Figure 3.17.

Example 3.8 Real example of the optical flow detection.

Some results for a real detection of optical flow are shown in Figure 3.18. Figure 3.18(a) is an image of a soccer ball. Figure 3.18(b) is the image obtained by rotating Figure 3.18(a) around the vertical axis. Figure 3.18(c) is the image obtained by rotating Figure 3.18(a) clockwise around the viewing axis. Figure 3.18(d, e) are the detected optical flow in the above two cases, respectively.

Figure 3.17: Illustration of a geometric explanation.

Figure 3.18: Real example of optical flow detection.

It can be seen from Figure 3.18 that the optical flow has bigger values at the boundaries between the white and black regions as the gray-level values change is acute there, and has smaller values at the inside place of the white and black regions as the gray-level values there show almost no change during the movement.

3.2.2.2Computation of Optical Flow: Smooth Motion

Another method for introducing more constraints is to use the property that the motion is smooth in the field. It is then possible to minimize a measure of the departure from smoothness by

$e_{s} = \iint [(u_{x}^{2} + u_{y}^{2}) + (v_{x}^{2} + v_{y}^{2})] d x d y (3.55)$ $e_{s} = \iint [(u_{x}^{2} + u_{y}^{2}) + (v_{x}^{2} + v_{y}^{2})] d x d y (3.55)$

In addition, the error in the optical flow constraint equation

$e_{c} = \iint {[E_{x} u + E_{y} v + E_{t}]}^{2} d x d y (3.56)$ $e_{c} = \iint {[E_{x} u + E_{y} v + E_{t}]}^{2} d x d y (3.56)$

should also be small. Overall, the function to be minimized is es + λec, where λ is a weight. With strong noise, λ should take a small value.

3.2.2.3Computation of Optical Flow: Gray-Level Break

Consider Figure 3.19(a), where XY is an image plane, I is the gray-level axis, and an object moves along the X direction with a velocity of (u, v). At time t0, the gray-level value at point P0 is I0 and the gray-level value at point Pd is Id. At time t0 + dt, the gray-level value at P0 is moved to Pd and this forms the optical flow. Between P0 and Pd, there is a gray-level break.

Figure 3.19: Gray-level break cases.

Considering Figure 3.19(b), look at the change of gray-level along the path. As the gray-level value at Pd is the gray-level value at P0 plus the gray-level difference between P0 and Pd, it has

$I_{d} = \int_{P_{0}}^{P_{d}} \nabla f • d l + I_{0} (3.57)$ $I_{d} = \int_{P_{0}}^{P_{d}} \nabla f • d l + I_{0} (3.57)$

On the other hand, look at the change of gray level along the time axis. As the gray-level value observed at Pd is changed from Id to I0, it has

$I_{0} = \int_{t_{0}}^{t_{0} + d t} f_{t} d t + I_{d} (3.58)$ $I_{0} = \int_{t_{0}}^{t_{0} + d t} f_{t} d t + I_{d} (3.58)$

The changes of gray-level values in the above two cases are equal, so combining eqs (3.57) and (3.58) gives

$\int_{P_{0}}^{P_{d}} \nabla f • d l = - \int_{t_{0}}^{t_{0} + d t} f_{t} d t (3.59)$ $\int_{P_{0}}^{P_{d}} \nabla f • d l = - \int_{t_{0}}^{t_{0} + d t} f_{t} d t (3.59)$

Substituting dl = [u v]Tdt into eq. (3.59) yields

$f_{x} u + f_{y} v + f_{t} = 0 (3.60)$ $f_{x} u + f_{y} v + f_{t} = 0 (3.60)$

Now it is clear that the optical flow constraint equation can be used when there is a gray-level break.

3.2.3Optical Flow and Surface Orientation

Consider an observer-centered coordinate system XYZ (the observer is located at the origin), and suppose that the observer has a spherical retina, so the world can be projected onto a unit image sphere. Any point in this image sphere can also be represented by a longitude ϕ, a latitude θ and a distance from the origin r.

Figure 3.20: Spherical and Cartesian coordinate systems.

The above two coordinate systems are depicted in Figure 3.20, and can be interchanged.

$x = r \sin θ \cos ϕ (3.61)$ $x = r \sin θ \cos ϕ (3.61)$

$y = r \sin θ \sin ϕ (3.62)$ $y = r \sin θ \sin ϕ (3.62)$

$z = r \cos θ (3.63)$ $z = r \cos θ (3.63)$

$r = \sqrt{x^{2} + y^{2} + z^{2}} (3.64)$ $r = \sqrt{x^{2} + y^{2} + z^{2}} (3.64)$

$θ = \arccos (z / r) (3.65)$ $θ = \arccos (z / r) (3.65)$

$φ = \arccos (y / x) (3.66)$ $φ = \arccos (y / x) (3.66)$

The optical flow of an object point can be determined as follows. Let the velocity of the point be (u, v, w) = (dx/dt, dy/dt, dz/dt), and the angular velocities of the image point in ϕ and θ (directions are

$δ = \frac{v \cos ϕ - u \sin ϕ}{r \sin θ} (3.67)$ $δ = \frac{v \cos ϕ - u \sin ϕ}{r \sin θ} (3.67)$

$ε = \frac{(u r \sin θ \cos ϕ + v r \sin θ \sin ϕ + w r \cos θ) \cos θ - r w}{r^{2} \sin θ} (3.68)$ $ε = \frac{(u r \sin θ \cos ϕ + v r \sin θ \sin ϕ + w r \cos θ) \cos θ - r w}{r^{2} \sin θ} (3.68)$

Equations (3.67) and (3.68) are the general expressions for the optical flow in ϕ and θ directions. Consider a simple example. Suppose that the object is stationary and the observer is traveling along the Z-axis with a speed S. In this case, u = 0, v = 0, and w = −S. Taking them into eq. (3.67) and eq. (3.68) yields

$δ = 0 (3.69)$ $δ = 0 (3.69)$

$ε = S \sin θ / r (3.70)$ $ε = S \sin θ / r (3.70)$

Based on eq. (3.69) and eq. (3.70), the surface orientation can be determined. Suppose that R is a point in the patch on the surface and an observer located at O is looking at this patch along the viewing line OR as in Figure 3.21(a). Let the normal vector of the patch be N . N can be decomposed into two orthogonal directions. One is in the ZR plane and has an angle σ with OR as in Figure 3.21(b). Another is in the plane perpendicular to the ZR plane (so is parallel to the XY plane) and has an angle τ with OR′ (see Figure 3.21(c), the Z-axis is pointed outward from paper).

Figure 3.21: An example of surface orientation determination.

In Figure 3.21(b), ϕ is a constant, while in Figure 3.21(c), θ is a constant. Consider σ in the ZR plane, as shown in Figure 3.22(a). If θ is given an infinitesimal increment Δθ, the change of r is Δr. Drawing an auxiliary line ρ, from one side, it gets ρ/r = tan(Δθ) ≈ Δθ, from other side, it has ρ/Δr = tan σ. Combining them yields

$r Δ θ = Δ r tan σ (3.71)$ $r Δ θ = Δ r tan σ (3.71)$

Similarly, consider τ in the plane perpendicular to the ZR plane, as shown in Figure 3.22(b). If giving ϕ an infinitesimal increment Δϕ, the change of r is Δr. Drawing an auxiliary line p, from one side, it has ρ/r = tan(Δϕ) ≈ Δϕ, and from the other side, it has ρ/Δr = tan τ. Combining them yields

$r Δ ϕ = Δ r \tan τ (3.72)$ $r Δ ϕ = Δ r \tan τ (3.72)$

Taking the limit values of eq. (3.71) and eq. (3.72) yields

$\cot σ = [\frac{1}{r}] \frac{\partial r}{\partial θ} (3.73)$ $\cot σ = [\frac{1}{r}] \frac{\partial r}{\partial θ} (3.73)$

$\cot τ = [\frac{1}{r}] \frac{\partial r}{\partial ϕ} (3.74)$ $\cot τ = [\frac{1}{r}] \frac{\partial r}{\partial ϕ} (3.74)$

Figure 3.22: The process used to determine surface orientation.

in which r can be determined from eq. (3.64). Since ε is the function of both ϕ and θ, eq. (3.70) can be written as

$r = \frac{S \sin θ}{ε (ϕ, θ)} (3.75)$ $r = \frac{S \sin θ}{ε (ϕ, θ)} (3.75)$

Taking partial differentiations of ϕ and θ yields

$\frac{\partial r}{\partial φ} = S \sin θ \frac{- 1 \partial ε}{ε^{2} \partial φ} (3.76)$ $\frac{\partial r}{\partial φ} = S \sin θ \frac{- 1 \partial ε}{ε^{2} \partial φ} (3.76)$

$\frac{\partial r}{\partial θ} = S (\frac{\cos θ}{ε} - \frac{\sin θ}{ε^{2}} \frac{\partial ε}{\partial θ}) (3.77)$ $\frac{\partial r}{\partial θ} = S (\frac{\cos θ}{ε} - \frac{\sin θ}{ε^{2}} \frac{\partial ε}{\partial θ}) (3.77)$

Substituting Eqs (3.65) and (3.66) into eqs (3.73) and (3.74) yields

$σ = arccot [\cot θ - \frac{\partial (\ln ε)}{\partial θ}] (3.78)$ $σ = arccot [\cot θ - \frac{\partial (\ln ε)}{\partial θ}] (3.78)$

$τ = arccot [\frac{\partial (\ln ε)}{\partial ϕ}] (3.79)$ $τ = arccot [\frac{\partial (\ln ε)}{\partial ϕ}] (3.79)$

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 3.2 Structure from Motion

Create new playlist

Sign In

Sign Up

Table of Contents for
3.2 Structure from Motion