Chapter Thirteen

Inferring Transforms

May-June 1999


In simple 2D texture mapping, you take a 2D image and render it on the screen after some transformation or distortion. To accomplish this, you will need to take each [X, Y] location on the screen and calculate a [U, V] texture coordinate to place there. A particularly common transformation is

$$U = \frac{aX + bY + c}{gX + hY + j}, \qquad V = \frac{dX + eY + f}{gX + hY + j}$$

By picking the proper values for the coefficients a … j, we can fly the 2D texture around to an arbitrary position, orientation, and perspective projection on the screen. One can, in fact, generate the coefficients by a concatenation of 3D rotation, translation, scale, and perspective matrices, so we, of course, prefer the homogeneous matrix formulation of this transformation:

$$\begin{bmatrix} X & Y & 1 \end{bmatrix}
\begin{bmatrix} a & d & g \\ b & e & h \\ c & f & j \end{bmatrix}
= \begin{bmatrix} u & v & w \end{bmatrix},
\qquad
\begin{bmatrix} U & V \end{bmatrix} = \begin{bmatrix} u/w & v/w \end{bmatrix}$$

In this chapter, though, I'm going to talk about a more direct approach to finding a … j. It turns out that the 2D-to-2D mapping is completely specified if you give four arbitrary points in screen space and the four arbitrary points in texture space they must map to. The only restriction is that no three of the input or output points may be collinear. This method of transformation specification is useful, for example, in taking flat objects digitized in perspective and processing them into orthographic views.

Our Goal

Let's make the problem explicit by giving names to some quantities. We are given four 2D screen coordinates $s_i = [X_i \; Y_i \; 1]$ and four 2D texture coordinates $t_i = [U_i \; V_i \; 1]$, and we want to find the 3 × 3 homogeneous transformation Mst that maps one to the other so that

$$s_i\,M_{st} = w_i\,t_i \qquad (13.1)$$

See Figure 13.1. Note that we are not given the wi values. Their participation in Equation (13.1) acknowledges the fact that even though the original input and output points are nonhomogeneous (their third component is 1), the output of the matrix multiplication will be homogeneous. We will have to solve for the w values as a side effect of solving for the elements of Mst.

Figure 13.1 Desired transform
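As a concrete reminder of how such a matrix gets used once we have it, here is a minimal sketch (my own illustration, not from the original text) that pushes one screen point through a 3 × 3 matrix M, stored as M[row][column] in the same layout as the equation above, and does the homogeneous divide:

// Map a screen point (X, Y) to a texture point (U, V) using a
// 3x3 homogeneous matrix M laid out as [a d g; b e h; c f j].
void map_point(const double M[3][3], double X, double Y,
               double *U, double *V)
{
    // [X Y 1] * M = [u v w]
    double u = X*M[0][0] + Y*M[1][0] + M[2][0];
    double v = X*M[0][1] + Y*M[1][1] + M[2][1];
    double w = X*M[0][2] + Y*M[1][2] + M[2][2];
    *U = u / w;   // homogeneous divide
    *V = v / w;
}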

The Conventional Solution

The conventional way to solve this goes as follows. First, using the names a … j for the elements of Mst, we can rewrite Equation (13.1) explicitly as

$$\begin{bmatrix} X_i & Y_i & 1 \end{bmatrix}
\begin{bmatrix} a & d & g \\ b & e & h \\ c & f & j \end{bmatrix}
= w_i \begin{bmatrix} U_i & V_i & 1 \end{bmatrix}$$

Multiplying out and equating each component gives us three equations:

$$\begin{aligned}
aX_i + bY_i + c &= w_i U_i \\
dX_i + eY_i + f &= w_i V_i \\
gX_i + hY_i + j &= w_i
\end{aligned}$$

Plug the last equation for wi into the first two and move everything to the left of the equal sign:

$$\begin{aligned}
aX_i + bY_i + c - gX_iU_i - hY_iU_i - jU_i &= 0 \\
dX_i + eY_i + f - gX_iV_i - hY_iV_i - jV_i &= 0
\end{aligned}$$

Write this as yet another matrix equation in terms of what we are solving for, a … j:

$$\begin{bmatrix}
X_i & Y_i & 1 & 0 & 0 & 0 & -X_iU_i & -Y_iU_i & -U_i \\
0 & 0 & 0 & X_i & Y_i & 1 & -X_iV_i & -Y_iV_i & -V_i
\end{bmatrix}
\begin{bmatrix} a \\ b \\ c \\ d \\ e \\ f \\ g \\ h \\ j \end{bmatrix}
= \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$

Each input point gives us two more 9-element rows; four points give us an 8 × 9 matrix. Since this is a homogeneous system, that's all we need to solve for the nine values a … j (with an arbitrary global scale factor). One way to calculate each of these nine values is to find the determinant of the 8 × 8 matrix formed by deleting the matching column of the 8 × 9 matrix, so … nine determinants of 8 × 8 matrices. This is doable but obnoxious. Looking at all those lovely zeros and ones on the left makes us suspect that there is a better way.
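Just to make the shape of that system concrete, here is a small sketch (my own addition) that fills in the 8 × 9 coefficient matrix from the four point pairs; solving it by those nine determinants is exactly the work we would like to avoid:

// Build the 8x9 homogeneous system A [a..j]^T = 0 from four
// screen points (X[i], Y[i]) and four texture points (U[i], V[i]).
void build_system(const double X[4], const double Y[4],
                  const double U[4], const double V[4],
                  double A[8][9])
{
    for (int i = 0; i < 4; i++) {
        double row0[9] = { X[i], Y[i], 1, 0, 0, 0,
                           -X[i]*U[i], -Y[i]*U[i], -U[i] };
        double row1[9] = { 0, 0, 0, X[i], Y[i], 1,
                           -X[i]*V[i], -Y[i]*V[i], -V[i] };
        for (int k = 0; k < 9; k++) {
            A[2*i][k]   = row0[k];
            A[2*i+1][k] = row1[k];
        }
    }
}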

Heckbert's Improvement

In his 1989 master's thesis, Paul Heckbert1 made a great leap by splitting the transformation into two separate matrices. He first used one matrix to map the input points to a canonical unit square (with vertices [0, 0], [1, 0], [1, 1], [0, 1]) and then mapped that square into the output points with another matrix. See Figure 13.2.

Figure 13.2 Two stages of Heckbert decomposition

Each of these matrices will be individually easier to calculate than the complete transformation since the arithmetic is so much simpler. In fact, the arithmetic turns out to be simplest for the second of these transformations, so we will solve that one explicitly. Naming points in the unit-square space q, we want to find Mqt in the equation

$$q\,M_{qt} = w_t\,t$$

Let's explicitly write out all components for all four input/output (q/t) point pairs. I'll recycle the names a … j and w0 … w3 for use in this subcomputation, so be aware that their values are different from those I used in the “conventional” solution:

$$\begin{bmatrix} 0 & 0 & 1 \\ 1 & 0 & 1 \\ 1 & 1 & 1 \\ 0 & 1 & 1 \end{bmatrix}
\begin{bmatrix} a & d & g \\ b & e & h \\ c & f & j \end{bmatrix}
= \begin{bmatrix}
w_0U_0 & w_0V_0 & w_0 \\
w_1U_1 & w_1V_1 & w_1 \\
w_2U_2 & w_2V_2 & w_2 \\
w_3U_3 & w_3V_3 & w_3
\end{bmatrix}$$

This generates 12 equations:

$$\begin{aligned}
c &= w_0U_0, &\qquad f &= w_0V_0, &\qquad j &= w_0 \\
a + c &= w_1U_1, &\qquad d + f &= w_1V_1, &\qquad g + j &= w_1 \\
a + b + c &= w_2U_2, &\qquad d + e + f &= w_2V_2, &\qquad g + h + j &= w_2 \\
b + c &= w_3U_3, &\qquad e + f &= w_3V_3, &\qquad h + j &= w_3
\end{aligned}$$

Substituting the right column of equations into the left two columns gives us

$$\begin{aligned}
c &= jU_0, &\qquad f &= jV_0 \\
a + c &= (g+j)U_1, &\qquad d + f &= (g+j)V_1 \\
a + b + c &= (g+h+j)U_2, &\qquad d + e + f &= (g+h+j)V_2 \\
b + c &= (h+j)U_3, &\qquad e + f &= (h+j)V_3
\end{aligned} \qquad (13.2)$$

Then substituting the equations in rows 1, 2, and 4 into the third row gives the two equations

$$\begin{aligned}
(g+j)U_1 + (h+j)U_3 - jU_0 &= (g+h+j)U_2 \\
(g+j)V_1 + (h+j)V_3 - jV_0 &= (g+h+j)V_2
\end{aligned}$$

We juggle this into

$$\begin{aligned}
g(U_1 - U_2) + h(U_3 - U_2) &= j(U_0 - U_1 + U_2 - U_3) \\
g(V_1 - V_2) + h(V_3 - V_2) &= j(V_0 - V_1 + V_2 - V_3)
\end{aligned} \qquad (13.3)$$

Heckbert's original solution did the “without loss of generality” trick and assumed that j = 1. This works since we can only expect to solve for the matrix elements up to some global scalar multiple, and since j cannot be zero if point 0 is not at infinity. He then solved for g and h and got something that looked like

$$g = \frac{\det[\mathrm{stuff}_0]}{\det[\mathrm{stuff}_2]}, \qquad h = \frac{\det[\mathrm{stuff}_1]}{\det[\mathrm{stuff}_2]}$$

I don't really like the j = 1 assumption though since it leads to these nasty divisions. And we really don't need it if we think homogeneously. We can instead write Equation (13.3) as

$$\begin{bmatrix} g & h & j \end{bmatrix}
\begin{bmatrix}
U_1 - U_2 & V_1 - V_2 \\
U_3 - U_2 & V_3 - V_2 \\
-U_0 + U_1 - U_2 + U_3 & -V_0 + V_1 - V_2 + V_3
\end{bmatrix}
= \begin{bmatrix} 0 & 0 \end{bmatrix}$$

and just say that the vector [g h j] is the cross product of the two columns of the above matrix. This gives us, after a little simplification,

$$\begin{aligned}
g &= (U_3 - U_2)(V_1 - V_0) - (U_1 - U_0)(V_3 - V_2) \\
h &= (U_3 - U_0)(V_1 - V_2) - (U_1 - U_2)(V_3 - V_0) \\
j &= (U_1 - U_2)(V_3 - V_2) - (U_3 - U_2)(V_1 - V_2)
\end{aligned}$$

Once we have these, it's a simple matter to get a through f using Equation (13.2):

$$\begin{aligned}
c &= jU_0, &\qquad f &= jV_0 \\
a &= (g+j)U_1 - c, &\qquad d &= (g+j)V_1 - f \\
b &= (h+j)U_3 - c, &\qquad e &= (h+j)V_3 - f
\end{aligned}$$
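In code, the whole square-to-quadrilateral matrix drops out of just these few products. The following sketch is my own transcription of the formulas above (with the result stored as Mqt[row][column] in the [a d g; b e h; c f j] layout), not code from the original text:

// Heckbert-style square-to-quadrilateral matrix, divisions removed:
// maps unit-square corners [0,0],[1,0],[1,1],[0,1] to (U[i], V[i]).
void square_to_quad(const double U[4], const double V[4],
                    double Mqt[3][3])
{
    double g = (U[3]-U[2])*(V[1]-V[0]) - (U[1]-U[0])*(V[3]-V[2]);
    double h = (U[3]-U[0])*(V[1]-V[2]) - (U[1]-U[2])*(V[3]-V[0]);
    double j = (U[1]-U[2])*(V[3]-V[2]) - (U[3]-U[2])*(V[1]-V[2]);

    double c = j*U[0],          f = j*V[0];
    double a = (g+j)*U[1] - c,  d = (g+j)*V[1] - f;
    double b = (h+j)*U[3] - c,  e = (h+j)*V[3] - f;

    Mqt[0][0] = a;  Mqt[0][1] = d;  Mqt[0][2] = g;
    Mqt[1][0] = b;  Mqt[1][1] = e;  Mqt[1][2] = h;
    Mqt[2][0] = c;  Mqt[2][1] = f;  Mqt[2][2] = j;
}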

Now that we have Mqt, the same simple arithmetic will give us the mapping from the canonical square to the input points. This will be the matrix that satisfies:

$$q\,M_{qs} = w_s\,s$$

Invert that (or, better, take its adjoint) and multiply to get our desired net matrix:

$$M_{st} = M_{qs}^{*}\,M_{qt}$$

Olynyk's Improvement

An even better solution recently came from my colleague Kirk Olynyk here at Microsoft Research. His basic idea was to use barycentric coordinates, rather than a unit square, as the intermediate system. Barycentric coordinates represent an arbitrary point in the plane as the weighted sum of three basis points, for example, the first three of our output points. This would represent an arbitrary texture coordinate t as

$$\tau_0 t_0 + \tau_1 t_1 + \tau_2 t_2 = t$$

with the constraint that the barycentric coordinates sum to one: $\tau_0 + \tau_1 + \tau_2 = 1$. Similarly, we can represent an arbitrary input point in barycentric coordinates as

$$\sigma_0 s_0 + \sigma_1 s_1 + \sigma_2 s_2 = s$$

Kirk then related the two barycentric coordinate systems by coming up with a mapping from $[\sigma_0 \; \sigma_1 \; \sigma_2]$ to $[\tau_0 \; \tau_1 \; \tau_2]$ that has the property that the barycentric coordinates of all four input points map to the barycentric coordinates of their respective output points. That is,

$$\begin{aligned}
[1 \; 0 \; 0] &\to [1 \; 0 \; 0] \\
[0 \; 1 \; 0] &\to [0 \; 1 \; 0] \\
[0 \; 0 \; 1] &\to [0 \; 0 \; 1] \\
[\tilde\sigma_0 \; \tilde\sigma_1 \; \tilde\sigma_2] &\to [\tilde\tau_0 \; \tilde\tau_1 \; \tilde\tau_2]
\end{aligned}$$

(The two points in the fourth mapping above are the barycentric coordinates of the fourth input/output point pair.) To get the desired mapping, simply multiply each $\sigma_i$ component of an arbitrary input point by $\tilde\tau_i/\tilde\sigma_i$. Then renormalize to a valid barycentric coordinate by dividing by the sum of the components. You can see that this algorithm leaves the coordinates of the first three points intact while properly changing those of the fourth point.

So how do we get the barycentric coordinates of the fourth input/output pair? Start with the matrix formulation of the barycentric coordinates of an arbitrary point. Here's the one for output points:

$$\begin{bmatrix} \tau_0 & \tau_1 & \tau_2 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix} = t$$

Note that if $\tau_0 + \tau_1 + \tau_2 = 1$, this guarantees that the w (homogeneous) component of t will be 1 also. Anyway, plugging in t = t3 and moving the matrix over the equal sign gives us

$$\begin{bmatrix} \tilde\tau_0 & \tilde\tau_1 & \tilde\tau_2 \end{bmatrix}
= \begin{bmatrix} U_3 & V_3 & 1 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix}^{-1} \qquad (13.4)$$

This is nice, but it's not ideal. A full-on matrix inversion requires division by the determinant of the matrix. Likewise, the normalization to unit-sum barycentric coordinates requires division by the sum of the coordinates. Kirk dislikes divisions even more than I do, so in his implementation he removed them by homogeneously scaling them out symbolically after the fact. In thinking about this solution, though, I realized that there is a different way of deriving it that better shows its relation to the Heckbert solution.
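As a minimal sketch of that division-free idea (my own illustration, not Kirk's actual implementation), the un-normalized barycentric coordinates of the fourth texture point can be taken straight from Equation (13.4) with the inverse replaced by the adjoint, so no division ever happens:

// Un-normalized barycentric coordinates of (U[3], V[3]) with respect
// to the triangle (U[0],V[0]), (U[1],V[1]), (U[2],V[2]).
// This is [U3 V3 1] times the adjoint of the 3x3 point matrix, so the
// three results are all scaled by its determinant: no divisions needed.
void bary_of_fourth(const double U[4], const double V[4], double tau[3])
{
    tau[0] = U[3]*(V[1]-V[2]) + V[3]*(U[2]-U[1]) + (U[1]*V[2] - U[2]*V[1]);
    tau[1] = U[3]*(V[2]-V[0]) + V[3]*(U[0]-U[2]) + (U[2]*V[0] - U[0]*V[2]);
    tau[2] = U[3]*(V[0]-V[1]) + V[3]*(U[1]-U[0]) + (U[0]*V[1] - U[1]*V[0]);
}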

Another Interpretation

Heckbert used a unit square as an intermediate coordinate system, and Olynyk used barycentric coordinates. Let's look at this problem anew and try to pick an intermediate coordinate system (call it b) that will minimize our arithmetic as much as possible when we solve for Mbt in the equation

$$b\,M_{bt} = w\,t$$

We want four points bi in our new coordinate system that have as many zeros as possible as components. The four simplest (homogeneous) points I can imagine are [1 0 0], [0 1 0], [0 0 1], and [1 1 1]. It doesn't matter that two of these are points at infinity. All that matters is that no three of these points are collinear. I show this new two-stage transformation in Figure 13.3, although it might be a bit confusing (and unnecessary) to try to make a closed quadrilateral out of the four b points.

Figure 13.3 Two stages of new decomposition

The b coordinate system is like “homogeneous barycentric coordinates” (with the restriction relaxed that the sum of the components equals one) but is further scaled so that the components of the fourth point are equal.

So let's solve for Mbt. Being extremely ecological, I will again recycle the names a … j for the matrix elements and w0 … w3 for the (as yet) unknown homogeneous factors. Our four points generate

$$\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 1 & 1 \end{bmatrix}
\begin{bmatrix} a & d & g \\ b & e & h \\ c & f & j \end{bmatrix}
= \begin{bmatrix}
w_0U_0 & w_0V_0 & w_0 \\
w_1U_1 & w_1V_1 & w_1 \\
w_2U_2 & w_2V_2 & w_2 \\
w_3U_3 & w_3V_3 & w_3
\end{bmatrix} \qquad (13.5)$$

We could solve this by doing the same sort of substitutions that we did in the Heckbert solution, but there's an easier way. Looking at the top three rows on each side, we immediately realize that we have Mbt already staring us in the face; it's just the first three rows of Equation (13.5). Let's write it in factored form:

$$M_{bt} = \begin{bmatrix} a & d & g \\ b & e & h \\ c & f & j \end{bmatrix}
= \begin{bmatrix} w_0 & 0 & 0 \\ 0 & w_1 & 0 \\ 0 & 0 & w_2 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix} \qquad (13.6)$$

We only need to find w0, w1, and w2, and we are home free. To get these, we take the bottom row of Equation (13.5):

$$\begin{bmatrix} 1 & 1 & 1 \end{bmatrix} M_{bt} = \begin{bmatrix} w_3U_3 & w_3V_3 & w_3 \end{bmatrix}$$

and combine it with Equation (13.6) and write

$$\begin{bmatrix} 1 & 1 & 1 \end{bmatrix}
\begin{bmatrix} w_0 & 0 & 0 \\ 0 & w_1 & 0 \\ 0 & 0 & w_2 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix}
= w_3 \begin{bmatrix} U_3 & V_3 & 1 \end{bmatrix}$$

We can hop the UV matrix over the equal sign by inverting it to get

$$\begin{bmatrix} w_0 & w_1 & w_2 \end{bmatrix}
= w_3 \begin{bmatrix} U_3 & V_3 & 1 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix}^{-1} \qquad (13.7)$$

Comparing Equation (13.7) with Equation (13.4), we can see the relationship

$$\begin{bmatrix} w_0 & w_1 & w_2 \end{bmatrix} = w_3 \begin{bmatrix} \tilde\tau_0 & \tilde\tau_1 & \tilde\tau_2 \end{bmatrix}$$

In other words, our w0 … w2 values are just a homogeneous scaling of the pure barycentric coordinates of the fourth point. Note also that the right-hand column of Equation (13.5) tells us that

$$w_0 + w_1 + w_2 = w_3 \qquad (13.8)$$

Remember that we can determine the w's only up to a homogeneous scale factor, so let's pick something nice for, say, w3. How about using the determinant of the matrix we are inverting? This can never be zero if the three points t0, t1, and t2 aren't collinear. Letting w3 equal this matrix determinant would pleasantly turn the matrix inverse into a matrix adjoint and we have

$$\begin{bmatrix} w_0 & w_1 & w_2 \end{bmatrix}
= \begin{bmatrix} U_3 & V_3 & 1 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix}^{*} \qquad (13.9)$$

So … that's half of our final answer. Now for the other half, the matrix Msb. We get the inverse of Msb by doing the same calculation using the four input coordinate points. First calculate the three homogeneous scale factors, which I'll call z, by analogy with Equation (13.9):

$$\begin{bmatrix} z_0 & z_1 & z_2 \end{bmatrix}
= \begin{bmatrix} X_3 & Y_3 & 1 \end{bmatrix}
\begin{bmatrix} X_0 & Y_0 & 1 \\ X_1 & Y_1 & 1 \\ X_2 & Y_2 & 1 \end{bmatrix}^{*} \qquad (13.10)$$

And then, by analogy with Equation (13.6):

$$M_{bs} = \begin{bmatrix} z_0 & 0 & 0 \\ 0 & z_1 & 0 \\ 0 & 0 & z_2 \end{bmatrix}
\begin{bmatrix} X_0 & Y_0 & 1 \\ X_1 & Y_1 & 1 \\ X_2 & Y_2 & 1 \end{bmatrix}$$

To arrive at our ultimate goal of Mst, we need the adjoint of Mbs. One very nice feature of the factored form of the matrix that I've written here is that it's arithmetically simpler to adjointify than a general matrix. I'll write it out explicitly:

$$M_{bs}^{*} = \begin{bmatrix}
Y_1 - Y_2 & Y_2 - Y_0 & Y_0 - Y_1 \\
X_2 - X_1 & X_0 - X_2 & X_1 - X_0 \\
X_1Y_2 - X_2Y_1 & X_2Y_0 - X_0Y_2 & X_0Y_1 - X_1Y_0
\end{bmatrix}
\begin{bmatrix} z_1z_2 & 0 & 0 \\ 0 & z_0z_2 & 0 \\ 0 & 0 & z_0z_1 \end{bmatrix} \qquad (13.11)$$

Put 'em all together and we get our gigantic punch line:

$$M_{st} = M_{bs}^{*}\,M_{bt}$$

Now it's time to name some of the matrices I've been laboriously writing out for so long (actually, I've been cutting and pasting). Up to this point, I've felt that explicitly writing them out has been more informative since it allows comparisons with other parts of the equation. I will name the following matrices:

$$T \equiv \begin{bmatrix} U_0 & V_0 & 1 \\ U_1 & V_1 & 1 \\ U_2 & V_2 & 1 \end{bmatrix}
\qquad
S \equiv \begin{bmatrix} X_0 & Y_0 & 1 \\ X_1 & Y_1 & 1 \\ X_2 & Y_2 & 1 \end{bmatrix}$$

Finally, let's rewrite our final answer slightly to give a nice comparison of this technique with (a homogenized version of) Kirk's original solution:

$$M_{st} = S^{*}
\begin{bmatrix} w_0z_1z_2 & 0 & 0 \\ 0 & z_0w_1z_2 & 0 \\ 0 & 0 & z_0z_1w_2 \end{bmatrix}
T \qquad (13.12)$$

The matrix S* takes us from screen space to (homogeneous) barycentric coordinates. The w and z diagonal matrices combine to give one diagonal matrix whose elements are just homogeneous scalings of the $\tilde\tau_i/\tilde\sigma_i$ quantities that Kirk used to go from the screen barycentric system to the texture barycentric system. Matrix T then takes us to texture coordinates. This form is almost the simplest we can do arithmetically. But let's not give up yet.

Geometric Interpretations

As Equations (13.9) and (13.10) indicate, we need the adjoints of matrices T and S to calculate the w and z values to plug into Equation (13.12). The adjoint S* also shows up in Equation (13.12), but we only use T* to calculate the w's. Let's look at this w calculation, then, to see if there's some way we can save ourselves some work. While this investigation is initially motivated by performance avarice, it will actually point out some geometric relationships that I think are the most interesting results of this whole problem. In other words, greed is good.

First, let's use the definition in Equation (13.9) to explicitly write out the calculation of w0:

$$w_0 = U_3(V_1 - V_2) + V_3(U_2 - U_1) + (U_1V_2 - U_2V_1) \qquad (13.13)$$

What does this mean? Well, each row of matrix T is a point, one of t0, t1, or t2. According to the definition of the adjoint, each column of T* is the cross product of two rows (points) of T. This means that each column of T* is a homogeneous line. For example, column 0 of T* represents the line connecting points t1 and t2. The process of multiplying an arbitrary point t by the matrix T* just takes its dot product with the three lines t1t2, t2t0, and t0t1. In other words, it measures the distance from the point to the three lines. That's the essential meaning of barycentric coordinates. In any event, we can now rewrite Equation (13.13) as

$$w_0 = t_3 \cdot t_1 \times t_2$$

This common algebraic expression has a standard geometric interpretation. Thinking of t1, t2, and t3 as 3D vectors, it's the volume of the parallelepiped they define. Thinking, however, of the three points as homogeneous 2D vectors, there's another interpretation: w0 equals twice the area of the triangle t3t2t1. We can verify this algebraically by comparing the definition of w0 from Equation (13.13) with twice the integral under the three triangle edges:

$$w_0 = (U_1 - U_2)(V_1 + V_2) + (U_2 - U_3)(V_2 + V_3) + (U_3 - U_1)(V_3 + V_1)$$

But there's another way to calculate triangle areas: as (half) the length of a cross product. We can rewrite our expression for w0 so that it looks like a cross product of two vectors along the edges of the triangle. For example, taking the third component of the cross product of the vectors (t1 − t3) and (t2 − t3) gives us

$$w_0 = (U_1 - U_3)(V_2 - V_3) - (U_2 - U_3)(V_1 - V_3)$$

Now let's calculate w1 (which is twice the area of triangle t3t0t2) and w2 (twice the area of triangle t3t1t0). We only need one more vector difference: (t0 − t3). This gives the simplest way I've found to calculate all the w's:

$$\begin{aligned}
w_0 &= (U_1 - U_3)(V_2 - V_3) - (U_2 - U_3)(V_1 - V_3) \\
w_1 &= (U_2 - U_3)(V_0 - V_3) - (U_0 - U_3)(V_2 - V_3) \\
w_2 &= (U_0 - U_3)(V_1 - V_3) - (U_1 - U_3)(V_0 - V_3)
\end{aligned} \qquad (13.14)$$

Again, you can verify algebraically that these expressions equal those from Equation (13.9), but the geometric arguments make it seem a bit less magical. Also, you can look at Figure 13.4 for some more geometric inspiration.

Figure 13.4 Geometric interpretation of w's

One More Thin Little Mint

There's one last little bit of juice we can squeeze out of this. This, again, came from Kirk, but he found it purely algebraically. I'm going to motivate it by an even more interesting geometric observation. It turns out that there is a magical relationship between the zi values and the bottom row of S*. So we have to switch gears and start talking, not about U, V, w but about X, Y, z. I'll write the analog to Equation (13.14) and, for good measure, throw in a formula for z3, which is, by analogy with Equation (13.8), the sum of the first three:

$$\begin{aligned}
z_0 &= (X_1 - X_3)(Y_2 - Y_3) - (X_2 - X_3)(Y_1 - Y_3) \\
z_1 &= (X_2 - X_3)(Y_0 - Y_3) - (X_0 - X_3)(Y_2 - Y_3) \\
z_2 &= (X_0 - X_3)(Y_1 - Y_3) - (X_1 - X_3)(Y_0 - Y_3) \\
z_3 &= (X_1 - X_0)(Y_2 - Y_0) - (X_2 - X_0)(Y_1 - Y_0)
\end{aligned}$$

We can now see the missing link: the geometric interpretation of the value z3. It's the area of triangle s0s1s2. To see this, look at Figure 13.5 and note that z0 plus z2 equals the area of the whole quadrilateral. Now look at z1. Since its edge vectors sweep clockwise, that area is negative and subtracts from the quadrilateral to get triangle s0s1s2.

Figure 13.5 Geometric interpretation of z's

One side note: the area of the whole quadrilateral is

$$z_0 + z_2 = z_3 - z_1 = (X_1 - X_3)(Y_2 - Y_0) - (X_2 - X_0)(Y_1 - Y_3)$$

This reminds us that the area of the quadrilateral is (twice) the cross product of its diagonals (with appropriate care in algebraic sign).

Now let's take a look at the bottom row of S* (inside Equation (13.11)). Again, these look like areas of some sort. In fact, they are the areas of the three triangles connecting the three edges of triangle s0s1s2 with the origin. (If this isn't immediately clear, try temporarily imagining point 3 at the origin.) The sum of these areas also equals z3, so we have the following identity, which you can also verify algebraically:

$$S^{*}_{20} + S^{*}_{21} + S^{*}_{22} = z_3$$

This means that

$$S^{*}_{20} + S^{*}_{21} + S^{*}_{22} = z_0 + z_1 + z_2$$

So we can calculate one of the z's in terms of the others. This can, for example, turn the calculation

$$z_0 = X_3 S^{*}_{00} + Y_3 S^{*}_{10} + S^{*}_{20}$$

into

$$z_0 = S^{*}_{20} + S^{*}_{21} + S^{*}_{22} - z_1 - z_2$$

It turns two multiplications into two additions. This may, or may not, be a particularly big deal with current processors.

Code

// Calculate elements of matrix Mst
// from the four coordinate pairs
// (Ui, Vi) and (Xi, Yi)

U03 = U0-U3;  V03 = V0-V3;
U13 = U1-U3;  V13 = V1-V3;
U23 = U2-U3;  V23 = V2-V3;

// w's from Equation (13.14)
w0 = U13*V23 - U23*V13;
w1 = U23*V03 - U03*V23;
w2 = U03*V13 - U13*V03;

// Adjoint of S, as in Equation (13.11)
Sa00 = Y1-Y2;  Sa01 = Y2-Y0;  Sa02 = Y0-Y1;
Sa10 = X2-X1;  Sa11 = X0-X2;  Sa12 = X1-X0;
Sa20 = X1*Y2 - X2*Y1;
Sa21 = X2*Y0 - X0*Y2;
Sa22 = X0*Y1 - X1*Y0;

// z's from Equation (13.10); z0 uses the identity that
// trades two multiplications for two additions
z1 = X3*Sa01 + Y3*Sa11 + Sa21;
z2 = X3*Sa02 + Y3*Sa12 + Sa22;
z0 = Sa20+Sa21+Sa22 - z1 - z2;

// Diagonal elements of Equation (13.12)
d0 = w0*z1*z2;
d1 = w1*z2*z0;
d2 = w2*z0*z1;

// Scale the columns of the adjoint by the diagonal
Sa00 *= d0;  Sa10 *= d0;  Sa20 *= d0;
Sa01 *= d1;  Sa11 *= d1;  Sa21 *= d1;
Sa02 *= d2;  Sa12 *= d2;  Sa22 *= d2;

// Multiply by T to get Mst
M00 = Sa00*U0 + Sa01*U1 + Sa02*U2;
M10 = Sa10*U0 + Sa11*U1 + Sa12*U2;
M20 = Sa20*U0 + Sa21*U1 + Sa22*U2;
M01 = Sa00*V0 + Sa01*V1 + Sa02*V2;
M11 = Sa10*V0 + Sa11*V1 + Sa12*V2;
M21 = Sa20*V0 + Sa21*V1 + Sa22*V2;
M02 = Sa00 + Sa01 + Sa02;
M12 = Sa10 + Sa11 + Sa12;
M22 = Sa20 + Sa21 + Sa22;
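To convince yourself the fragment above does what it should, a check along these lines (my own addition, with M[i][j] standing in for the Mij scalars computed above) is handy: push each input point back through the resulting matrix and confirm that, after the homogeneous divide, it lands on its texture point.

#include <math.h>

// Returns 1 if every (Xi, Yi) maps to (Ui, Vi) through M
// (to within floating-point tolerance), 0 otherwise.
int check_mapping(const double M[3][3],
                  const double X[4], const double Y[4],
                  const double U[4], const double V[4])
{
    for (int i = 0; i < 4; i++) {
        double u = X[i]*M[0][0] + Y[i]*M[1][0] + M[2][0];
        double v = X[i]*M[0][1] + Y[i]*M[1][1] + M[2][1];
        double w = X[i]*M[0][2] + Y[i]*M[1][2] + M[2][2];
        if (fabs(u/w - U[i]) > 1e-6 || fabs(v/w - V[i]) > 1e-6)
            return 0;
    }
    return 1;
}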

Down a Dimension

As a mental exercise, let's look at this problem in one dimension. Suppose we want to find a single function relating X and U of the form

$$U = \frac{aX + b}{cX + d}$$

In matrix notation, this would be

$$\begin{bmatrix} X & 1 \end{bmatrix}
\begin{bmatrix} a & c \\ b & d \end{bmatrix}
= w \begin{bmatrix} U & 1 \end{bmatrix}$$

Here, each input/output pair (Xi and Ui) gives us one row in the expression

$$\begin{bmatrix} X_i & 1 & -X_iU_i & -U_i \end{bmatrix}
\begin{bmatrix} a \\ b \\ c \\ d \end{bmatrix} = 0$$

Three input/output pairs generate enough rows on the left to make this a fully determined system. Solving for a … d requires the determinants of four 3 × 3 matrices. But applying our 2D trick to the 1D problem, we can get the same result by the calculation

$$\begin{aligned}
\begin{bmatrix} w_0 & w_1 \end{bmatrix} &= \begin{bmatrix} (U_1 - U_2) & (U_2 - U_0) \end{bmatrix} \\
\begin{bmatrix} z_0 & z_1 \end{bmatrix} &= \begin{bmatrix} (X_1 - X_2) & (X_2 - X_0) \end{bmatrix} \\
\begin{bmatrix} a & c \\ b & d \end{bmatrix} &=
\begin{bmatrix} 1 & -1 \\ -X_1 & X_0 \end{bmatrix}
\begin{bmatrix} z_1w_0 & 0 \\ 0 & z_0w_1 \end{bmatrix}
\begin{bmatrix} U_0 & 1 \\ U_1 & 1 \end{bmatrix}
\end{aligned}$$

Tediously multiplying this out gives the following:

$$\begin{aligned}
a &= X_0U_0(U_2 - U_1) + X_1U_1(U_0 - U_2) + X_2U_2(U_1 - U_0) \\
b &= X_0X_1U_2(U_1 - U_0) + X_1X_2U_0(U_2 - U_1) + X_2X_0U_1(U_0 - U_2) \\
c &= X_0(U_2 - U_1) + X_1(U_0 - U_2) + X_2(U_1 - U_0) \\
d &= X_0X_1(U_1 - U_0) + X_1X_2(U_2 - U_1) + X_2X_0(U_0 - U_2)
\end{aligned}$$

The patterns in these expressions explicitly show something that we expect and implicitly assumed: the transformation will come out the same if we permute the indices of the input/output point pairs.
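As a tiny usage sketch of the 1D version (again my own addition, not from the original text), here are those four formulas packed into a function, with a worked example:

// Fit U(X) = (a*X + b) / (c*X + d) through three (Xi, Ui) pairs.
void fit_1d(const double X[3], const double U[3],
            double *a, double *b, double *c, double *d)
{
    *a = X[0]*U[0]*(U[2]-U[1]) + X[1]*U[1]*(U[0]-U[2]) + X[2]*U[2]*(U[1]-U[0]);
    *b = X[0]*X[1]*U[2]*(U[1]-U[0]) + X[1]*X[2]*U[0]*(U[2]-U[1]) + X[2]*X[0]*U[1]*(U[0]-U[2]);
    *c = X[0]*(U[2]-U[1]) + X[1]*(U[0]-U[2]) + X[2]*(U[1]-U[0]);
    *d = X[0]*X[1]*(U[1]-U[0]) + X[1]*X[2]*(U[2]-U[1]) + X[2]*X[0]*(U[0]-U[2]);
}
// Example: X = {0, 1, 3}, U = {0, 1, 2} gives U(X) = 4X / (X + 3),
// which indeed passes through (0,0), (1,1), and (3,2).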

We will use this 3-point to 3-point transformation to great effect in the next few chapters.

Up a Dimension

Now, close your eyes and stretch your mind in the other direction. Imagine input/output pairs in homogeneous 3D space. Each pair, connected by a 4 × 4-element matrix multiplication, gives four equations. The fourth equation gives an expression for wi. Plug it into the other three and rearrange to get three equations that can be written as three rows of stuff times a 16-element column of matrix elements a … p. Since this has a zero on the right side, we need one less than 16 equations to solve the homogeneous system. Hmm, 16 minus 1 gives 15, and we get three per input/output pair. That means that we can nail down a 3D homogeneous perspective transform with five input/output pairs. The conventional solution requires a truly excruciating 16 determinants of 15 × 15 matrices. Opening your eyes in shock, you discover that this exercise in imagination allowed me to get the idea across without making the typesetter hate me.

But now we know better how to solve this. The answer is a fairly straightforward generalization of Equations (13.6) and (13.9) into 4 × 4 matrices and 4-element vectors like so:

$$M_{bt} = \begin{bmatrix} w_0 & 0 & 0 & 0 \\ 0 & w_1 & 0 & 0 \\ 0 & 0 & w_2 & 0 \\ 0 & 0 & 0 & w_3 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & T_0 & 1 \\ U_1 & V_1 & T_1 & 1 \\ U_2 & V_2 & T_2 & 1 \\ U_3 & V_3 & T_3 & 1 \end{bmatrix}$$

$$\begin{bmatrix} w_0 & w_1 & w_2 & w_3 \end{bmatrix}
= \begin{bmatrix} U_4 & V_4 & T_4 & 1 \end{bmatrix}
\begin{bmatrix} U_0 & V_0 & T_0 & 1 \\ U_1 & V_1 & T_1 & 1 \\ U_2 & V_2 & T_2 & 1 \\ U_3 & V_3 & T_3 & 1 \end{bmatrix}^{*}$$

The details and arithmetic optimization tricks are left as an exercise for you to do (meaning I haven't gotten around to doing it myself).


1 Heckbert, P., Fundamentals of Texture Mapping and Image Warping, master's thesis, University of California, Berkeley, Dept. of Electrical Engineering and Computer Science, 1989 (www.cs.cmu.edu/~ph/#papers).
