In this chapter we provide some brief historical background to traditional and geometrical methods of shape analysis. We then define what is meant by a configuration of landmarks and the configuration space, discuss measures of the size, and then consider some simple coordinate systems for representing shapes.
Shape analysis has a long history, especially in biology. In order to analyse size or shape, a biologist traditionally selects ratios of distances between landmarks or angles, and then carries out multivariate analysis (e.g. Rao 1948). This approach has been called ‘multivariate morphometrics’ in biology and a review is given by Reyment et al. (1984, p. 120). Early researchers using such methods include Pearson (1926), who studied a measure of similarity between skulls based on many lengths between landmarks. In fact, various studies have been cited in the journal Biometrika, starting from studies in volume 1 itself (Fawcett and Lee 1902). The data were usually distances between landmarks (such as lengths and widths) or angles between landmarks, rather than landmarks themselves. The definitions of landmarks on skulls follow the Frankfurt Concordat (Frankfurter Verständigung) of the 13th Congress of the German Anthropological Society which met in Frankfurt-am-Main, 14–17 August 1882 (e.g. see Trevor 1950).
Another large area of work in biology has been in the study of allometry, that is differences in shape associated with size (Hopkins 1966; Sprent 1972). The notion of allometry was introduced by Huxley (1924, 1932) and often involves fitting simple non-linear equations to length measurements. We consider a discussion of some related distance-based methods later in Chapter 15.
In the studies of multivariate morphometrics one deals exclusively with positive variables (lengths, angles and ratios of lengths). However, considering only distances and angles can be inferior to using the actual coordinates of the landmarks, because the geometry is often thrown away when using the former. Ratios of distances can easily be calculated from coordinates whereas the converse is not generally true. However, if enough distances are taken, then a configuration can be reconstructed up to a reflection and analysis can be carried out using multidimensional scaling (MDS) or Euclidean distance matrix analysis (see Kent 1994; Lele and Richtsmeier 2001; Dryden et al. 2008a), see Chapter 15.
Since the 1980s there have been many key developments in shape analysis that allow us to work on the landmark coordinates directly. Also, the advances in technology of measuring landmarks have been helpful, for example landmarks from digitized objects. Of course if there were no constraints from registration, then we could use standard multivariate analysis in Euclidean space, but in general the statistical methodology for shape is inherently non-Euclidean.
The main idea of the geometrical approach to shape analysis is that, rather than working with quantities derived from organisms, one works with the complete geometrical object itself (up to similarity transformations). As pointed out by Bookstein (1978) the approach is very much in the spirit of Thompson (1917) who considered the geometric transformations of one species to another (e.g. see Figure 2.1). D’Arcy Thompson’s key ideas will be discussed in more detail in Chapter 12, but the important point to note is that he worked with geometrical pictures of organisms rather than derived quantities.
Throughout the text it will be observed that pictures of the objects under study can always be easily constructed and it is this fact that embodies the geometrical approach to shape analysis. In many applications the statistical goal is inference, for example testing for shape difference. However, the scientific goal is to depict or describe the size and shape changes in a study, and this is a major strength of the geometrical methods that we describe.
We shall consider a shape space obtained directly from the landmark coordinates, which retains the geometry of a point configuration. This approach to shape analysis has been called ‘geometric shape analysis’ or ‘geometric morphometrics’ by various authors and the subject progressed rapidly around the late 1970s/early 1980s. A glossary of geometrical morphometrics is given by Slice et al. (1996).
Following Thompson (1917) some of the earlier work in geometrical shape analysis included that by Medawar (1944) and Sneath (1967). However, the research area became firmly established with the pioneering work of D.G. Kendall and F.L. Bookstein, who independently developed many of the key ideas, in very different styles. Also, some very important mathematical work in the area was also given by Ziezold (1977). Bookstein (1994a) has summarized the history of geometrical shape analysis, mainly through applications in biology. Kendall (1989) has reviewed shape theory and its development from a different, more theoretical, viewpoint, with applications in archaeology, astronomy and geography. Kendall (1989, 1995) provides some historical remarks on his development of shape theory, and some of the contributions of his colleagues. The first articles on the subject were by Kendall (1977), Ziezold (1977) and Bookstein (1978). Some key early papers in the field include those by Kendall (1984, 1989), Bookstein (1986), Goodall (1991), Le and Kendall (1993) and Kent (1994). Also, development in non-i.i.d. (independent and identically distributed) distribution theory for shape started with Mardia and Dryden (1989a).
Definition 2.1 The configuration is the set of landmarks on a particular object. The configuration matrix X is the k × m matrix of Cartesian coordinates of the k landmarks in m dimensions. The configuration space is the space of all landmark coordinates.
In our applications we have k ≥ 3 landmarks in m = 2 or m = 3 dimensions and the configuration space is typically the space of real k × m matrices (or equivalently ) with possibly some special cases removed, such as coincident points.
Before defining shape we should define what we mean by size, so that it can be removed from a configuration. Consider X to be a k × m matrix of the Cartesian coordinates of k landmarks in m real dimensions, that is the configuration matrix of the object.
Definition 2.2 A size measure g(X) is any positive real valued function of the configuration matrix such that
for any positive scalar a > 0.
Definition 2.3 The centroid size is given by:
where Xij is the (i, j)th entry of X, the arithmetic mean in the jth dimension is ,
is the centring matrix,
is the Euclidean norm, Ik is the k × k identity matrix (diagonal matrix with ones on the diagonal), and 1k is the k × 1 vector of ones.
Obviously S(aX) = aS(X) for a > 0, thus satisfying Equation (2.1). The centroid size S(X) is the square root of the sum of squared Euclidean distances from each landmark to the centroid, namely
where (X)i is the ith row of X(i = 1, …, k) and is the centroid. This measure of size will be used throughout the book. In fact, the centroid size is the most commonly used size measure in geometrical shape analysis (e.g. Kendall 1984; Bookstein 1986; Goodall 1991; Dryden and Mardia 1992). The centroid size could also be used in a normalized form, for example S(X)/√k or S(X)/(km)1/2, and this would be particularly appropriate when comparing configurations with different numbers of landmarks. The squared centroid size can also be interpreted as (2k)− 1 times the sum of the squared inter-landmark distances since
In Figure 2.2 we display boxplots of the centroid size of the six groups of great apes which were described in Section 1.4.8. The plot is obtained using the R commands:
data(apes)
size<-centroid.size(apes$x)
plot(apes$group,size)
It is clear that there are sex differences in centroid size for gorillas and orangutans (pongo), with the males all being larger than the females. For chimpanzees (pan) there is an overlap in the centroid size distributions of the sexes, but the males are significantly larger in mean centroid.size, for example using a t-test (p-value = 0.0002):
> t.test(size[apes$group=="panm"],size[apes$group=="panf"],
alternative="greater")
Welch Two Sample t-test
data: size[apes$group == "panm"] and size[apes$group == "panf"]
t = 3.8176, df = 50.704, p-value = 0.0001841
Example 2.1 de Souza et al. (2001a,b) analysed the sizes of stereolithography (SLA) skull models which had been constructed by a manufacturer using CT scans from a real head. In a test to assess the accuracy of the SLA models, six heads (cadavers) were CT scanned, and the CT scans were used to build SLA models. After the removal of the soft tissues, 37 specific anatomical landmarks were measured on both the SLA models and the corresponding real skulls. Using a paired t-test on the log of the centroid sizes, it was found that the SLA model was significantly larger by 1.5% than the original real skull. In particular, this size error was taken into account when designing custom implants, because even a small size increase would have severe clinical consequences. □
An alternative size measure is the baseline size, that is the length between landmarks 1 and 2:
The baseline size was used as early as 1907 by Galton (1907) for normalizing faces and its use came into prominence with Bookstein coordinates, described in Section 2.4. This size variable is also useful in calculating size-and-shape distributions (Dryden and Mardia 1992) which we outline in Section 11.5.
Other alternative size measures include the square root of the area of the convex hull for planar configurations or the cube root of the volume of the convex hull for configurations in three dimensions which intuitively describes size, but these measures have not become popular. For triangles with coordinates (X1, Y1), (X2, Y2) and (X3, Y3) the area is simply given by:
Note that the volume of a parallelepiped in three dimensions formed by v1, v2, v3 is given by the absolute value of the determinant of the matrix with columns v1, v2, v3. Other size measures for triangles include the radius of the inscribed circle and the circumradius (e.g. Miles 1970), the latter being equal to S(X)/√3.
In general the choice of size will affect reported conclusions. In particular, if object A has twice the centroid size of object B it does not necessarily follow that object A is twice the size of object B when using a different measure. When reporting size differences one needs to state which size measure was used. The exception is when both objects have the same shape, in which case the ratio of sizes will be the same regardless of the choice of size variable. If comparing sets of collinear points, then very different conclusions about size are reached, with these pathological configurations all having zero area but non-zero centroid size for non-coincident collinear points.
Example 2.2 Consider the triangles in Figure 2.3. The centroid sizes of the triangles to three decimal places are:
The baseline sizes are:
The square root of area sizes to three decimal places are:
The relative ranking of triangles in terms of size differs depending on the choice of size measure. In particular, in terms of area A is larger than B, but in terms of centroid size B is larger than A. This example demonstrates that different choices of size measure can lead to different conclusions about size. □
The following R commands give the calculations in the example:
> library(shapes)
> x1<-matrix(c(-1.5,-0.5,-1,1,1,sqrt(3)/2+1),3,2)
> x2<-matrix(c(0.5,2,1.25,1,1,1+sqrt(3)/4),3,2)
> x3<-matrix(c(-1.5,-0.5,-1.5,-1,-1,-1+1/sqrt(3)),3,2)
> x4<-matrix(c(0.5,2,1.25,-1,-1,0.75-1),3,2)
> centroid.size(x1)
[1] 1
> centroid.size(x2)
[1] 1.118034
> centroid.size(x3)
[1] 0.942809
> centroid.size(x4)
[1] 1.224745
> #baseline size
> sqrt( (x1[2,1]-x1[1,1])**2+(x1[2,2]-x1[1,2])**2 )
[1] 1
> sqrt( (x2[2,1]-x2[1,1])**2+(x2[2,2]-x2[1,2])**2 )
[1] 1.5
> sqrt( (x3[2,1]-x3[1,1])**2+(x3[2,2]-x3[1,2])**2 )
[1] 1
> sqrt( (x4[2,1]-x4[1,1])**2+(x4[2,2]-x4[1,2])**2 )
[1] 1.5
> #areas
> sqrt(abs((x1[3,2]-x1[1,2])*(x1[2,1]-x1[1,1])-
(x1[3,1]-x1[1,1])*(x1[2,2]-x1[1,2]))/2)
[1] 0.658037
> sqrt(abs((x2[3,2]-x2[1,2])*(x2[2,1]-x2[1,1])-
(x2[3,1]-x2[1,1])*(x2[2,2]-x2[1,2]))/2)
[1] 0.5698768
> sqrt(abs((x3[3,2]-x3[1,2])*(x3[2,1]-x3[1,1])-
(x3[3,1]-x3[1,1])*(x3[2,2]-x3[1,2]))/2)
[1] 0.537285
> sqrt(abs((x4[3,2]-x4[1,2])*(x4[2,1]-x4[1,1])-
(x4[3,1]-x4[1,1])*(x4[2,2]-x4[1,2]))/2)
[1] 0.75
Mosimann (1970) considers a definition of size based on length measurements which satisfies Equation (2.1), but with vectors of positive measurements (length, width, etc.) in place of the configuration matrix X.
In order to describe an object’s shape it is useful to specify a coordinate system. We initially consider some of the most straightforward coordinate systems, which helps to provide an elementary introduction to aspects of shape. A suitable choice of coordinate system for shape is invariant under translation, scaling and rotation of the configuration. Further coordinate systems are discussed later in Section 4.4.
For k = 3 points in m = 2 dimensions two internal angles are an obvious choice of coordinates that are invariant under the similarity transformations. For example, x1 and x2 could measure the shape of the triangle in Figure 2.4, where x1 + x2 + x3 = 180°.
However, it soon becomes apparent that using angles to describe shape can be problematic. For cases such as very flat triangles (three points in a straight line) there are many different arrangements of three points. For example, see Figure 2.5, where the triangles all have x1 = 0°, x3 = 0°, x2 = 180°, and yet the configurations have different shapes.
For larger numbers of points (k > 3) one could subdivide the configuration into triangles and so 2k − 4 angles would be needed. For the m = 3 dimensional case angles could also be used, but again these suffer from problems in pathological cases. Also, probability distributions of the angles themselves are not easy to work with (see Mardia et al. 1977). If the angles of the triangle are x1, x2 and x3, then the use of log (x1/x3) and log (x2/x3) (where x1 + x2 + x3 = 180°) has some potential for analysing triangle shape using compositional data analysis (Aitchison 1986; Pukkila and Rao 1988) and the approach can be adapted to higher dimensions. Mardia et al. (1996b) used angles from triangulations of landmark data on photographs of faces, and Mardia et al. (1977) used the angles between towns in central place data. In such cases, analysis of shape can proceed as in directional statistics (Mardia and Jupp 2000).
A Ramachandran plot (Ramachandran et al. 1963) is a method for visualizing backbone dihedral torsion angles ψ versus ϕ of amino acid residues in protein secondary structures. The two angles describe the torsion angles either side of each α-carbon atom, and provide very useful shape information. This plot contains important angular shape information about the protein backbone and has proved very popular in the proteomics literature. The torsion angles are helpful in the prediction of a protein’s 3D folding. Note that each angle is on the circle, S1, and so the plots are on a torus S1 × S1. The plot is also known as a [ϕ, ψ] plot or a Ramachandran diagram. Another geometrical description of a protein backbone was given by Zacharias and Knapp (2013) called a (d,θ) plot.
Another traditional approach to representing shape is through ratios of length measurements, or lengths divided by size. A considerable amount of work has been carried out in multivariate morphometrics using distances, ratios, angles, and so on and it is still very commonly used in the biological literature (Reyment et al. 1984). There are many situations, such as classification problems, where the techniques can be very powerful. However, sometimes the interpretation of the important linear combinations of ratios of lengths and angles can be difficult. It is often easier to interpret pictures in the original space of the specimens than in some derived multivariate space, so we consider methods where the geometry of the object is retained.
Typical applications of the multivariate morphometrics approach include the classification of species (taxonomy) or the sexing of skulls using lengths, angles or ratios of lengths between landmarks. A very commonly used method is to perform a principal component analysis (PCA) of the multivariate measurements and to interpret each component as a measure of some aspect of size or shape (e.g. Jolicoeur and Mosimann 1960). Quite often the first principal component has approximately equal loadings on each variable and so can be interpreted as an overall measure of size.
Mosimann (1970) gives a rigorous treatment of this subject and provides a theoretical framework for the study of the size and shape of positive length measurements. Mosimann (1970) defines the shape vector to be l/g(l), where g(l) is a size measure. He provides theorems for the independence of population size and shape, including characterizations of various distributions. Further details of the approach can be found in Mosimann (1975a,b, 1988) and Darroch and Mosimann (1985) and the references therein.
Given a set of measurements d1, …, dm the Penrose (1952) size coefficient is:
and the Penrose shape coefficient is:
The coefficients are related to a coefficient of racial likeness (Penrose, 1952; after Pearson, 1926) given by:
These very traditional shape measures usually only capture part of the shape or size-and-shape information, so we prefer a newer approach where the geometry of the objects under study is retained.
Let (xj, yj), j = 1, …, k, be k ≥ 3 landmarks in a plane (m = 2 dimensions). Bookstein (1986, 1984) suggests removing the similarity transformations by translating, rotating and rescaling such that landmarks 1 and 2 are sent to a fixed position. If landmark 1 is sent to (0, 0) and landmark 2 is sent to (1, 0), then suitable shape variables are the coordinates of the remaining k − 2 coordinates after these operations. To preserve symmetry, we consider the coordinate system where the baseline landmarks are sent to and .
Definition 2.4 Bookstein coordinates (uBj, vjB)T, j = 3, …, k, are the remaining coordinates of an object after translating, rotating and rescaling the baseline to and so that
where j = 3, …, k, D212 = (x2 − x1)2 + (y2 − y1)2 > 0 and − ∞ < uBj, vjB < ∞.
A geometrical illustration of these transformations is given in Figure 2.6. If the baseline is taken as (0, 0) and (1, 0), then there is no in the equation for uBj, as originally proposed by Bookstein (1986). We subtract the in uBj to also simplify the transformation to Kendall coordinates in Section 2.5, although precisely where we send the baseline is an arbitrary choice. These coordinates have been used widely in shape analysis for planar data. Bookstein coordinates are the most straightforward to use for a newcomer to shape analysis. However, because of the lack of symmetry in choosing a particular baseline and the fact that correlations are induced into the coordinates, many practitioners often prefer to use the Procrustes tangent coordinates (see Section 4.4).
Galton (1907) defined precisely the same coordinates [with baseline (0,0), (1,0)] at the beginning of the last century, but the statistical details of using this approach needed to be worked out.
The construction of Bookstein coordinates is particularly simple if using complex arithmetic. Bookstein coordinates are obtained from the original complex coordinates zo1, …, zko, where :
where . Consider how the formulae in Equation (2.5a) are obtained (Mardia 1991). To find (uBj, vjB) for a fixed j = 3, …, k, we have to find the scale c > 0, the rotation A, and the translation b = (b1, b2)T such that
where X = (xj, yj)T and U = (uBj, vjB)T are the coordinates of the jth point before and after the transformation, j = 3, …, k, and A is a 2 × 2 rotation matrix, namely
where we rotate clockwise by θ radians. Applying the transformation to landmarks 1 and 2 gives four equations in four unknowns (c, θ, b1, b2)
Now, we can solve these equations and see that the translation is
the rotation (in the appropriate quadrant) is
and the rescaling is c = {(x2 − x1)2 + (y2 − y1)2}− 1/2. So,
Substituting X = (xj, yj)T(j = 3, …, k) and c, A, b into Equation (2.7) we obtain the shape variables of Equation (2.5a). This solution can also be seen from Figure 2.6 using geometry.
Example 2.3 In Figure 2.7 we see scatter plots of the Bookstein coordinates for the T2 Small vertebrae of Section 1.4.1. We take landmarks 1 and 2 as the baseline. Note that the marginal scatter plots at landmarks 4 and possibly 3 and 5 are elliptical in nature. The plot can be obtained using:
data(mice)
qset2<-mice$x[,,mice$group=="s"]
u<-bookstein2d(qset2)$bshpv
plotshapes(u,symbol=as.character(1:6))
For example, the first mouse has Bookstein shape variables given by the last four rows of:
> u[,,1]
, , 1
[,1] [,2]
[1,] -0.50000000 0.0000000
[2,] 0.50000000 0.0000000
[3,] 0.09121051 0.2826022
[4,] 0.04065577 0.5089238
[5,] -0.07367575 0.2735386
[6,] -0.02672182 -0.3054334
and this could also be obtained directly using the command:
bookstein.shpv(qset2[,,1])
For practical data analysis it is sensible to choose the baseline as landmarks that are not too close together, as in this example. As often happens with using Bookstein coordinates, the variability in the points away from the origin appears larger than the points nearer to the origin. This is an artifact of this coordinate system, and will be explored further in Sections 9.4 and 11.1.4. The centroid sizes of the bones are obtained using:
sz<-centroid.size(qset2)
A pairwise scatter plot of Bookstein’s shape variance and centroid size is obtained using:
x<-rbind(u[3:6,1,],u[3:6,2,])
x<-rbind(matrix(sz,1,23),x)
pairs(t(x),labels=c("s","u3","u4","u5","u6","v3","v4","v5","v6"))
In Figure 2.8 we see pairwise scatter plots of Bookstein coordinates. There are strong positive correlations in the vB3, v4B and vB5 coordinates. There are also positive correlations between vB3 and vB6, between vB5 and vB6, and between uB3 and uB4. However, it is also an artifact of the coordinate system that correlations are induced into the shape variables, even when the landmark coordinates are uncorrelated (see Section 11.1.4), and so correlations can be difficult to interpret.
We can also examine the joint relationship between size and shape, which will be considered in more detail in Chapter 5. Scatter plots of the centroid size S versus each of the shape coordinates are also given in Figure 2.8. We see that there are quite strong positive correlations between S and each of vB3, v4B and vB5. □
Throughout the text we shall often refer to the real (2k − 4)-vector of Bookstein coordinates uB = (uB3, …, ukB, vB3, …, vkB)T, stacking the coordinates in this particular order.
One approach to shape analysis is to use standard multivariate analysis of the Bookstein coordinates, ignoring the non-Euclidean nature of the space. Provided variations in the data are small, then the method is adequate for mean estimation and hypothesis testing. For example, we could obtain an estimate of the mean shape of the configuration by taking the arithmetic average of the Bookstein coordinates (the Bookstein mean shape). The Bookstein mean shape for the T2 Small mouse vertebrae of Example NaN is given by the last four rows of:
> bookstein2d(mice$x[,,mice$group=="s"])$mshape
[,1] [,2]
[1,] -0.50000000 0.000000e+00
[2,] 0.50000000 -6.634366e-21
[3,] 0.08469746 2.933430e-01
[4,] 0.01215768 5.613175e-01
[5,] -0.06874750 2.991278e-01
[6,] -0.02502185 -3.041418e-01
which when rounded to three decimal places is:
The variability of the shape variables is less straightforward to interpret. Transforming the objects (or registering) to a given edge induces correlations into the shape variables in general and this can lead to spurious correlations (Kent 1994), see Sections 9.4 and 11.1.4. So the method should not be used to interpret the structure of shape variability unless there is a good reason to believe that two landmarks are essentially fixed (see Section 9.4).
Registration on a base to filter out the similarity transformations can also be carried out for 3D data, in a similar way to edge registration in Bookstein coordinates for 2D data. Consider k landmarks , in three dimensions. The number of shape variables is 3k − 7 since we have 3k landmark coordinates but must remove 3 location, 1 scale and 3 rotation parameters. The Bookstein coordinates can be taken as
where A is a 3 × 3 rotation matrix (a function of (X1, X2, X3)) and
where u23 ≥ 0, u33 = 0 and Xj → uj for j = 4, …, k. We drop the superscript B denoting Bookstein coordinates in this section. Geometrically the figure has been translated, rotated and rescaled so that the baseline (landmarks 1 and 2) is of unit length, has midpoint at the origin and is rotated to lie along the x-axis. The figure is further rotated so that landmark 3 lies in the plane of the first two dimensions (X, Y), with Y ≥ 0. The remaining non-fixed coordinates are the shape variables (cf. Mardia and Dryden 1997). The details are as follows: first translate the figure,
We then rescale by dividing by the length of the baseline between points 1 and 2: D12 = 2(w212 + w222 + w322)1/2, and rotate through clockwise angles of θ about the z-axis, ω about the y-axis and ϕ about the x-axis where
So, using matrix notation if
then the 3D Bookstein coordinates are:
An explicit expression for the shape coordinates is:
where a = 2(w212 + w222 + w232), j = 4, …, k, and
As in the m = 2 dimensional case, care must be taken when interpreting correlations between the shape variables, because spurious correlations can occur with edge registration methods such as this (see Sections 9.4 and 11.1.4).
Example 2.4 In Figure 2.9(a) we see a male macaque skull from the dataset described in Section 1.4.3, with 24 landmarks located in three dimensions and a wire box regular grid drawn over the skull. Let us consider the page to be the x–y plane and the z-axis is perpendicular to the page. The 3D Bookstein coordinates involve selecting two landmarks (e.g. landmarks 1 and 6 in Figure 1.11) to translate, rotate and rescale to lie horizontal in the plane of the page at points and , and a third landmark (e.g. landmark 10) is chosen to lie in the x–y plane, as displayed in Figure 2.9(b).
Important point: For affine shape (where invariance is under affine transformations rather than Euclidean similarity transformations – see Section 12.2), there are 12 rather than 7 constraints. Hence all the coordinates of the first four landmarks are sufficient to form a base in the affine case and the expressions are simpler than for similarity shape.
Kendall coordinates (Kendall 1984) are related to Bookstein coordinates but location is removed in a different manner. We first need to define the Helmert submatrix which is used to remove location.
The Helmert submatrix H is the (k − 1) × k Helmert matrix without the first row. The full Helmert matrix HF, which is commonly used in Statistics, is a square k × k orthogonal matrix with its first row of elements equal to 1/√k, and the remaining rows are orthogonal to the first row. We drop the first row of HF so that the transformed HX does not depend on the original location of the configuration. Note HTH = C, where C is the centring matrix of (2.3).
Definition 2.5 The jth row of the Helmert submatrix H is given by:
and so the jth row consists of hj repeated j times, followed by − jhj and then k − j − 1 zeros, j = 1, …, k − 1.
For k = 3 the full Helmert matrix is explicitly
and the Helmert submatrix is
For k = 4 points the full Helmert matrix is
and the Helmert submatrix is
We can obtain the Helmert submatrix using defh(k-1)
where k is the number of landmarks, for example:
> defh(2)
[,1] [,2] [,3]
[1,] -0.7071068 0.7071068 0.0000000
[2,] -0.4082483 -0.4082483 0.8164966
> defh(3)
[,1] [,2] [,3] [,4]
[1,] -0.7071068 0.7071068 0.0000000 0.0000000
[2,] -0.4082483 -0.4082483 0.8164966 0.0000000
[3,] -0.2886751 -0.2886751 -0.2886751 0.8660254
> defh(4)
[,1] [,2] [,3] [,4] [,5]
[1,] -0.7071068 0.7071068 0.0000000 0.0000000 0.0000000
[2,] -0.4082483 -0.4082483 0.8164966 0.0000000 0.0000000
[3,] -0.2886751 -0.2886751 -0.2886751 0.8660254 0.0000000
[4,] -0.2236068 -0.2236068 -0.2236068 -0.2236068 0.8944272
Consider the original complex landmarks zo = (zo1, …, zko)T and remove location by pre-multiplying by the Helmert submatrix H to give zH = Hzo = (z1, …, zk − 1)T.
Definition 2.6 The Kendall coordinates are given by:
There is a simple one to one correspondence between Kendall and Bookstein coordinates. If we write
for Bookstein coordinates and
for Kendall coordinates, then it follows that
where H1 is the lower right (k − 2) × (k − 2) partition matrix of the Helmert submatrix H. Note that
so linear transformation from one coordinate system to the other is straightforward. The inverse transformation is:
For k = 3 we have the relationship:
and so Kendall coordinates in this case are the coordinates of the third landmark after transforming landmarks 1 and 2 to ( − 1/√3, 0) and (1/√3, 0) by the similarity transformations. The transformation from (zo1, z2o, zo3) to Kendall coordinates is:
Throughout the text we shall often refer to the real (2k − 4)-vector of Kendall coordinates uK = (uK3, …, ukK, vK3, …, vkK)T, stacking the coordinates in this particular order.
For the case of a triangle of landmarks we have two shape coordinates.
Example 2.5 In Figure 2.3 we see the triangles with internal angles at points 1, 2, 3 given by:
A: 60°, 60°, 60° (equilateral),
B: 30°, 30°, 120° (isosceles) ,
C: 90°, 30°, 60° (right-angled),
D: 45°, 45°, 90° (right-angled and isosceles).
Bookstein coordinates for these three triangles with baseline points 1, 2 are:
A: ,
B: ,
C: ,
D: .
A plot of the shape space of triangles for Bookstein coordinates is given in Figure 2.10. Each triangle shape is drawn with its centroid in the position of the Bookstein coordinates (UB, VB) corresponding to its shape in the shape space. For example, the equilateral triangles are at (0, √3/2) and (0, −√3/2) (marked E and F ). The right-angled triangles are located on the lines UB = −0.5, UB = 0.5 and on the circle (UB)2 + (VB)2 = 0.5; the isosceles triangles are located on the line UB = 0 and on the circles (UB + 0.5)2 + (VB)2 = 1 and (UB − 0.5)2 + (VB)2 = 1; and the flat triangles (3 points in a straight line) are located on VB = 0.
The plane can be partitioned into two half-planes – all triangles below VB = 0 can be reflected to lie above VB = 0. In addition, each half-plane can be partitioned into six regions where the triangles have their labels permuted. If a triangle has baseline 1, 2 and apex 3 and side lengths d12, d13 and d23, then the six regions are:
Thus, if invariance under relabelling and reflection of the landmarks was required, then we would be restricted to one of the 12 regions, for example the region AOE, bounded by the arc of isosceles triangles AE, the line of isosceles triangles OE and the line of flat triangles AO in Figure 2.10.
It is quite apparent from Figure 2.10 that a non-Euclidean distance in (UB, VB) is most appropriate for the shape space. For example, two triangles near the origin that are a Euclidean distance of 1 apart are very different in shape, but two triangles away from the origin that are a Euclidean distance of 1 apart are quite similar in shape. Non-Euclidean shape metrics are described in Section 4.1.1. Bookstein (1986) also suggested using the hyperbolic Poincaré metric for triangle shapes, which is described in Section 12.2.4.
For k = 3 we will see in Section 4.3.4 that the shape space is a sphere with radius . A mapping from Kendall coordinates to the sphere of radius is:
and r2 = (uK3)2 + (v3K)2, so that , where uK3 and vK3 are Kendall coordinates of Section 2.5.
Definition 2.7 Kendall’s spherical coordinates (θ, ϕ) are given by the polar coordinates
where 0 ≤ θ ≤ π is the angle of latitude and 0 ≤ ϕ < 2π is the angle of longitude.
The relationship between (uK3, v3K) and the spherical shape variables (Mardia 1989b) is given by:
The sphere can be partitioned into 6 lunes and 12 half-lunes. In order to make the terminology clear, let us note that one example full-lune is 0 ≤ ϕ ≤ π/3, 0 ≤ θ ≤ π and one example half-lune is 0 ≤ ϕ ≤ π/3, 0 ≤ θ ≤ π/2.
In Figure 2.11 we see triangle shapes located on the spherical shape space. The equilateral triangle with anti-clockwise labelling corresponds to the ‘North pole’ (θ = 0) and the reflected equilateral triangle (with clockwise labelling) is at the ‘South pole’ (θ = π). The flat triangles (three collinear points) lie around the equator (θ = π/2). The isosceles triangles lie on the meridians ϕ = 0, π/3, 2π/3, π, 4π/3, 5π/3. The right-angled triangles lie on three small circles given by:
and we see the arc of unlabelled right-angled triangles on the front half-lune in Figure 2.11.
Reflections of triangles in the upper hemisphere at (θ, ϕ) are located in the lower hemisphere at (π − θ, ϕ). In addition, permuting the triangle labels gives rise to points in each of the six equal half-lunes in each hemisphere. Thus, if invariance under labelling and reflection was required, then we would be restricted to one of these half-lunes, for example the sphere surface defined by 0 ≤ ϕ ≤ π/3, 0 ≤ θ ≤ π/2. Consider a triangle with labels A, B and C, and edge lengths AB, BC and AC. If the labelling and reflection of the points was unimportant, then we could relabel each triangle so that, for example, AB ≥ AC ≥ BC and point C is above the baseline AB.
If we have a triangle in three dimensions, then we see that we can translate, rotate and rescale so that
where u23 > 0. Hence, the triangle has two shape coordinates (u13, u23)T which lie in a half-plane. We could transfer to Kendall’s spherical shape coordinates of Equation (2.14), although the range of the latitude angle θ will be , as u23 > 0. Hence, the shape space for triangles in three dimensions is the hemisphere with radius half, that is , as we will see in Section 4.3.4. This is also the case for triangles in more than three dimensions.
For practical analysis and the presentation of data it is often desirable to use a suitable projection of the sphere for triangle shapes. Kendall (1983) defined an equal area projection of one of the half-lunes of the shape sphere to display unlabelled triangle shapes. The projected lune is bell-shaped and this graphical tool is also known as ‘Kendall’s Bell’ or the spherical blackboard (an example is given later in Figure 14.2).
An alternative equal-area projection is the Schmidt net (Mardia 1989b) otherwise known as the Lambert projection given by:
In Figure 2.12 we see a plot of one of the half-lunes on the upper hemisphere of shape space projected onto the Schmidt net. Example triangles are drawn with their centroids at polar coordinates (ξ, ψ) in the Schmidt net.
Watson (1986) considers a coordinate system for triangle shape. Let be the complex landmark coordinates. Let ω = exp(iπ/3) and u = (1, ω, ω2)T, So, 13, u and are orthogonal and have length √3. One can express the triplet of points z as:
The vector 13 represents the degenerate triangle shape (all points coincident), u is an equilateral triangle and is its reflection. Since z has the same shape as cz + d13, where , the shape of the triangle can be obtained from . The Watson shape coordinates are given by the complex number b. All unlabelled triangles can be represented in the sector of the unit disc, with |b| ≤ 1 and 0 < Arg(b) < π/3. Watson (1986) constructs sequences of triangle shapes from products of circulant matrices.
18.227.107.147