Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

On shape analysis of functional data

Ruiyi Zhang; Anuj Srivastava Florida State University, Tallahassee, FL, United States

Abstract

This chapter studies shape analysis of functional data, specifically real-valued functions on intervals and closed curves in $R^{2}$ . This analysis uses a comprehensive elastic Riemannian framework that integrates solutions to the sub-problems of registration, shape comparison and shape analysis, all under the same formulation. Registration implies a dense matching of points across objects when comparing their shapes. This unified framework starts from a square-root transformation of functions, and removes undesired shape-preserving transformations, including translations, scalings, rotations and re-parametrizations. This removal results in a quotient space, called the shape space, where all statistical analysis is performed using the quotient space metric. We provide simulation examples and real data to illustrate the elastic shape analysis framework.

Keywords

Shape analysis; registration problem; Riemannian metric; geodesic; group action on a manifold; quotient space

11.1 Introduction

The problem of shape analysis of objects is very important with applications across all areas. Specifically, the shape analysis of curves in Euclidean spaces is important with widespread applications in biology, computer vision, and medical image analysis. For instance, the functionality of biological objects, such as proteins, RNAs, and chromosomes, is often closely related to their shapes, and we would like statistical tools for analyzing such shapes to understand their functionalities. These tools include metrics for quantifying shape differences, geodesics to study deformations between shapes, shape summaries (including means, covariances, and principals components) to characterize shape population, and statistical shape models to capture shape variability. Consequently, a large number of mathematical representations and approaches have been developed to analyze shapes of interest. Depending on their goals, these representations can differ vastly in their complexities and capabilities.

Shape analysis naturally involves elements of differential geometry of curves and surfaces. In the appendix we summarize some concepts and notions from differential geometry that are useful in any approach to shape analysis. Specifically, we introduce the concepts of a Riemannian metric, geodesics, group actions on manifolds, quotient spaces, and equivalence relations using orbits. We also provide some simple examples to illustrate them to a reader not familiar with these concepts. Given these items, we can layout a typical approach to shape analysis. A typical framework for shape analysis takes the following form. We start with a mathematical representation of objects—as vectors, matrices, or functions—and remove certain shape-preserving transformations termed as preprocessing. The remaining transformations, the ones that cannot be removed as preprocessing, as removing by forming group actions and quotient spaces.

David Kendall [10] provided one of the earliest formal mathematical frameworks for quantifying shapes. In this framework we start with a finite set of points, termed landmarks, that represent an object and removes the effects of certain transformation groups, namely rigid motions and global scaling, to reach final shape representations. As depicted in Fig. 11.1, the key idea is to remove two groups via preprocessing—remove translation group by centering landmarks and remove size group by rescaling landmark vector—and reach a constrained space called the preshape space. The remaining transformation, the rotation in this case, is removed by forming orbits under this group action, or equivalence classes, and imposing a metric on this quotient space of orbits. We introduce the concepts of equivalence relations, group actions, orbits, and quotient spaces in the Appendix. A number of prominent researchers have subsequently developed this framework into a rich set of statistical tools for practical data analysis [3,17,11,6].

Figure 11.1 A general framework of shape analysis: a mathematical representation leads to a preshape space that further results in the shape space.

One of the most important challenges in shape analysis is registration. Registration stands for densely matching points across objects and using this registration for comparing shapes and developing shape models. Historically, some approaches presume that objects are already perfectly registered, whereas some other approaches use off-the-shelf methods to preregister before applying their own shape analysis. Whereas a presumption of perfect registration is severely restrictive, the use of registration as a preprocessing step is also questionable, especially when the metrics for registration have no bearing on the metrics used in ensuing shape analysis and modeling. A better solution, one that has gained recognition over the last few years, is an approach called elastic shape analysis. Here we incorporate a solution for performing registration along with the process of shape comparisons, thus resulting in a unified framework for registration, shape comparisons, and analysis. The key idea here is to endow the shape space with an elastic Riemannian metric that has an appropriate invariance under the action of the registration group (and other nuisance groups). Whereas such elastic metrics are somewhat complicated to be of use directly, especially for analyzing large datasets, there is often a square-root transformation that simplifies them into the standard Euclidean metric. This point of view is the main theme of this chapter.

In this chapter we focus on the problem of shape analysis of curves in Euclidean spaces and provide an overview of this problem area. This setup includes, for example, shape analysis of planar curves that form silhouettes of objects in images or shape analysis of space curves representing complex biomolecular structures, such as proteins and chromosomes. A particular case of this problem is when we restrict to curves in $R$ , that is, we analyze shapes of real-valued functions on a fixed interval. This fast growing area in statistics, called functional data analysis [16], deals with modeling and analyzing data, where observations are functions over intervals. The use of elastic Riemannian metrics and square-root transformations for curves was first proposed by [22,23] although this treatment used complex arithmetic and was restricted to planar curves. Later on, [15] presented a family of elastic metrics that allowed for different levels of elasticity in shape comparisons. Joshi et al. [9] and Srivastava et al. [19] introduced a square-root representation, slightly different from that of Younes, which was applicable to curves in any Euclidean space. Subsequently, several other elastic metrics and square-root representations, each representing a different strength and limitation, have been discussed in the literature [25,1,2,12,24]. In this chapter we focus on the framework of [9] and [19] and demonstrate that approach using a number of examples involving functional and curve data.

We mention in passing that such elastic frameworks have also been developed for curves taking values on nonlinear domains also, including unit spheres [26], hyperbolic spaces [5,4], the space of symmetric positive definite matrices [27], and some other manifolds. Additionally, elastic metrics and square-root representations have also been used to analyze shapes of surfaces in $R^{3}$ . These methods provide techniques for registration of points across objects, as well as comparisons of their shapes, in a unified metric-based framework. For details, we refer the reader to a text book by [8] and some related papers [7,21,13].

11.2 Registration problem and elastic approach

Using a formulation similar to Kendall's approach, we provide a comprehensive framework for comparing shapes of curves in Euclidean spaces. The key idea here is to study curves as (continuous) parameterized objects and to use parameterization as a tool for registration of points across curves. Consequently, this introduces an additional group, namely the re-parameterization group, which is added in the representation and needs to be removed using the notion of equivalence class and quotient spaces.

11.2.1 The $L^{2}$ norm and associated problems

As mentioned earlier, the problem of registration of points across curves is important in comparisons of shapes of curves. To formalize this problem, let Γ represent all boundary-preserving diffeomorphisms of $[0, 1]$ to itself. Elements of Γ play the role of re-parameterization (and registration) functions. Let $F$ be the set of all absolutely continuous parameterized curves of the type $f : [0, 1] \to R^{n}$ . For any $f \in F$ and $γ \in Γ$ , the composition $f \circ γ$ denotes a reparameterization of f. Note that both f and $f \circ γ$ go through the same set of points in $R^{n}$ and thus have exactly the same shape. Fig. 11.2 shows an example of reparameterization of a semicircular curve f. The top row shows three γ's, and the bottom row shows the corresponding parameterizations of f. For any $t \in [0, 1]$ and any two curves $f_{1}, f_{2} \in F$ , the points $f_{1} (t)$ and $f_{2} (t)$ are said to be registered to each other. If we re-parameterize $f_{2}$ by γ, then the registration of $f_{1} (t)$ changes to $f_{2} (γ (t))$ for all t. In this way the reparameterization γ controls registration of points between $f_{1}$ and $f_{2}$ . On one hand, reparameterization is a nuisance variable since it preserves the shape of a curve but, and on the other hand, it is an important tool in controlling registration between curves.

Figure 11.2 An illustration of reparameterization of an open curve. This figure is taken from [18].

To register two curves, we need an objective function that evaluates and quantifies the level of registration between them. A natural choice will use the $L^{2}$ norm, but it has some unexpected pitfalls as described next. Let $‖ f ‖$ represent the $L^{2}$ norm, that is, $‖ f ‖ = \sqrt{\int_{0}^{1} | f (t) |^{2} d t}$ , of a curve f, where $| \cdot |$ inside the integral denotes the $ℓ^{2}$ norm of a vector. The $L^{2}$ norm provides the most commonly used Hilbert structure in functional data analysis. Despite its popularity, there are problems with this metric. The main problem is that as an objective function for registration, it leads to a degeneracy, called the pinching effect. In other words, if we try to minimize $‖ f_{1} - f_{2} \circ γ ‖$ over γ, this quantity can be made infinitesimally small despite $f_{1}$ and $f_{2}$ being very different functions. Fig. 11.3 shows a simple example to illustrate this idea using scalar functions on $[0, 1]$ . The top left panel of the figure shows to functions $f_{1}$ , $f_{2}$ that agree at only one point $t = 0.5$ in the domain. We design a sequence of piecewise-linear time-warping functions γs (shown in the bottom row) that increasingly spend time at $t = 0.5$ from left to right, resulting in the steady decrease in the $L^{2}$ norm $‖ f_{1} \circ γ - f_{2} \circ γ ‖$ . Continuing this process, we can arbitrarily decrease the $L^{2}$ norm between two functions, irrespective of the other values of these functions. Thus an optimization problem of the type $\inf_{γ \in Γ} ‖ f_{1} - f_{2} \circ γ ‖$ has degenerate solutions. This problem is well recognized in the literature [16,14], and the most common solution used to avoid pinching is to penalize large warpings using roughness penalties. More precisely, we solve an optimization problem of the type

$\inf_{γ \in Γ} ({‖ f_{1} - f_{2} \circ γ ‖}^{2} + λ R (γ)),$

(11.1)

where $R (γ)$ measures the roughness of γ. Examples of $R$ include $\int \dot{γ} {(t)}^{2} d t$ , $\int \ddot{γ} {(t)}^{2} d t$ , and so on. The role of $R$ is to discourage large time warpings. By large we mean γs whose first or second derivatives have high norms.

Figure 11.3 Pinching of functions f₁ and f₂ to reduce the $L^{2}$ norm between them. In each column we show f₁∘γ, f₂∘γ (top panel), γ (bottom panel), and the value of ‖f₁∘γ − f₂∘γ‖ below them.

Whereas this solution helps avoid the pinching effect, it often also restricts alignment of functions. Since it reduces the solution space for warping functions, it can also inhibit solutions that correctly require large deformations. Fig. 11.4 shows an example of this problem using two functions $f_{1}$ and $f_{2}$ shown in the top left panel. In this example we use the first order penalty ( $\int \dot{γ} {(t)}^{2} d t$ ) in Eq. (11.1) and study several properties of the resulting solution—does to align the functions well, is the solution symmetric in the functions $f_{1}$ and $f_{2}$ , does it have pinching effect, and how sensitive is the solution to the choice of λ? In each row, for a certain value of λ, we first obtain an optimal value of γ in Eq. (11.1). Then, to study the symmetry of the solution, we swap $f_{1}$ and $f_{2}$ and solve this optimization again. Finally, we compose the two resulting optimal warping functions. If the composition is a perfect identity function $γ_{i d} (t) = t$ , then the solution is called inverse consistent (and symmetric), otherwise not. In the first row, where $λ = 0$ , we see that the solution is inverse consistent, but there is substantial pinching, as expected. As we increase λ (shown in the second row), the level of pinching is reduced, but the solution also shows a small inconsistency in the forward and backward solutions. In the last row, where λ is large, the pinching effect is gone, but inverse inconsistency has increased. Also, due to increased penalty, the quality of alignment has gone down too. This discussion also points an obvious limitation of methods involving penalty terms. We need to decide which type of penalty and how large λ should be used in real situations. In contrast to the penalized $L^{2}$ approach, the elastic approach described in this chapter results in the solution shown in the bottom row. This solution is inverse consistent, rules out the pinching effect (and does not need any penalty or choice of λ), and achieves an excellent registration between the functions. Additionally, the computational cost is very similar to the penalized $L^{2}$ approach.

Figure 11.4 Illustration of problems with penalized $L^{2}$ norm in registering functions using time warpings: pinching effect, asymmetry of solutions, and need to balance matching with penalty. The bottom row shows the proposed method that avoids all three problems.

Lack of invariance under $L^{2}$ norm To study the shape of curves, we need representations and metrics that are invariant to rigid motions, global scale, and reparameterization of curves. The biggest challenge comes from the last group since it is an infinite-dimensional group and requires closer inspection. To understand this issue, take an analogous task of removing the rotation group in Kendall's shape analysis, where each object is represented by a set of landmarks. Let $X_{1}, X_{2} \in R^{n \times k}$ represent two sets of landmarks (k landmarks in $R^{n}$ ), and let $S O (n)$ be the set of all rotations in $R^{n}$ . To (rotationally) align $X_{2}$ to $X_{1}$ , we solve for a Procrustes rotation according to ${argmin}_{O \in S O (n)} {‖ X_{1} - O X_{2} ‖}_{F}$ , where ${‖ \cdot ‖}_{F}$ denotes the Frobenius norm of matrices. In other words, we keep $X_{1}$ fixed and rotate $X_{2}$ into a configuration that minimizes the Frobenius norm between them. The choice of Frobenius norm is important because it satisfies the following property:

$‖ X_{1} - X_{2} ‖ = ‖ O X_{1} - O X_{2} ‖ for all X_{1}, X_{2} \in R^{n \times k}, O \in S O (n) .$

(11.2)

If this property was not satisfied, we would not be able to perform Procrustes rotation. In mathematical terms we say that the action of $S O (n)$ on $R^{n \times k}$ is by isometries under the Frobenius norm or that the metric is invariant under the rotation group action. (Please refer to the Appendix for a proper definition of isometry under group actions.) A similar approach is needed for performing registration and removing the reparameterization group in the case of functions and curves. However, it is easy to see that $‖ f_{1} - f_{2} ‖ \neq ‖ f_{1} \circ γ - f_{2} \circ γ ‖$ in general. In fact, Fig. 11.3 already provides an example of this inequality. Since $L^{2}$ norm is not preserved under identical reparameterizations of curves, it is not suitable for use in registration and shape analysis. Thus we seek a new metric to accomplish registration and shape analysis of curves.

11.2.2 SRVFs and curve registration

Now we describe an elastic approach that addresses these issues and provides a fundamental tool for registration and shape analysis of curves.

As earlier, let $F$ be the set of all absolutely continuous parameterized curves in $R^{n}$ , and let Γ be the same of all reparameterization functions. Define the square-root velocity function (SRVF) [19,18] of f as a mathematical representation of f given by

$q (t) = {\begin{matrix} \frac{\dot{f} (t)}{\sqrt{| \dot{f} (t) |}}, & | \dot{f} (t) | \neq 0, \\ 0, & | \dot{f} (t) | = 0 . \end{matrix}$

(11.3)

In case of $n = 1$ this expression simply reduces to $q (t) = sign (\dot{f} (t)) \times \sqrt{| \dot{f} (t) |}$ . Here are some important properties associated with this definition.

• If f is absolutely continuous, as assumed, then q is square integrable, that is, $‖ q ‖ < \infty$ .
• This transformation from f to q is invertible up to a constant, with the inverse given by $f (t) = f (0) + \int_{0}^{t} q (s) | q (s) | d s$ . In fact the mapping $f \mapsto (f (0), q)$ is a bijection between $F$ and $R \times L^{2} ([0, 1], R^{n})$ .
• If f is reparameterized by $γ \in Γ$ to result in $f \circ γ$ , the SRVF change from q to $(q \circ γ) \sqrt{\dot{γ}} \overset{Δ}{=} (q ⋆ γ)$ . In other words, the action of Γ on $L^{2}$ is given by $q ⋆ γ$ . Also, if a curve is rotated by a matrix $O \in S O (n)$ , then its SRVF gets rotated by the same matrix O, that is, the SRVF of a curve Of is given by Oq.
• The length of the curve f, given by $L [f] = \int_{0}^{1} | \dot{f} (t) | d t$ , is equal to the $L^{2}$ norm of its SRVF q, that is, $L [f] = ‖ q ‖$ .
• The most important property of this representation is preservation of the $L^{2}$ norm under time warping or reparameterization, that is,

$‖ q_{1} - q_{2} ‖ = ‖ (q_{1} ⋆ γ) - (q_{2} ⋆ γ) ‖, \forall q_{1}, q_{2} \in L^{2} ([0, 1], R^{n}), γ \in Γ .$

(11.4)

We already know that $L^{2}$ norm is preserved under identical rotation, that is, $‖ q_{1} - q_{2} ‖ = ‖ O q_{1} - O q_{2} ‖$ .

In view of the invariant property, this representation provides a proper setup for registration of functional and curve data.

Definition 11.1

Pairwise registration

Given two curves $f_{1}, f_{2} \in F$ and their SRVFs $q_{1}, q_{2} \in L^{2} ([0, 1], R^{n})$ , we define their registration to be the optimization problem

$\inf_{γ \in Γ, O \in S O (n)} ‖ q_{1} - O (q_{2} ⋆ γ) ‖ = \inf_{γ \in Γ, O \in S O (n)} ‖ q_{2} - O (q_{1} ⋆ γ) ‖ .$

(11.5)

We make a few remarks about this registration formulation.

1. No Pinching: Firstly, this setup does not have any pinching problem. A special case of the isometry condition is that $‖ q ⋆ γ ‖ = ‖ q ‖$ for all q and γ. In other words, the action of Γ on $L^{2}$ is norm preserving. Thus pinching is not possible in this setup.
2. No Need for a Penalty Term: Since there is no pinching, we do not have to include any penalty term in this setup. Therefore there is no inherent problem about choosing the relative weight between the data term and the penalty term. However, if we need to control the time warping or registration, beyond the basic formulation, we can still add an additional penalty term as desired.
3. Inverse Symmetry: As stated in Eq. (11.5), the registration of $f_{1}$ to $f_{2}$ results in the same solution as the registration of $f_{2}$ to $f_{1}$ . The equality between the two terms in that equation comes from fact that $S O (n)$ and Γ are groups, and the isometry condition is satisfied (Eq. (11.4)).
4. Invariance to Scale and Translation: We can show that the optimizer in Eq. (11.5) does not change if either function is changed using positive scaling or translation, that is, we can replace any $f_{i} (t)$ with $a f_{i} (t) + c$ for any $a \in R_{+}$ and $c \in R^{n}$ , and the solution does not change. This is an important requirement for shape analysis since shape is invariant to these transformations. The solution is invariant to translation because the SRVF $q_{i}$ is based on the of the curve $f_{i}$ and because

$\underset{γ \in Γ}{arginf} ‖ q_{1} - (q_{2} ⋆ γ) ‖ = \underset{γ \in Γ}{arginf} ‖ a_{1} q_{1} - a_{2} (q_{2} ⋆ γ) ‖$

for any $a_{1}, a_{2} \in R_{+}$ .
5. Proper Metric: As described in the next section, the infimum in Eq. (11.5) is a proper metric itself in a quotient space and hence can be used for ensuing statistical analysis. (Please refer to the definition of a metric on the quotient space $M / G$ using an invariant metric on parent space M).

In Eq. (11.5) the optimization over $S O (n)$ is performed using the Procrustes solution:

$\begin{matrix} O^{⁎} = {\begin{matrix} U V if \det (A) > 0, \\ U [\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}] V^{T} otherwise, \end{matrix} \end{matrix}$

where $A = \int_{0}^{1} q_{1} (t) q_{2}^{T} (t) d t \in R^{n \times n}$ , and $svd (A) = U Σ V^{T}$ . The optimization over Γ is accomplished using the dynamic programming algorithm [20]. The optimization over $S O (n) \times Γ$ is performed by iterating between the two individual solutions until convergence.

We present some examples of registration of functions and curves to illustrate this approach. In the case $n = 1$ we do not have any rotation in Eq. (11.5), and one minimizes only over Γ using the dynamic programming algorithm. The last row of Fig. 11.4 shows alignment of two functions studied in that example. It shows alignments of the two functions to each other and the composition of the two γ functions resulting in a perfect identity function (confirming inverse consistency). Note that even though the optimization is performed using SRVFs in $L^{2}$ space, the results are displayed in the original function space for convenience. Fig. 11.5 shows another example of registering functions using this approach.

Figure 11.5 Registration of scalar functions using Eq. (11.5).

Fig. 11.6 shows a few examples of registration of points along planar curves. In each panel we show the two curves in red (light gray in print version) and blue (dark gray in print version), and optimal correspondences across these curves using black lines, obtained using Eq. (11.5). We can see that the algorithm is successful in matching points with similar geometric features (local orientation, curvature, etc.) despite different locations of matched points along the two curves. This success in registering curves translates into a superior performance in ensuing shape analysis.

Figure 11.6 Registration of planar curves using Eq. (11.5).This figure is taken from [18].

11.3 Shape space and geodesic paths

So far we have focused on an important problem of registering curves and functions. Our larger goal, of course, is the shape analysis of these objects, and registration plays an important role in that analysis. Returning to the problem of analyzing shapes, we describe the framework for an elastic shape analysis of Euclidean curves. It is called elastic because it incorporates the registration of curves as a part of their shape comparisons.

This framework follows the approach laid out in Fig. 11.1. We will use the SRVF representation of curves, reach a preshape space by removing some nuisance transformations, and form quotient space of that preshape space under the remaining nuisance transformations. Once again, let $F$ be the set of all absolutely continuous curves in $R^{n}$ , and let $L^{2}$ denote the set of square-integrable curves in $R^{n}$ . For any $f \in F$ , its length $L [f] = ‖ q ‖$ , where q is the SRVF of f. So, if we rescale f to be of unit length, then its SRVF satisfies $‖ q ‖ = 1$ . Let $C$ denote the unit Hilbert sphere inside $L^{2}$ . $C$ is called the preshape space and is the set of SRVFs representing all unit length curves in $F$ . The geometry of $C$ is simple, and we can compute distances/geodesics on $C$ relatively easily. For any two points $q_{1}, q_{2} \in C$ , the distance between them on $C$ is given by the arc length $d_{c} (q_{1}, q_{2}) = \cos^{- 1} (〈 q_{1}, q_{2} 〉)$ . The geodesic path between them is the shorter arc on a great circle given by

$α : [0, 1] \to C, α (τ) = \frac{1}{\sin θ} (\sin ((1 - τ) θ) q_{1} + \sin (τ θ) q_{2}),$

(11.6)

where $θ = d_{c} (q_{1}, q_{2})$ .

In representing a unit-length $f \in F$ by its $q \in C$ we have removed its translation (since q depends only on the derivatives of f) and its scale. However, the rotation and re-parameterization variabilities are still left in this representation, that is, we can have two curves with exactly the same shape but at different rotations and reparameterizations and thus with nonzero distance between them in $C$ . These transformations are removed using groups actions and equivalence relations, as described next. For any $O \in S O (n)$ and $γ \in Γ$ , $O (f \circ γ)$ has the same shape as f. In the SRVF representation we characterize these transformations as actions of product group $S O (n) \times Γ$ on $C$ according to

$(S O (n) \times Γ) \times L^{2} \to L^{2}, (O, γ) ⁎ q = O (q ⋆ γ) .$

This leads to the definition of orbits or equivalent classes:

$[q] = {O (q ⋆ γ) | O \in S O (n), γ \in Γ} .$

Each orbit $[q]$ represents a unique shape of curves. The set of all such orbits forms the shape space of curves

$S = C / (S O (n) \times Γ) = {[q] | q \in C} .$

(Once again, we advise the reader not familiar with these ideas to follow the definitions given in the Appendix.)

Definition 11.2

Shape metric

For any two curves $f_{1}, f_{2} \in F$ , define a metric between their shapes according to

$d_{s} ([q_{1}], [q_{2}]) = \inf_{O \in S O (n), γ \in Γ} d_{c} (q_{1}, O (q_{2} ⋆ γ))$

(11.7)

$= \inf_{O \in S O (n), γ \in Γ} \cos^{- 1} (〈 q_{1}, O (q_{2} ⋆ γ) 〉) .$

(11.8)

The interesting part of this definition is that the process of registration of points across shapes is incorporated in the definition of shape metric. The two computations, registration of curves and comparisons of their shapes, have been unified under the same metric. The optimal deformation from one shape to the other is mathematically realized as a geodesic in the shape space. We can evaluate the geodesic between the shapes $[q_{1}]$ and $[q_{2}]$ in $S$ by constructing the shortest geodesic between the two orbits in $C$ . If $(\hat{O}, \hat{γ})$ denote the optimal arguments in Eq. (11.8), then this geodesic is given by

$α (τ) = \frac{1}{\sin θ} (\sin ((1 - τ) θ) q_{1} + \sin (τ θ) {\hat{q}}_{2}), {\hat{q}}_{2} = \hat{O} (q_{2} ⁎ \hat{γ}) .$

We present some examples of this framework. Fig. 11.7 shows three examples of geodesic paths between given 2D curves. It is clear from these examples that the elastic registration of points across the two curves result in a very natural deformation between them. Similar geometric parts are matched to each other despite different relative sizes across the objects, a clear depiction of stretching and compression needed for optimal matching. Fig. 11.8 shows an example of elastic geodesic between two very simple proteins viewed as curves in $R^{3}$ .

Figure 11.7 Elastic geodesic paths between planar curves, showing that a good registration leads to natural deformations.

Figure 11.8 Elastic deformation between two proteins.This figure is taken from [18].

Shape spaces of closed curves In case we are interested in shapes of closed curves, we need to restrict to the curves satisfying the condition $f (0) = f (1)$ . In this case it is often more natural to choose the domain of parameterization to be $S^{1}$ , instead of $[0, 1]$ . Thus Γ now represents all orientation-preserving diffeomorphisms of $S^{1}$ to itself. Under the SRVF representation of f, the closure condition is given by $\int_{S^{1}} q (t) | q (t) | d t = 0$ . The preshape space for unit-length closed curves is

$C^{c} = {q \in L^{2} (S^{1}, R^{n}) | \int_{S^{1}} | q (t) | d t = 1, \int_{S^{1}} q (t) | q (t) | d t = 0} \subset C .$

An orbit or an equivalence class is defined to be: for $q \in C^{c}$ , $[q] = {O (q ⋆ γ) | O \in S O (n), γ \in Γ}$ , and the resulting shape space is $S^{c} = C^{c} / (S O (n) \times Γ) = {[q] | q \in C^{c}}$ . The computation of geodesic paths in $C^{c}$ is more complicated as there is no analytical expression similar to Eq. (11.6) is available for $C^{c}$ . In this case we use a numerical approximation, called path straightening [19], for computing geodesics and geodesic distances. Let $d_{c} (q_{1}, q_{2})$ denote the length of a geodesic path in $C^{c}$ between any two curves $q_{1}, q_{2} \in C^{c}$ . Then the distance between their shapes is given by

$d_{s} ([q_{1}], [q_{2}]) = \inf_{O \in S O (n), γ \in Γ} d_{c} (q_{1}, O (q_{2} ⋆ γ)) .$

(11.9)

Fig. 11.9 shows some examples of geodesic between curves taken from the MPEG7 shape dataset.

Figure 11.9 Examples of elastic geodesic paths between closed planar curves in $S^{c}$ .

11.4 Statistical summaries and principal modes of shape variability

Using the mathematical platform developed so far, we can define and compute several quantities that are useful in statistical shape analysis. For example, we can use the shape metric to define and compute a mean or a median shape, as a representative of shapes denoting a population. Furthermore, using the tangent structure of the shape space, we can compute principal modes for variability in a given sample of data. Given mean and covariance estimates, we can characterize underlying shape populations using Gaussian-type distributions on shape spaces.

Definition 11.3

Intrinsic, Fréchet mean

For a given set of curves $f_{1}, f_{2}, \dots, f_{n} \in F$ and the associated shapes $[q_{1}], [q_{2}], \dots, [q_{n}] \in S$ (or $S^{c}$ if curves are closed), their intrinsic or Fréchet mean is defined as the quantity

$[μ] = \underset{[q] \in S}{argmin} \sum_{i = 1}^{n} d_{s}^{2} {([q], [q_{i}])}^{2} .$

In other words, the mean is defined to be the shape that achieves minimum of the sum of squared distances to the given shapes. There is a well-known gradient-based algorithm for estimating this mean from given data.

In case of real-valued functions, we can simplify the setup and use it to register and align multiple functions. The basic idea is to use a template to register all the given curves using previous pairwise alignment. A good candidate of the template is a mean function defined previously. In the case of unscaled functions the definition of the mean simplifies to

$[μ] = \underset{q \in L^{2}}{argmin} \sum_{i = 1}^{n} (\inf_{γ_{i} \in Γ} {‖ q - (q_{i} ⋆ γ_{i}) ‖}^{2}) .$

If we fix the optimal warpings to be ${γ_{i}}$ s, then the solution for the mean reduces to $μ = \frac{1}{n} \sum_{i = 1}^{n} (q_{i} ⋆ γ_{i})$ . Using this idea, we can write an iterative algorithm for registration of multiple functions or curves.

Multiple alignment algorithm

1. Use Eq. (11.3) to compute SRVFs $q_{i}$ , $i = 1, \dots, n$ , from the given $f_{i}$ .
2. Initialize the mean with $μ = \underset{q_{i}}{\arg \min} ‖ q_{i} - \frac{1}{n} \sum_{k = 1}^{n} q_{k} ‖$ .
3. For each $q_{i}$ , solve $γ_{i} = \underset{γ \in Γ}{\arg \min} ‖ μ - (q_{i} ⋆ γ) ‖$ . Compute the aligned SRVFs ${\tilde{q}}_{i} = (q_{i} ⋆ γ_{i})$ .
4. Update $μ \mapsto \frac{1}{n} \sum_{i = 1}^{n} \tilde{q_{i}}$ and return to step 2 until the change $‖ μ - \frac{1}{n} \sum_{i = 1}^{n} \tilde{q_{i}} ‖$ is small.
5. Map all the SRVFs back to the function space using by $\tilde{f} (t) = f (0) + \int_{0}^{t} \tilde{q} (s) | \tilde{q} (s) | d s$ .

Fig. 11.10 shows an example of registration of multiple functions using this algorithm. The left panel shows a set of simulated functions ${f_{i}}$ that all are bimodal, but their modes differ in the locations and heights. This set forms the input to the algorithm, and the next two panels show the outputs. The middle panel shows the functions ${{\tilde{f}}_{i}}$ associated with the aligned SRVFs ${{\tilde{q}}_{i}}$ , and the right panel shows the optimal time warping functions ${γ_{i}}$ . We can see the high quality of registration in matching of peaks and valleys across functions.

Figure 11.10 Registration of multiple functions by registering each one of them to their mean.

Fig. 11.11 shows some more examples of multiple function registration, this time using real data. The two rows of this figure correspond to female (top) and male (bottom) growth curves taken from the famous Berkeley growth dataset. Each curve shows the rate at which the height of an individual changes from the age of one to twelve. Each row shows the original growth rate functions ${f_{i}}$ (left), their registrations ${{\tilde{f}}_{i}}$ (middle), and the optimal warpings ${γ_{i}}$ . The peaks in rate functions, denoting growth spurts, are better matched after registrations and are easier to interpret in the resulting data.

Figure 11.11 Registration of Berkeley growth data.This figure is taken from [18].

Fig. 11.12 shows some examples of mean shapes computed for a set of given shapes. We can see that the mean shapes are able to preserve main distinguishing features of the shape class while smoothing out the intraclass variabilities.

Figure 11.12 Mean shape for a set of closed curves.

We can use the shape metric and the flatness of the tangent space to discover principal modes of variability in the given set of shapes. Let $T_{[μ]} S$ denote the tangent space to $S$ at the mean shape $[μ]$ , and let $\exp_{[μ]}^{- 1} ([q])$ denote the mapping from the shape space $S$ to this tangent space using the inverse exponential map. We evaluate this mapping by finding the geodesic α from $[μ]$ to a shape $[q]$ and then computing the initial velocity of the geodesic, that is, $\exp_{[μ]}^{- 1} ([q]) = \dot{α} (0)$ . We also call these initial velocities the shooting vectors. Let $v_{i} = \exp_{[μ]}^{- 1} ([q_{i}])$ , $i = 1, 2, \dots, n$ , be the shooting vectors from the mean shape to the given shapes. Then performing PCA of ${v_{i}}$ provides the directions of maximum variability in the given shapes and can be used to visualize the main variability in that set. Fig. 11.13 shows three examples of this idea using some leaf shapes. In each panel of the top row we show the set of given leafs and their mean shapes. The bottom row shows the shape variability along the two dominant modes of variability, obtained using PCA in the tangent space. In the bottom plots the two axes represent the two dominant directions, and we can see the variation in shapes as one moves along these coordinates.

Figure 11.13 Mean (top) and principal modes of variability (bottom) for three sets of leaf shapes.

11.5 Summary and conclusion

This chapter describes an elastic approach to shape analysis of curves in Euclidean spaces. It focuses on the problem of registration of points across curves and shows that the standard $L^{2}$ norm, or its penalized version, has inferior mathematical properties and practical performances. Instead, it uses a combination of SRVFs and $L^{2}$ norm to derive a framework for a unified shape analysis framework. Here we register curves and compare their shape in a single optimization framework. We compute geodesic paths, geodesic distances, and shape summarizes using the geometries of shape spaces. These tools can then be used in statistical modeling and analysis of shapes of curves.

Appendix Mathematical background

We summarize here some background knowledge of algebra and differential geometry with the example of planar shapes represented by k ordered points in $R^{2}$ , denoted by $X \in R^{2 \times k}$ . Clearly, X and $O X, O \in S O (2)$ have the same shape. That can be defined strictly through the following concepts.

Definition 11.4

A binary relation ∼ on a set X is called an equivalence relation if it satisfies the following properties: For $x, y, z \in X$ ,

• $x \sim x$ (reflexivity),
• $x \sim y \Leftrightarrow y \sim x$ (symmetry),
• $x \sim y$ , $y \sim z \Rightarrow x \sim y$ (transitivity).

Definition 11.5

Equivalence class. $[x] = {y \in X | y \sim x}$ .

Definition 11.6

The quotient space of X under equivalence relation ∼ is the set of all equivalence classes $[x]$ , denoted by $X / \sim$ .

Thus the space of planar shapes in the form of discrete points is the quotient space of $R^{2 \times k} / S O (2)$ . To measure the difference between two shapes, we need a proper metric. For $R^{2 \times k}$ , a natural choice is the Euclidean metric, but for a nonlinear manifold, the difference is measured by the length of the shortest path connecting the two shapes, which requires the following definitions.

Definition 11.7

A Riemannian metric on differential manifold M is a smooth inner product on the tangent $T_{p} (M)$ space of M. A differential manifold with a Riemannian metric is called Riemannian manifold.

Definition 11.8

Let $α : [0, 1] \to M$ represent a path on a Riemannian manifold M. Then $\frac{d α}{d t}$ represents the velocity at time t. The length of α can be defined as $L [α] = \int_{0}^{1} \sqrt{〈 \frac{d α}{d t}, \frac{d α}{d t} 〉} d t$ . The path connecting two points with shortest length is called the geodesic between them. The length of geodesic is a proper metric between the two points.

Example 11.1

The geodesic between $p, q \in R^{n}$ is a straight line: $α (τ) = (1 - τ) p + τ q$ , $τ \in [0, 1]$ .

Example 11.2

The geodesic between $p, q \in S^{n} \subseteq R^{n + 1}$ is the arc on a great circle: $α (τ) = \frac{1}{\sin θ} [\sin ((1 - τ) θ) p + \sin (τ θ) q]$ , $τ \in [0, 1]$ .

For $X \in R^{2 \times k}$ , the fact $‖ X ‖ = ‖ O X ‖$ , $O \in S O (2)$ , induces two important concepts, group action and isometry.

Definition 11.9

A Lie group G is a smooth manifold such that the group operations $G \to G \times G$ defined by $(g, h) \to g h$ and $G \to G$ defined by $g \to g^{- 1}$ are both smooth mappings.

The groups appearing in this chapter are all Lie groups.

Definition 11.10

Given a manifold M and a Lie group G, a left group action of G on M is a map $G \times M \to M$ , denoted by $(g, p)$ , satisfying

1. $(g_{1}, (g_{2}, p)) = ((g_{1} \cdot g_{2}), p)$ , $\forall g_{1}, g_{2} \in G$ and $p \in M$ .
2. $(e, p) = p$ , $\forall p \in M$ .

Similarly, we can define the right group action.

Definition 11.11

Given two metric spaces X and Y, a map $f : X \to Y$ is called an isometry or distance preserving if $d_{X} (a, b) = d_{Y} (f (a), f (b))$ for all $a, b \in X$ .

Definition 11.12

A group action of G on a Riemannian manifold M is called isometric if it preserves the Riemannian metric on M, that is, $d (x, y) = d ((g, x), (g, y))$ for all $g \in G$ and $x, y \in M$ .

Definition 11.13

Given a manifold M, for any $p \in M$ , the orbit of p under the action of group G is defined as $[p] = {g \cdot p | g \in G}$ .

Example 11.3

The rotation group $S O (n)$ acts on $R^{n}$ by the action $O ⁎ x = O x$ for all $O \in S O (n)$ and $x \in R^{n}$ . The orbit of x is the sphere centered at the origin with radius $‖ x ‖$ .

For a planar shape denoted by a $2 \times k$ matrix X, the shape can be identified by an orbit $[X] = {O X | O \in S O (2)}$ .

Definition 11.14

For group G acting on the manifold M, the quotient space $M / G$ is defined as the set of all orbits of G in M: $M / G = {[p] | p \in M}$ .

Definition 11.15

If a group action of G on a Riemannian manifold M is isometric and the orbits under G are closed, then we can define a metric on the quotient space $M / G$ as follows:

$d_{M / G} ([p], [q]) = \min_{g \in G} d_{M} (p, (g, q))$

We illustrate this idea by an example. Let $R^{2 \times k}$ be the space of planar shapes of k points with Euclidean metric. Consider the action of rotation group $S O (2)$ on $R^{2 \times k}$ . The metric on the quotient space $R^{2 \times k} / S O (2)$ is

$d_{R^{2 \times k} / S O (2)} ([X_{1}], [X_{2}]) = \min_{O \in S O (2)} ‖ X_{1} - O X_{2} ‖ .$

The geodesic between two shapes $[X_{1}]$ and $[X_{2}]$ is $α (τ) = (1 - τ) X_{1} + τ O^{⁎} X_{2}$ , where $O^{⁎} = \underset{O \in S O (2)}{\arg \min} ‖ X_{1} - O X_{2} ‖$ , $τ \in [0, 1]$ . If the two shapes are rescaled to be unit length, that is, $‖ X_{1} ‖ = ‖ X_{2} ‖ = 1$ , then the geodesic is the arc on a great circle of the unit sphere $α (τ) = \frac{1}{\sin (θ)} [\sin ((1 - τ) θ) X_{1} + \sin (τ θ) O^{⁎} X_{2}]$ , where $θ = arc \cos (〈 X_{1}, O^{⁎} X_{2} 〉)$ , $O^{⁎} = \underset{O \in S O (2)}{\arg \min} arc \cos (〈 X_{1}, O X_{2} 〉)$ .