21.3 Criteria for Band Prioritization

In order to implement BP, a criterion is needed to measure the significance of a spectral band in terms of its priority score. In what follows, four classes of criteria are considered.

21.3.1 Second-Order Statistics-Based BPC

The first category of BP criterion (BPC) is derived from second-order statistics that are based on variance and signal-to-noise ratio (SNR).

21.3.1.1 Variance-Based BPC

A natural and logical approach to band prioritization is to compute variances for all spectral band images img in a hyperspectral image cube, denoted by img and further use band variances to define a priority score for each of band images as follows:

(21.1) equation

Another alternative interpretation is to use a set of loading factors img proposed in Tu et al. (1998) and Chang et al. (1999) that can be defined by

(21.2) equation

It is easy to show that for each img, ρl defined by

(21.3) equation

turns out to be the variance img of the lth spectral band image. As a result of (21.3), the priority score calculated by (21.1) for the lth spectral band image Bl is also equivalent to the PCA-based priority score defined by

(21.4) equation

21.3.1.2 Signal-to-Noise-Ratio-Based BPC

It was noted in Green et al. (1988) that variance was not an appropriate criterion to measure image quality. In order to alleviate this dilemma, an SNR-based criterion was first developed by Green et al. (1988) to improve the PCA. The resulting transform was called maximum noise fraction transform and later reinterpreted by Lee et al. (1990) as noise-adjusted principal component (NAPC) transform. In analogy with the criterion specified by (21.4) for the variance-based PCA, a similar criterion to (21.4) can also be derived from the SNR-based NAPC as follows.

Assume that img is the set of eigenvalues of noise-adjusted sample covariance matrix and img are their associated orthonormal eigenvectors. We can define the loading factors in a similar manner to (21.2) for an NAPC by

(21.5) equation

Using (21.5), a noise-adjusted variance-based priority score can be calculated for the lth spectral band image Bl via (21.5) defined by the NAPC-based priority score:

(21.6) equation

21.3.2 High-Order Statistics-Based BPC

In many applications, the information of interest may not be captured by second-order statistics, but rather be characterized by higher-order statistics. In order to take this into account, three higher-order statistics-based criteria are derived in this section for BPC.

21.3.2.1 Skewness

The simplest high-order statistics is the third central moment, referred to as skewness and defined by

(21.7) equation

21.3.2.2 Kurtosis

A fourth central moment, referred to as kurtosis, is defined by

(21.8) equation

21.3.3 Infinite-Order Statistics-Based BPC

It should be noted that according to our experience, criteria for BP based on statistics higher than 4 do not have much significant advantage compared to skewness and kurtosis (Ren et al., 2006). Therefore, only ∞-order statistics-based BP criteria, entropy and information divergence, are discussed in this section.

21.3.3.1 Entropy

One of the simplest and most widely used ∞-order statistic-based BPC is entropy that requires an infinite number of moments. Let H(Bl) be the entropy calculated for the lth band image Bl. The entropy-based priority score for Bl is defined by

(21.9) equation

It should be noted that the entropy H(Bl.) in (21.9) is calculated based on the gray-level histogram produced by the lth spectral band image Bl where the number of bins is totally determined by the difference between the maximal and minimal gray levels present in the spectral band image Bl. For example, if the maximal and minimal gray levels are 255 and 0, respectively, then there are together 256 bins needed to estimate the entropy.

21.3.3.2 Information Divergence

As an alternative to entropy defined by (21.9), an information theoretic measure, called information divergence (ID), can also be used as a BPC. Assume that the pl is the image histogram of the lth spectral band image, Bl normalized as a probability distribution and gl is its associated Gaussian distribution with mean and variance determined by sample mean and sample variance of the Bl. BP criterion of interest is to measure the deviation far away from a Gaussian distribution for a given spectral band image, that is, the discrepancy between pl and gl defined by

(21.10) equation

where D(pl;gl) is called information divergence (Kullback, 1968)

(21.11) equation

The higher the value of D(21.pl;gl) in (21.11), the greater deviation of pl from the Gaussian distribution, gl is. This implies that ID is used to measure the degree of non-Gaussianity of a band. It should be noted that if both pl and gl are replaced with two spectral signatures, the D(pl;gl) defined by (21.11) becomes spectral information divergence (SID) in Chang (2000) and Chang (2003a).

21.3.4 Classification-Based BPC

In the previous two subsections, BPC are designed based on statistics. Additionally, they are also unsupervised in the sense that no prior knowledge is involved in these criteria. However, in some applications, prior knowledge may be available and can be taken advantage of to design BPC. In this subsection, two supervised classification-based criteria are developed for BP. Such classification-based BP criteria are different from statistics-based BP criteria such as variance-, SNR- or high-order statistics-based BP criteria in the sense that the former is developed for target detection and classification applications while the latter is completely determined by statistics that has little to do applications.

21.3.4.1 Fisher's Linear Discriminant Analysis (FLDA)-Based BPC

Minimum misclassification canonical analysis (MMCA) derived from Fisher's linear discriminant analysis (FLDA) was used in Tu et al. (1998) to minimize the misclassification error. For any given band number img, we can use (21.2)(21.3) with eigenvalues and unit eigenvectors replaced by the eigenvalues img and normalized unit feature vectors imgimg as used in Tu et al. (1998) to define the loading factors as follows:

(21.12) equation

for img and img In light of (21.12), the priority score can be calculated for the lth spectral band image Bl by

(21.13) equation

21.3.4.2 OSP-Based BPC

Another classification-based criterion is derived from the orthogonal subspace projection (OSP) (Harsanyi and Chang, 1994) that is based on the linear mixture model as follows:

(21.14) equation

where img, img and n is noise or model error. If we further assume that the p image endmembers img can be divided into two classes of endmembers, one class of nD desired image endmembers denoted by img and the other class of undesired endmembers denoted by img with img. Then the OSP classifier for a particular desired endmember mj, img can be actually obtained by img with img where img and img, img is the pseudo-inverse of img. Now following the same argument outlined by (21.12) and (21.13) we can define loading factors for the OSP classifiers img as

(21.15) equation

and

(21.16) equation

where img and img obtained in Chang et al. (1999). By means of (21.16), the priority score assigned to the lth spectral band image Bl can be calculated by

(21.17) equation

21.3.5 Constrained Band Correlation/Dependence Minimization

Taking a rather different approach from the ideas used to design previous BP criteria, a recent new approach, called constrained band selection (CBS) developed in Chang and Wang (2006), suggested a new criterion for BP, which linearly constrained a particular band image while minimizing band correlation/dependence resulting from other band images. In other words, the priority score of a spectral band can be calculated according to the degree of correlation or dependence between this particular band image and other band images measured by least squares errors. Its idea can be briefly described as follows.

Assume that the size of all the spectral band images Bl is img. Since each spectral band image Bl can be represented by a column vector of dimensions img, denoted by bl, we have a total number of L spectral band image vectors img. For any given spectral band image vector bl we can design a finite impulse response (FIR) specified by a set of L weighting vectors, img that constrains bl while minimizing least squares error caused by other band image vectors img. More specifically, let yl be the filter output obtained by

(21.18) equation

The averaged least squares filter output is given by

(21.19) equation

Let img denote the band image correlation matrix. A similar optimization problem to the constrained energy minimization (CEM) in Chapter 2 can be obtained as follows:

(21.20) equation

The solution to (21.20), denoted by img is given by

(21.21) equation

Alternatively, we can exclude the spectral band image bl from the band correlation matrix Q and further define img as the band dependence matrix. Replacing Q in (21.20) with img results in a similar constrained band selection problem

(21.22) equation

The solution to (21.22), img is the same as the one in (21.21) with the Q replaced by img that is given by

(21.23) equation

21.3.5.1 Band Correlation/Dependence Minimization

By means of (21.21) and (21.23) we can calculate the following least squares errors (LSEs):

(21.24) equation

(21.25) equation

that can be used to measure degree of the spectral band image vector bl correlated with and dependent on other spectral band image vectors img, respectively. That is, the greater the LSE in (21.24) or (21.25), the higher the correlation of bl with other band image vectors. So, we can use (21.24) and (21.25) to derive two criteria for BP, called band correlation minimization (BCM) defined by

(21.26) equation

and band dependence minimization (BDM) defined by

(21.27) equation

21.3.5.2 Band Correlation Constraint

Comparing to (21.24) and (21.25), an alternative approach is to calculate band correlation constraint (BCC)

(21.28) equation

and band dependence constraint (BDC)

(21.29) equation

which can also be used to measure the correlation between the spectral band image vector bl and any other spectral band image vector bk (21.img). By comparing the value of img with the filter constraint specified by img in (21.20) or (21.22), a spectral band image Bk has less correlation with the spectral band image Bl if its band constraint img is far away from 1. In other words, the closer the img to 1, the higher the correlation of Bk to Bl. With this interpretation, two criteria similar to (21.26) and (21.27) can also be derived for BP, called BCC given by

(21.30) equation

band dependence constraint (BDC)

(21.31) equation

One disadvantage of these CEM-based criteria is the enormous size of vectors converted from band images that causes tremendous computing time. For example, it requires a vector with 4 × 104 dimensions to represent a band image with size 200 × 200. In order to mitigate this dilemma, a linearly constrained minimum variance (LCMV) in (Frost, 1972; Van Veen and Buckley, 1988) is developed to derive four criteria similar to four CEM-based criteria specified by (21.24) and (21.25) and (21.28) and (21.29). Instead of constraining a band image as a vector, the LCMV-CBS constrains a band image as an image matrix without vector conversion. Its idea is derived from the LCMV approach, which can be traced back to Frost's work in adaptive beamforming (Frost, 1972). More specifically, assume that img are nc columns of the lth spectral band image Bl, which has nr rows and nc columns. So, the jth column vector of Bl denoted by img is represented by an nr-dimensional column vector, img. In this case, the lth spectral band image Bl can be further expressed by a matrix given by

(21.32) equation

Like the CEM, the goal is to design a constrained FIR linear filter with an nr-dimensional weight column vector img specified by a set of nr filter coefficients img that minimizes (21.19) subject to the following simultaneous nc multiple constraints, img, img that is equivalent to

(21.33) equation

where img is an nc-dimensional column vector with all 1s in its nc components. It should be noted that since the weight vector vl is used to constrain column vector of a band image, its dimensionality is nr compared to the nrnc-dimensional weight vector wl used in (21.20) that constrains a band image as a vector with dimensionality nrnc. By virtue of the nc multiple constraints in (21.33), the CEM problem described by (21.20) can be rederived as the following LCMV-based optimization problem:

(21.34) equation

where img is the sample band correlation matrix. The solution to (21.34) can be solved as

(21.35) equation

and

(21.36) equation

plays the same role that ρl does for the CEM-BCM. Similar derivations to CEM-BDM can also be obtained for img and

(21.37) equation

In analogy with the CEM-based band correlation/dependence constraint criteria (BCC/BDC)

(21.38) equation

and

(21.39) equation

can also be derived for an LCMV-based band correlation/dependence constraint criteria by replacing band image vector bl and CEM-based weight vectors with band image Bl and LCMV-based weight vectors, respectively.

Despite the fact that the CBS described was developed in Chang and Wang (2006), the idea of BP was not introduced in their paper and nor were the priority scores specified by (21.26) and (21.27), (21.30)(21.31). Table 21.1 summarizes all the proposed BPC in terms of their characteristics where “supervised” indicates that training samples are required for the particular criterion.

Table 21.1 Comparison among various BP criteria.

img

As a concluding remark, one comment is noteworthy. The effectiveness of BP is determined by its applications not criteria alone. As will be demonstrated by following experiments, a different application yields a totally different selected set of bands. With proper bands selected by BP the number of bands can be significantly reduced, while still achieving performance comparable to that accomplished by using full bands.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.142.198.129