As noted in the introduction, hyperspectral signatures can be categorized into background signatures characterized by second-order statistics of IBSI(S) and target signatures characterized by HOS of IBSI(S). Recall that the commonly used PCA is a second-order statistics-based transform that uses a set of PCs to represent the data where eigenvectors are projection vectors to specify PCs with eigenvalues being data variances. In this case, PCA can be then used to extract background signatures characterized by second-order statistics of IBSI(S) in PCs. On the other hand, ICA is an HOS-based transform that uses mutual information to generate a set of ICs to represent data. Therefore, ICA can be used to find desired target signatures characterized by HOS of IBSI(S) in ICs. In both cases, VD is again used to determine how many PCs and ICs are required to extract signatures. Since PCs and ICs are obtained by mapping all data samples onto the projection vectors, the projection values of data samples are real values. So, an issue arises: how many data sample vectors should be selected from each PC and each IC? Two sample values in each IC are of major interest: one with maximal projection value and the other with minimal projection value. These two samples represent maximal projections in two opposite directions of a projection vector that specifies an IC. They both indicate their importance in data analysis. This idea was previously explored in pixel purity index (PPI) in Chapter 7 where the most likely endmembers are those samples with either maximal or minimal projections on each randomly generated vectors referred to as skewers. However, it is worth noting that there is no similar selection of sample vectors with minimal projections in PCs due to the fact that PCA is a transformation of second-order statistics with variance representing signal energy. Those samples with minimal projections, that is, variances are supposed to correspond to noisy samples. In this case, there is no reason to select data sample vectors with minimal projections in PCs as desired samples similar to the case described above for ICA where noisy samples with small variances have been removed by sphering. All such data sample vectors extracted from PCs and ICs are considered as VSs.
Using VD again to determine the numbers of PCs and ICs along with data samples selected from the first PCs and data sample vectors with maximal and minimal projections in the first ICs a CA-based unsupervised VS finding algorithm (CA-UVSFA) can be described as follows.
CA-UVSFA
A comment on that each of PCA and ICA is required to generate p components is noteworthy. It is often the case that target sample vectors may also show up in either PCs or ICs, but not both. In order to make sure that no matter which scenario will be, using p PCs and p ICs should have sufficient components to capture all these target sample vectors. A detailed step-by-step procedure of CA-ULSMA can be summarized as follows.
CA-ULSMA
In Step 1 of CA-ULSMA, in order for the HFC/NWHFC method to work effectively, the spectrally distinct signatures defined by VD are those that do not have significant contribution to data variances. Such signatures are generally characterized by three unique features. Firstly, the probabilities of their occurrence are usually low. Secondly, when such signatures are present, there are not too many samples. Thirdly, as a result, the variances of such signatures are generally very small and can be considered to be negligible. So, when the HFC/NWHFC method is used to estimate VD that determines the number of VSs used for LSMA, two assumptions are made on the VSs. The first and foremost assumption is that all the VSs in SVS are assumed to have the least intersample spectral correlation. This is a reasonable assumption since different VSs should have the least spectral correlation among all the data sample vectors and should also be as distinct as possible in terms of spectral characteristics via IBSI(S). Another is that the number of samples specified by VS should be relatively small since they represent most spectrally distinctive signatures. With these two assumptions in mind a VS is only contributed to the sample mean but not variance of each spectral band. This is also the key idea used to develop the HFC/NWHFC method and explains why the HFC/NWHFC-estimated VD works very effectively for HYDICE data in the following experiments where the number of target panel pixels specified by each of five panel signatures is very small. When these two assumptions are violated, the HFC/NWHFC-estimated VD may not be accurate. So, when the HFC/NWHFC method is used to estimate VD we should be aware of these assumptions. Finally, we conclude a noteworthy comment on the use of twice VD value, 2 = 2p for LS-ULSMA and CA-ULSMA to extract VSs. Such selection is not arbitrary. It is actually based on the concept, called dynamic dimensionality allocation (DDA) derived in Chapter 22. For more details we refer readers to this chapter.
3.15.171.202