22.2 Dynamic Dimensionality Allocaction

The DDA presented in this section is designed to dynamically determine the values of q and img. It originates from the pigeon-hole principle described in Section 1.3.2 as well as variable-length coding from the information theory. According to the pigeon-hole principle each signature is assumed as a pigeon to be accommodated by a particular spectral dimension/band which can be considered as a pigeon-hole. Therefore, the number of pigeons should determine at least how many pigeon-holes required for accommodation. This is equivalent to saying that the number of spectrally distinct signatures determines the minimal number of spectral dimensions/bands required for signature discrimination, which is exactly the original idea of VD. To materialize DDA, we first interpret the use of a pigeon-hole to accommodate a pigeon by a binary bit “1” and “0” otherwise. This implies that a spectral dimension/band being used to specify a particular signature will be encoded by “1.” Otherwise, “0” will be assigned to an unused spectral dimension/band. To fit the profile of source coding, the first task is to determine what type of signatures that can be considered as source alphabets. If the signature knowledge is provided a priori, this known signatures can be used as desired source alphabets. If there is no prior knowledge available about the data, the signatures should be found in an unsupervised means. In this case, the VD developed in Chapter 5 can be used to estimate the number of spectrally distinct signatures, denoted by nVD. To find nVD unknown signatures, the automatic target generation process (ATGP) developed by Ren and Chang (2003) and discussed in Section 8.5.1 can be used to produce a set of signatures, img, that correspond to desired source alphabets. Or alternatively, the virtual signatures found in the supervised LSMA in Chapter 17 can also serve as the same purpose. Once these signatures of interest are found, the next task is to calculate their discriminatory probabilities according to the relative spectral discriminatory probability (RSDPB) in Chang (2000) and Chapter 2 in Chang (2003a) that will determine DDA. In doing so, we briefly review the basic concept of source coding that will be used to define DDA.

Assume that an information source S is emitted by a set of source alphabets img with a given probability distribution img where pj is the probability of the occurrence of the source alphabet aj. To encode the source S these source alphabets must be represented by a set of code words, called code book. Two coding schemes are generally used for finding a code book for source encoding. One is called fixed-length coding which assigns code words with equal length to all the source alphabets img. The other is variable-length coding that assigns code words with variable coding lengths to individual source alphabets according to their occurrence probabilities. Let the coding length used to encode aj be denoted by lj. Shannon showed that the optimal coding scheme must be variable-length coding with the mean coding length img determined and approximated by the source entropy, img. The Huffman coding is proved to be the one that achieves the optimal coding performance in terms of Shannon entropy (see details in Section 31.2.1). The only case that a fixed-length coding also achieves the same performance as the Huffman coding does is when the occurrence probabilities of all the source alphabets are equally likely.

Now how can we borrow the idea of the variable-length coding described above to apply to hyperspectral imaging? First, let img denote the signatures of interest where nS can be either the number of known signatures or nVD if no prior knowledge is given. Furthermore, let nj denote the number of spectral dimensions/bands required to represent the jth signature, sj. Then sj, nS, and nj shall play the same role as aj, J, and lj do in source coding to represent the jth source alphabet, the number of source alphabets and number of bits required for a code book to encode the set of source alphabets img along with their corresponding probabilities img. In other words, the source occurrence probability pj is now interpreted by nj that reflects how difficult the sj is discriminated from other signatures in terms of spectral similarity in the same way that how frequently the aj occurs in terms of probabilities relative to other source alphabets used in source coding. As a result, the higher the probability pj is, the shorter coding length, lj is. This implies that the easier to be discriminated the signature sj is, the smaller number the nj is. In particular, to best represent the sj in terms of spectral dimensions/bands the nj must vary with discriminatory power possessed by the signature sj. The traditional DR/BS makes a simple and natural assumption that all the signatures are equally discriminable by setting nj = nS for all img in which case it performs so-called static dimensionality allocation (SDA), that is, fixed-size band dimensionality. Apparently, it is generally not true in hyperspectral data where each material substance signature has its own spectral characteristics and has a different level of signature discriminability. So, what DDA is to the SDA in DR/BS is the same as what is the variable-length coding to fixed-length coding in source coding. Recently, Wang and Chang (2007) have introduced a new concept of variable number variable band selection (VNVBS) to be discussed in Chapter 27 and showed that using variable numbers of spectral bands was more effective than using fixed number of spectral bands. It provided evidence in advantages of using DDA over SDA.

The next remaining issue is how to come up a technique to determine DDA similar to variable-length coding used in source coding where the bit allocation is determined by the coding lengths img which can be calculated by their associated probabilities. By interpreting signature discriminatory probabilities as source alphabet probabilities we are able to do the same for DDA in such a way that variable numbers of spectral dimensions/bands are assigned to represent various discriminatory powers of signatures. Three commonly used coding schemes, Shannon coding, Huffman coding, and Hamming coding can be developed to find DDA.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
18.219.22.169