33.7 Hyperspectral Signal Processing

The hyperspectral data considered in Chapters 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22,23 are three-dimensional image cubes where each data sample is actually a pixel vector. So, the data processing carried out in this manner can be viewed as hyperspectral image processing in Category A where two types of correlation are of interest. One is correlation provided by data samples in terms of their spatial locations while paying no attention to interband spectral correlation. This is generally referred to as intersample spatial correlation and is commonly used in traditional image processing to develop spatial domain-based algorithms to perform tasks such as edge detection, region growing, clustering, segmentation, etc. Early data processing for remote sensing data, for example, geographical information system (GIS), belongs to such an approach, which can be considered as spatial domain-based data analysis. The other type of correlation is provided by data samples regardless of their spatial location. It is a complete opposite to the above-mentioned intersample spatial correlation and can be referred to as intrasample spectral correlation which has been explored and investigated in great detail in Chapter 17. The key difference between these two types of correlation can be well explained by the following example. Let img be a set of N data sample vectors where img is the ith data sample vector, L is the total number of spectral bands, and the i indicates the ith spatial location of ri. Let img denote a permutation of N spatial locations, img. There are N! permutations of img with img denoting one data matrix with data samples arranged in a particular order of img specified by a permutation img. Now, the intersample correlation matrix and intrasample correlation matrix provided by SP are img and img, respectively. In comparison between these two sample correlation matrices, it is very clear that img is independent of permutations and remains the same, that is, img for any permutation P even though the samples reshuffled by permutations, while img will result in another different sample spatial correlation matrix if another permutation Q is used. That is, img if P and Q are two different permutations of img since it alters spatial correlation among data samples when data samples are re-arranged in different spatial coordinates. Nevertheless, intersample spatial correlation and intrasample spectral correlation share one thing in common, that is, they both produce sample statistics when N > 1. In this case, the intersample spatial correlation and intrasample spectral correlation are more specifically referred to as sample intersample spatial correlation and sample intrasample spectral correlation to indicate the vital role of sample size plays in generation of sample statistics by img. Throughout this book, only sample intrasample spectral correlation is of interest, specifically, interband spectral information (IBSI) introduced in Chapter 17. Three scenarios can be developed to effectively use such IBSI(S) with various sizes of training sample pool S and can be described as follows:

1. The size of S, N = |S|, is very large: In this scenario, the sample pool is the entire data sample vectors. This type of scenario is generally to use IBSI(S) to perform background suppression such as RX detector by Reed and Yu (1990) and subpixel detector such as CEM in Chang (2003a) where the S consists of entire data sample vectors.
2. The size of S, N = |S|, is relative small: This scenario generally makes use of IBSI(S) to perform subtle target detection as demonstrated in Chapter 17, to find targets of interest such as anomalies, endmembers, man-made objects, particularly, those targets which are generally scattered and may not be clustered together as a group or a region.
3. The size of S, N = |S|, is one, that is, N = 1: The third scenario is the case that no sample statistics is available, that is, S is a singleton made up of the data sample vector r itself only. Under such a circumstance, the only available data information is the intrasample spectral correlation IBSI(r) that can be used for data analysis. As a result, processing the data sample vector r becomes a one-dimensional (1D) signal processing of r along wavelengths, which is exactly the topic to be covered in the second category of this book, Category B: Hyperspectral Signal Processing.

Assume that a given data sample vector is a signature vector without reference to others (i.e., N = 1). What the best we can do is to explore as much spectral information across the entire wavelength range as possible to specify the data sample vector for spectral characterization. For this purpose, we can consider a general case that a spectral signature s(λ) is a 1D continuous-wavelength real-valued signal defined on a wavelength range of [a,b]. For s(λ) to be implemented in discrete signal processing, s(λ) must be sampled according to a sampling interval, Δλ. For example, for a HYDICE data sample vector, there are 210 spectral channels over [0.4 μm, 2.5 μm] with spectral resolution 10 nm, in which case, the sampling interval is Δλ = 10 nm where s(lΔλ) where img can be considered as the spectral value of the lth spectral channel to form a spectral signature vector given by img where L is the total number of samples corresponding to the total number of spectral channels. So, Δλ actually determines the spectral resolution. When the interval is relatively small, the spectral bands can be considered as contiguous spectral channels and the data sample vector is a hyperspectral signature. Otherwise, the spectral bands are considered as discrete spectral channels, and the data sample vector is a multispectral signature.

Once a spectral signature img is acquired; there are two ways to process the spectral value sl of the lth spectral channel in either a discrete manner, referred to as signal coding or in a continuous manner, referred to as signal estimation.

33.7.1 Signal Coding

Signal coding represents a signal by a finite number of discrete values, to be called code words. For example, a real value can be encoded as a binary representation used by computers as bits. In communications, coding is carried out by a quantizer with quantization levels specified by a desired set of discrete values. So, the simplest encoder or quantizer is sign detector, known as hard-limiter, which only detects the sign change of a signal or delta modulator with only two discrete values. In hyperspectral signal processing, the signals are represented by real-valued spectral values img across the wavelength range img and IBSI(s) is spectral correlation provided by L signals, img within the signature vector img. One earliest attempt to perform signal coding on img was the spectral analysis manager (SPAM) by Mazer et al. (1988), which is actually implemented as two sign detectors to detect a sign change between the current spectral signal sl and spectral mean and a sign change in spectral signals between the two adjacent bands, sl − 1 and sl + 1, of the current being processed band, sl. SPAM was later improved by spectral feature-based binary coding (SFBC) developed by Qian et al. (1996) by adding a third sign detector, which detects a sign change in spectral deviation from the spectral mean by a prescribed threshold. Since each sign detector requires one bit to dictate the sign change, SPAM and SFBC can be viewed as 2-bit encoder/qunatizer or 3-bit encoder/quantizer or sign detector, respectively. The ideas of SPAM and SFBC are further explored in Chapter 24 to derive many variations from information theory perspectives such as median partition (MP) binary coding, halfway partition (HP) binary coding, and equal probability partition (EPP) binary coding. While these binary coding methods may be effective on many occasions, they may not be so if the signature vector img has a sophisticated and complex spectrum across L spectral signals img. This is because binary coding only uses up to 2-bit memory to store spectral changes of two adjacent bands and may fail to capture and characterize their spectral behaviors. Nevertheless, these coding methods are still considered as memoryless coding since they are implemented as 2-bit or 3-bit quantizers as scalar quantizers (Gersho and Gray, 1992). In order to resolve this dilemma, a concept of vector coding similar to vector quantization (Gersho and Gray, 1992) is introduced in Chapter 25. One such vector coding is developed based on texture analysis, called spectral derivative feature coding (SDFC) where the sign detector used by binary coding is replaced with a spectral texture descriptor. Another is derived from arithmetic coding, called spectral feature probabilistic coding (SFPC) which can keep track of all spectral changes across the entire set of img. However, implementing vector coding generally requires high computational complexity. Chapter 26 further develops a progressive coding, called multiple-stage PCM (MPCM)-based progressive spectral signature coding (MPCM-PSSC), which takes advantages of simplicity of scalar coding in implementation as well as capability of vector coding in capturing spectral changes in img.

33.7.2 Signal Estimation

The basic goal of the above-mentioned signal coding is designed to generate a credible fingerprint of each of spectral signals, img so that these fingerprints provide sufficient information for their own identities. But such signal coding does not necessarily tell you what the real signal is. In other words, a signal identity and its fingerprint is one-to-one correspondence such as one-to-one identification between a person and his unique nickname, known as code words. Using a more specific example for illustration, let img be used for data transmission. When one of these signals is selected for transmission, it is its subscript instead of the signal itself being transmitted. According to information theory, if a fixed length coding is used, only img bits required to derive a set of L code words corresponding to fingerprints of the L signals, img for signal transmission without actually transmitting the signal itself. This is because a signal can be identified by its subscript through its code word used as its fingerprint. Such processing performed by signal coding is indeed 1D discrete-value signal processing and can be only used for hard-decision-made applications such as signal detection, discrimination, classification, identification, but certainly cannot be used for continuous value (real-valued) signal processing-based applications, which require soft decisions such as signal quantification, in which case signal coding does not suffice to characterize img provided by a signature vector. In order to address this issue, the encoder used by signal coding must be replaced by an operator that can reconstruct the signal to be processed for signal recovery. This is particularly important and critical in applications in chemical/biological agent detection where the agent concentration is crucial to determine various levels of threat and fatality. In this case, each of img must be approximated by its estimate img rather than its code word. PART VII in this book is particularly developed for this purpose where three approaches, variable number variable band selection for hyperspectral signal charcaterization in Chapter 27, Kalman filtering-based estimation for hyperspectral signals in Chapter 28, and wavelet representation for hyperspectral signals in Chapter 29, are investigated.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.144.143.31