A sparsity-based approach for spectral image target detection from compressive measurements acquired by the CASSI architecture [1]

David Alberto Boada Supelano; Héctor Miguel Vargas García; Jaime Octavio Albarracín Ferreira; Henry  Arguello Fuentes

Un enfoque basado en escasez para la detección de objetivos en imágenes espectrales a partir de medidas compresivas adquiridas por la arquitectura CASSI [2]

David Alberto Boada Supelano * david.boada@correo.uis.edu.co

Universidad Industrial de Santander, Colombia

Héctor Miguel Vargas García ****. hvargas121288@gmail.com

Universidad Industrial de Santander, Colombia

Jaime Octavio Albarracín Ferreira jaimealb@uis.edu.co

Universidad Industrial de Santander., Colombia

Henry Arguello Fuentes **** henarfu@uis.edu.co

Universidad Industrial de Santander, Colombia

A sparsity-based approach for spectral image target detection from compressive measurements acquired by the CASSI architecture [1]

Ingeniería y Universidad, vol. 21, no. 2, 2017

Pontificia Universidad Javeriana

Received: 04 July 2016

Accepted: 19 May 2017

DOI: https://doi.org/http://dx.doi.org/10.11144/Javeriana.iyu21-2.sasi

Abstract: Hyperspectral imaging requires handling a large amount of multidimensional spectral information. Hyperspectral image acquisition, processing and storage are computationally and economically expensive and, in most cases, slow processes. In recent years, optical architectures have been developed for acquisition of spectral information in compressed form by using a small set of measurements coded by a spatial modulator. This work formulates a processing scheme that allows the measurements acquired by such compressive sampling systems to be used to perform spectral detection of targets by adapting traditional detection algorithms for use in the compressive sampling model and shows that the performance is comparable with that obtained by detection processes without compression.

Keywords: Hyperspectral Imaging, Compressive Sensing, Target Detection, Sparsity Model.

Resumen: La adquisición y procesamiento de imágenes espectrales involucra el manejo de grandes cantidades de información espectral multidimensional. Su adquisición, procesamiento y almacenamiento son costosos temporal, computacional y económicamente. En los últimos años se han desarrollado arquitecturas ópticas para la adquisición de información espectral de forma comprimida usando un conjunto reducido de mediciones codificadas por un modulador espacial. Este trabajo busca formular un esquema de procesamiento que permita utilizar las mediciones adquiridas por dichos sistemas de muestreo compresivo para efectuar detección espectral de objetivos, se adaptaran algoritmos de detección tradicionales para ser usados en el modelo de muestreo compresivo y se mostrara que su desempeño es comparable al obtenido en procesos de detección sin compresión. alabras Clave Imágenes hiperespectrales; Muestreo compresivo; Detección de objetivos; Modelo de escasez Introduction

Palabras clave: Imágenes hiperespectrales, Muestreo compresivo, Detección de objetivos, Modelo de escasez.

Introduction

Over the last several decades, the development of optical sensors has facilitated remote sensing analysis with rich spatial, spectral, and temporal information. The increase in the spectral resolution of hyperspectral images (HSI) and infrared sounders has led to new application domains and poses new methodological challenges in data analysis. HSI allows characterization of the objects of interest (e.g., land-cover classes) with unprecedented accuracy and aids in keeping inventories up to date. Furthermore, improvements in spectral resolution have necessitated advances in signal processing and exploitation algorithms [1].

Hyperspectral image classification and target detection are among the most important

problems in various scientific disciplines, such as machine learning [2] image processing, and computer vision. Several critical issues should be considered in the classification of hyperspectral data. For instance, the high number of spectral channels and low number of labeled training samples lead to the curse-of-dimensionality problem (i.e., the Hughes phenomenon [3]) and result in the risk of overfitting of the training data [4]. To alleviate the problems that come with the great dimensionality of data, the spatial variability of spectral information, and the high cost of true sample labeling and to enhance the numerical and stability, a variety of approaches have been proposed [5]. In general, these approaches take advantage of the inherent sparsity in a certain basis of the natural signals, whereby they can be approximately represented by a few coefficients that carry the most relevant information [6]. Applications of sparse representations in computer vision and pattern recognition can be found in various fields, including motion segmentation [7] image super-resolution [8], image restoration [9], and discriminative tasks such as face recognition [10], iris recognition [11], tumor classification [12], and HSI classification [13]. In these applications, the use of sparsity as a prior condition often leads to state-of-the-art performance. Furthermore, the sparse nature of spectral imagery can be exploited when classifying images that were acquired using compressive spectral imaging systems, which require fewer measurements than do systems attained with traditional hyperspectral imaging sensors [14][15].

The coded-aperture snapshot spectral imaging (CASSI) system depicted in Figure 1 is a compressive hyperspectral imager that is used to acquire compressive spectral measurements. The CASSI system simultaneously encodes spatial and spectral information of a scene in a small set of coded focal plane array (FPA) measurements [16].

Figure 1
Coded-aperture snapshot spectral imaging (CASSI) system [17]
Source: Authors' own elaboration

The main elements of the CASSI system are the coded apertures, a dispersive element, and the sensor responsible for capturing the energy of the scene encoded. The coded apertures are matrix arrays composed of translucent optical elements that block or unblock the path of light through the system. The dispersive elements (usually prisms or gratings) are responsible for splitting the light into its wavelengths. The quality of the images acquired by the system depend on three main factors: the percentage of translucent elements that allow light into the apertures (commonly known as transmittance), the size of the data cube and the compression rate [17].

Mathematically, CASSI projections measured in the i-th shot can be treated as shown in Figure 2 and are described by y=Hf+ , where H is a N(N+L+1)x(N²L) matrix whose structure is determined by the coded aperture entries and the dispersive element effect. For spectrally rich or spatially detailed images, a single-shot FPA measurement is not sufficient to achieve proper quality reconstructions, and additional shots are required. The CASSI architecture is capable of admitting multiple snapshots, each with a different coded aperture pattern, thus yielding a less ill-posed inverse problem and improved signal reconstructions [18]. The set of L FPA measurements is given by y=Hf+ , where y=[y^T₀,...,H^T_k-1]^Tis the one-dimensional vectorized form of all FPA shots,H=[H^T₀,...,H^T_k-1]^T R^{M(N+L+1)x(N2L)} is the sensing matrix, and is the additive noise of the sensing system. The spectral data cube is reconstructed as , is an S-sparse representation of f on the basis and is a regularization constant [19].

Figure 2
Matrix representation of the compressive sensing process
Source: Authors' own elaboration

Among the main limitations of the CASSI system are the mixture of spectral information with spatial information due to spectral shifting and the way in which the energy is integrated within the detector. In other architectures, only the spectral information is mixed [20]. In addition, the number of spectral bands available is limited by the size of the detector, . However, this example is one of the most broadly studied compressive spectral architectures and has been used in several applications [19][21] [22], which is why it was selected for this work.

This paper focuses on designing a target detection model that uses compressive measurements to find a sparse representation of image pixels from spectral information-based dictionaries. In addition, an algorithm is implemented that determines whether the evaluated pixel is a target pixel. The proposed algorithm is based on a joint sparsity model, where every f _i pixel is approximately represented by a few sets of training signatures among the entire training dictionary. This dictionary is composed of subdictionaries of the target and background signatures. The sparse vector represents the atoms in the training dictionary, and their associated weights for each pixel can be recovered from the CASSI compressive measurements by solving a sparsity-constrained optimization problem. This process is used to determine whether the observation pixel is a target pixel.

Spectral image target detection using a sparsity model

Traditional spectral target detection based on sparsity takes advantage of the fact that any pixel in an image can be sparsely represented using a trained dictionary composed of the selected target and the background pixels. Considering the existent spatial correlation between each pixel and its spatial neighborhood, we can model a linear problem as follows:

[equation 1.1]

where F is the neighborhood consisting of T number of pixels and A is the sparse coefficient matrix of the pixels represented in the subspace spanned by M.

With a proper pixel-wise sparse representation, the goal of the detection task is to apply a detection function to each pixel in the image as follows:

[equation 1.2]

where A^b consists of the first N_b rows of the matrix A corresponding to the background subdictionary M^b and A^t consists of the remaining rows in that correspond to the target subdictionary M^t If the output D(x) is greater than a fixed threshold, then the test sample is labeled a target; otherwise, it is labeled background. Further details on matrix estimation and sparse representation are explicitly presented in previous works [23] and [24].

Proposed model

A principal component transformation can be performed on the structured dictionary M R^LxTsuch that an orthonormal basis is obtained from the set of N_T training vectors. From [19], we know that the orthonormal basis can be formed by the set of eigenvectors of the correlation matrix C R^LxL given by the following:

[equation 1.3]

such that =VT . Principal component analysis (PCA) is one of the most frequently used approaches for hyperspectral dimensionality reduction and compression in HIS because it preserves the meaningful information of the image in a few of its components. Such basis transformation was successfully performed for classification in [19]. Thus, the original sparse vector α representing the test pixel f can be transformed into a new sparse vector , which represents the pixel in the orthogonal basis . The observation pixel f_ican now be expressed as

[equation 1.4]

Using the sparse representation of the pixels in the training basis , the compressive CASSI measurements can be rewritten as

[equation 1.5]

where , identity matrix and is the Kronecker product operator. Additionally, , where corresponds to the i-th PCA coefficient of the i-th pixel.

In Eq. (1.5), is the noise of the system, and H is the CASSI sensing matrix. The proposed algorithm first finds an estimate of the sparse vector directly from the FPA measurements by solving the sparsity-constrained optimization problem given by

[equation 1.6]

where the norm accounts for the sparsity constraint and the error norm finds the closest sparse vector to the optimal CASSI compressive measurements. A variety of algorithms have been used in the literature to solve problems similar to the one stated in Eq. (1.6), such as the -regularized least square solution via the interior point method [25] or, in this case, the gradient projection for sparse reconstruction (GPSR) [26]. Spatial interpixel correlation can be included in the optimization problem given in Eq. (1.6), by replacing the sparsifying basis with , where is the 2D wavelet basis dictionary used in [21].

The sparse target detection model proposed in this paper requires the detection algorithm introduced in Eq. (1.2) to operate over the subspace described in Eq. (1.5) and Eq. (1.6). This process is depicted in Algorithm 1, where , is an identity matrix, ⊗ is the Kronecker product operator, is the sparse vector obtained in Eq. (1.6)and is the sparse matrix of all spectral bands in the basis .

Algorithm 1

Source: Authors' own ellaboration

The target detector is based on a joint sparse model to extract the contextual information in HSI. In particular, it is assumed that pixels for the same material in a region share a common sparsity pattern. Thus, similar neighboring pixels can be sparsely represented by a linear combination of a few shared atoms:

[equation 1.7]

where is the sparse representation of pixels in a spatial neighborhood formed by T neighboring pixels of the i-th test pixel. From the estimated matrix , the joint sparsity problem can be formulated as

[equation 1.8]

where is a joint sparse matrix with only K non-zero rows, K₀ denotes the sparsity level of and ||.||_F denotes the Frobenius norm. Once the sparse matrix is obtained, the label of the test pixel is determined using the minimal total residual.

[equation 1.9]

where and consist of the N_b and N_t rows in that are associated with the background and target sub-dictionaries and , respectively.

Computer Simulations and Results

The performance of the proposed sparsity model in target detection is evaluated from compressive CASSI measurements obtained from two spectral images. The first image is the self-test dataset from the RIT-CIS-DIRT project [27], and the data of this image, as shown in Fig 2, were collected as a component of a field experiment conducted in July 2006, near the small town of Cooke City, MT, USA. The hyperspectral imagery was collected using the HyMap sensor operated by HyVista, with approximately 3-meter ground resolution. The sensor generates 128 bands across the reflective solar wavelength region of 0.45-2.5 μm, with contiguous spectral coverage (except in the atmospheric water vapor bands) and bandwidths between 15 and 20 μm. A small fabric panel was used as target, and its reflectance spectra were measured by a Cary 500 spectrophotometer in the laboratory.

The second image is the EO1H0070552014301110PF-SG1-01 spectral image, as shown in Fig. 3, collected by the EO-1 Hyperion sensor on October 28, 2014 in the region of Mogotes, Santander, Colombia. The spectral imagery has approximately 30-meter resolution, and only a small patch of the whole image is used in the experiments. The image is composed mostly of cultivated and ready-to-cultivate fields. The sensor is capable of resolving 220 spectral bands (from 0.4 to 2.5 μm) with a 30-meter resolution. In both images, the model of the multishot CASSI system [16] is used to obtain a set of FPA compressive measurements using different numbers of shots corresponding to an approximate percentage of sensed information of the image.

The target detector proposed in Algorithm 1 is used over the representation of the image. The sparse vector that solves the problem formulated in Eq. (1.6) is obtained using the GPSR algorithm proposed in [26]

The sparsity-constrained problem in Eq. (1.8) is solved using the SOMP algorithm, with a fixed sparsity level K₀=4 and a joint sparsity neighborhood with T=9. The estimated matrix is later used to calculate the score matrix D, whose entries determine the probability that the pixel area is a target, as shown in Eq. (1.9)

The proposed algorithm is compared with three target detection algorithms for hyperspectral images, i.e., Adaptive Matched Subspace Detector (AMSD), Orthogonal Subspace Projection (OSP), and Constrained Energy Minimization (CEM), which are available in the Matlab signal-processing toolbox. These algorithms were used without compression using 100% of the spectral data

The results are analyzed both visually and quantitatively using the receiver operating characteristic (ROC) curves, as shown Graph 1 in and Graph 2. The ROC curve describes the probability of detection (PD) as a function of the probability of false alarms (PFA). To calculate the ROC curve, we pick thousands of thresholds between the minimum and maximum of the detector output. The target or background labels for all pixels in the test region are determined at each threshold. The PFA is calculated by using the number of false alarms (background pixels determined as target) over the total number of pixels in the test region, and the PD is the ratio of the number of hits (target pixels determined as target) and the total number of true target pixels.

RIT-CIS-DIRT Dataset: Cooke City

The first spectral image used to test the performance of the classifier is the Cooke City self-test [27]. In the following simulations, the number of bands is reduced to 90 (3rd-46th, 49th, 51st-62nd, 66th, 69th-72nd, and 86th-122nd) by eliminating 38 absorption and low-SNR bands. This image has a spatial resolution of 3m per pixel and a spatial dimension of 800×200 pixels.

The proposed algorithm was tested by varying two specific parameters: the compression level that we expect to achieve and the transmittance level of the coded apertures. The numerical results in Table 1 show the area under the curve (AUC) of the detector under the selected transmittance level and compression rate.

Table 1

AUC of the proposed method under different configurations

Source: Authors' own ellaboration

The detection results for the proposed algorithm and the comparative results from other detection algorithms are shown in Figure 3. For additional clarity, the ROC curves of the algorithms are displayed in Graph 1.

Figure 3
Detection output of the tested algorithms
Source: Authors' own ellaboration

Based on these results, it is reasonable to conclude that the proposed method achieves a performance similar to that of the target detection algorithms used in traditional spectral imaging.

Hyperion Image

The spatial dimensions used for this spectral image is 32×32. A set of 10 target pixels was taken from the image for use as training samples. A set of 206 pixels was also taken from the image for use as test pixels to be assigned as either target or background. As in the previous experiments, the GPRS algorithm was used to solve Eq. (1.6).

The results of the simulations performed in this image are shown in Fig. 4 and Graph. 2. As in the previous image, Fig 4 shows a grayscale view of the image processed together with the ground truth of the selected target and the outcomes of the proposed algorithm using 20% of the data cube and 40% transmittance.

Similarly, Graph 2 shows the behavior of the proposed algorithm compared with traditional approaches using ROC curves. The AUC corresponds to the probability of detection of the algorithm. In addition, the numerical results are shown in Table 2. It can be observed that the proposed algorithm obtains a probability of detection near when using only 15% of the spectral information in the images with a well-defined target spatial location.

Table 2

AUC of the proposed method under different configurations

Source: Authors' own ellaboration

Figure 4
Detection output of the tested algorithms
Source: Authors' own ellaboration

4. Conclusion

This work proposes a spectral image target detector that directly labels each spectral pixel as either target or background from a set of compressive CASSI measurements. This detector uses the sparsity of spectral pixels in a given training basis. The sparse vector representing each pixel in the training basis is recovered from the CASSI measurements and is then used to determine whether or not the test pixel is a target pixel. The interpixel correlation in HSI is incorporated using a joint sparsity model, where the pixels in a small neighborhood in the test image are represented by a linear combination of a few common training samples weighted with a different set of coefficients for each pixel. The resulting sparse representations are used directly for target detection. The proposed detection method achieves a probability of detection of 98.92% if only 40% of the spectral information is used. A transmittance level between 10% and 30% produces the most accurate results.

References

[1] J.-S. Pan, P.-W. Tsai, and H.-C. Huang, Advances in Intelligent Information Hiding and Multimedia Signal Processing. 2016.

[2] R. Singh, M. Vatsa, A. Majumdar, and A. Kumar, Machine Intelligence and Signal Processing. 2016.

[3] G. Hughes, “On the mean accuracy of statistical pattern recognizers,” IEEE Trans. Inf. Theory, vol. 14, no. 1, 1968.

[4] L. Zhang, W. Wei, Y. Zhang, C. Shen, A. Van Den Hengel, and Q. Shi, “Dictionary Learning for Promoting Structured Sparsity in Hyperspectral Compressive Sensing,” IEEE Trans. Geosci. Remote Sens., vol. PP, no. 99, pp. 7223–7235, 2016.

[5] A. M. Bruckstein, D. L. Donoho, and M. Elad, “From Sparse Solutions of Systems of Equations to Sparse Modeling of Signals and Images,” SIAM Rev., vol. 51, no. 1, pp. 34–81, 2009.

[6] M. F. Duarte and Y. C. Eldar, “Structured Compressed Sensing : From Theory to Applications,” IEEE Trans. Signal Process., vol. 59, no. 9, 2011.

[7] M. Ying Yang, S. Feng, H. Ackermann, and B. Rosenhahn, “Global and Local Sparse Subspace Optimization for Motion Segmentation,” ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci., vol. II-3/W5, pp. 475–482, 2015.

[8] Z. Wang, D. Liu, J. Yang, W. Han, and T. Huang, “Deep networks for image super-resolution with sparse prior,” Proc. IEEE Int. Conf. Comput. Vis., vol. 11–18–Dece, pp. 370–378, 2016.

[9] W. Dong, G. Shi, Y. Ma, and X. Li, “Image Restoration via Simultaneous Sparse Coding: Where Structured Sparsity Meets Gaussian Scale Mixture,” Int. J. Comput. Vis., pp. 1–16, 2015.

[10] X. Jiang and J. Lai, “Sparse and dense hybrid representation via dictionary decomposition for face recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 37, no. 5, pp. 1067–1079, 2015.

[11] J. K. Pillai, V. Patel, R. Chellappa, and N. Ratha, “Robust and Secure Iris Recognition,” in Handbook of Iris Recognition, 2016.

[12] S. D. S. Al-Shaikhli, M. Y. Yang, and B. Rosenhahn, “Brain tumor classification and segmentation using sparse coding and dictionary learning,” in Biomedical Engineering / Biomedizinische Technik, 2016.

[13] G. Camps-Valls, T. V Bandos Marsheva, and D. Zhou, “Semi-supervised graph-based hyperspectral image classification,” IEEE Trans. Geosci. Remote Sens., vol. 45, no. 10, pp. 3044–3054, 2007.

[14] L. Fang, S. Li, X. Kang, and S. Member, “Spectral – Spatial Hyperspectral Image Classification via Multiscale Adaptive Sparse Representation,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 12, pp. 7738–7749, 2014.

[15] W. Li, S. Prasad, and J. E. Fowler, “Classification and reconstruction from random projections for hyperspectral imagery,” IEEE Trans. Geosci. Remote Sens., vol. 51, no. 2, pp. 833–843, 2013.

[16] H. Arguello and G. R. Arce, “Colored coded aperture design by concentration of measure in compressive spectral imaging,” IEEE Trans. Image Process., vol. 23, no. 4, pp. 1896–1908, 2014.

[17] G. R. G. R. . Arce, D. J. Brady, L. Carin, H. Arguello, and ] D. S. K., “Compressive coded aperture imaging,” IEEE Signal Process. Mag., vol. 1, no. 919, pp. 105–115, 2014.

[18] H. Arguello and G. R. Arce, “Code aperture optimization for spectrally agile compressive imaging,” J. Opt. Soc. Am. A, vol. 28, no. 11, p. 2400, 2011.

[19] A. Ramirez, S. Member, H. Arguello, S. Member, G. R. Arce, and B. M. Sadler, “Spectral Image Classification From Optimal Coded-Aperture Compressive Measurements,” vol. 52, no. 6, pp. 3299–3309, 2014.

[20] X. Lin, Y. Liu, J. Wu, and Q. Dai, “Spatial-spectral encoded compressive hyperspectral imaging,” ACM Trans. Graph., vol. 33, no. 6, 2014.

[21] A. Ramirez, G. R. Arce, and B. M. Sadler, “Spectral Image Unmixing From Optimal Measurements,” vol. 53, no. 1, pp. 405–415, 2015.

[22] H. Rueda, H. Arguello, and G. R. Arce, “DMD-based implementation of patterned optical filter arrays for compressive spectral imaging,” J. Opt. Soc. Am. A, vol. 32, no. 1, p. 80, 2014.

[23] Y. Chen, N. M. Nasrabadi, and T. D. Tran, “Simultaneous joint sparsity model for target detection in hyperspectral imagery,” IEEE Geosci. Remote Sens. Lett., vol. 8, no. 4, pp. 676–680, 2011.

[24] Y. Chen, N. M. Nasrabadi, and T. D. Tran, “Sparse Representation for Target Detection in Hyperspectral Imagery,” Sel. Top. Signal Process. IEEE J., vol. 5, no. 3, pp. 629–640, 2011.

[25] S. Kim, “An Interior-Point Method for Large-Scale Logistic Regression,” J. Mach. Learn. Res., vol. 8, pp. 1519–1555, 2007.

[26] M. Figueiredo, R. Nowak, and S. J. Wright, “Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems,” IEEE J. Sel. Top. Signal Process., vol, no. 1, p. 1pp586--597, 2007.

[27] D. Snyder, J. Kerekes, I. Fairweather, R. Crabtree, J. Shive, and S. Hager, “Development of a web-based application to evaluate target finding algorithms,” Int. Geosci. Remote Sens. Symp., vol. 2, no. 1, pp. 915–918, 2008.

Notes

[1] This paper is derived from a research project named “Detection and Classification in spectral images obtained through a compressive acquisition system with a single pixel detector” with VIE code 1802. Developed by the HDSP research group of Universidad Industrial de Santander, Bucaramanga, Colombia

[2] Este artículo se deriva de un proyecto de investigación denominado” Detección y Clasificación en imágenes espectrales obtenidas a través de un sistema de adquisición compresivo con un detector de un solo pixel” con código VIE 1802. Desarrollado por el grupo de investigación HDSP de la Universidad Industrial de Santander, Bucaramanga, Colombia.

Author notes

* David Boada: received the B.S.E. degree from the Universidad Industrial de Santander, Santander, Colombia, in 2016. He is currently working on his M.Sc. degree in Applied Mathematics the Universidad Industrial de Santander, Santander, Colombia.

****. Hector Vargas: received the B.E.E. degree from the Universidad Industrial de Santander, Santander, Colombia, in 2012 and the M.Sc. degree in electrical engineering from the Universidad Industrial de Santander, Santander, Colombia, in 2016. He is currently working toward the Ph.D. degree in the Department of Electrical and Computer Engineering in the Universidad Industrial de Santander, Santander, Colombia.

**** Henry Arguello: received the M.S. degree in electrical engineering from the Universidad Indus- trial de Santander, Santander, Colombia, in 2003 and the Ph.D. degree in electrical and computer engineering from the University of Delaware, Newark, DE, USA, in 2013. He is full-time professor with the Systems Engineering Department at Universidad Industrial de Santander.