A sparsity-based approach for spectral image target detection from compressive measurements acquired by the CASSI architecture [1]
Un enfoque basado en escasez para la detección de objetivos en imágenes espectrales a partir de medidas compresivas adquiridas por la arquitectura CASSI [2]
Ingeniería y Universidad, vol. 21, no. 2, 2017
Pontificia Universidad Javeriana
Received: 04 July 2016
Accepted: 19 May 2017
Abstract: Hyperspectral imaging requires handling a large amount of multidimensional spectral information. Hyperspectral image acquisition, processing and storage are computationally and economically expensive and, in most cases, slow processes. In recent years, optical architectures have been developed for acquisition of spectral information in compressed form by using a small set of measurements coded by a spatial modulator. This work formulates a processing scheme that allows the measurements acquired by such compressive sampling systems to be used to perform spectral detection of targets by adapting traditional detection algorithms for use in the compressive sampling model and shows that the performance is comparable with that obtained by detection processes without compression.
Keywords: Hyperspectral Imaging, Compressive Sensing, Target Detection, Sparsity Model.
Resumen: La adquisición y procesamiento de imágenes espectrales involucra el manejo de grandes cantidades de información espectral multidimensional. Su adquisición, procesamiento y almacenamiento son costosos temporal, computacional y económicamente. En los últimos años se han desarrollado arquitecturas ópticas para la adquisición de información espectral de forma comprimida usando un conjunto reducido de mediciones codificadas por un modulador espacial. Este trabajo busca formular un esquema de procesamiento que permita utilizar las mediciones adquiridas por dichos sistemas de muestreo compresivo para efectuar detección espectral de objetivos; se adaptarán algoritmos de detección tradicionales para ser usados en el modelo de muestreo compresivo y se mostrará que su desempeño es comparable al obtenido en procesos de detección sin compresión.
Palabras clave: Imágenes hiperespectrales, Muestreo compresivo, Detección de objetivos, Modelo de escasez.
Introduction
Over the last several decades, the development of optical sensors has facilitated remote sensing analysis with rich spatial, spectral, and temporal information. The increase in the spectral resolution of hyperspectral images (HSI) and infrared sounders has led to new application domains and poses new methodological challenges in data analysis. HSI allows characterization of the objects of interest (e.g., land-cover classes) with unprecedented accuracy and aids in keeping inventories up to date. Furthermore, improvements in spectral resolution have necessitated advances in signal processing and exploitation algorithms [1].
Hyperspectral image classification and target detection are among the most important problems in various scientific disciplines, such as machine learning [2], image processing, and computer vision. Several critical issues should be considered in the classification of hyperspectral data. For instance, the high number of spectral channels and low number of labeled training samples lead to the curse-of-dimensionality problem (i.e., the Hughes phenomenon [3]) and result in the risk of overfitting the training data [4]. To alleviate the problems that come with the high dimensionality of the data, the spatial variability of spectral information, and the high cost of true sample labeling, and to enhance numerical stability, a variety of approaches have been proposed [5]. In general, these approaches take advantage of the inherent sparsity of natural signals in a certain basis, whereby they can be approximately represented by a few coefficients that carry the most relevant information [6]. Applications of sparse representations in computer vision and pattern recognition can be found in various fields, including motion segmentation [7], image super-resolution [8], image restoration [9], and discriminative tasks such as face recognition [10], iris recognition [11], tumor classification [12], and HSI classification [13]. In these applications, the use of sparsity as a prior condition often leads to state-of-the-art performance. Furthermore, the sparse nature of spectral imagery can be exploited when classifying images that were acquired using compressive spectral imaging systems, which require fewer measurements than traditional hyperspectral imaging sensors do [14][15].
The coded-aperture snapshot spectral imaging (CASSI) system depicted in Figure 1 is a compressive hyperspectral imager that is used to acquire compressive spectral measurements. The CASSI system simultaneously encodes spatial and spectral information of a scene in a small set of coded focal plane array (FPA) measurements [16].
![Coded-aperture snapshot spectral imaging (CASSI) system [17]](../47751131005_gf1.png)
The main elements of the CASSI system are the coded apertures, a dispersive element, and the sensor responsible for capturing the encoded energy of the scene. The coded apertures are matrix arrays composed of translucent optical elements that block or unblock the path of light through the system. The dispersive elements (usually prisms or gratings) are responsible for splitting the light into its wavelengths. The quality of the images acquired by the system depends on three main factors: the percentage of translucent elements that allow light through the apertures (commonly known as transmittance), the size of the data cube, and the compression rate [17].
Mathematically, the CASSI projections measured in the $i$-th shot can be treated as shown in Figure 2 and are described by $y_i = H_i f + \omega_i$, where $H_i$ is an $N(N+L+1) \times N^2 L$ matrix whose structure is determined by the coded aperture entries and the dispersive element effect. For spectrally rich or spatially detailed images, a single-shot FPA measurement is not sufficient to achieve proper quality reconstructions, and additional shots are required. The CASSI architecture is capable of admitting multiple snapshots, each with a different coded aperture pattern, thus yielding a less ill-posed inverse problem and improved signal reconstructions [18]. The set of $K$ FPA measurements is given by $y = Hf + \omega$, where $y = [y_0^T, \ldots, y_{K-1}^T]^T$ is the one-dimensional vectorized form of all FPA shots, $H = [H_0^T, \ldots, H_{K-1}^T]^T \in \mathbb{R}^{KN(N+L+1) \times N^2 L}$ is the sensing matrix, and $\omega$ is the additive noise of the sensing system. The spectral data cube is reconstructed as $\hat{f} = \Psi \left( \arg\min_{\theta} \|y - H\Psi\theta\|_2^2 + \tau \|\theta\|_1 \right)$, where $\theta$ is an $S$-sparse representation of $f$ on the basis $\Psi$ and $\tau$ is a regularization constant [19].
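The measurement process described above can be illustrated with a small simulation. The following is a minimal sketch, not the authors' simulator: it assumes a binary random coded aperture, a dispersive element that shifts each band by exactly one detector column, and a noiseless sensor. The function names `cassi_shot` and `cassi_measurements` are hypothetical. Under the one-column-per-band assumption the detector in this sketch has N + L - 1 columns, which is a property of the sketch and need not match the matrix dimensions quoted in the text.

```python
import numpy as np

def cassi_shot(cube, code):
    """Simulate one snapshot of an N x N x L cube (assumed one-column shift per band)."""
    n_rows, n_cols, n_bands = cube.shape
    # The detector is wider than the scene because each band lands at a different offset.
    detector = np.zeros((n_rows, n_cols + n_bands - 1))
    for k in range(n_bands):
        coded_band = cube[:, :, k] * code         # coded aperture blocks or passes each pixel
        detector[:, k:k + n_cols] += coded_band   # dispersion shifts band k by k columns
    return detector

def cassi_measurements(cube, transmittance, n_shots, seed=0):
    """Stack several snapshots, each taken with a different random coded aperture."""
    rng = np.random.default_rng(seed)
    n_rows, n_cols, _ = cube.shape
    shots = []
    for _ in range(n_shots):
        # Each aperture element is open with probability equal to the transmittance.
        code = (rng.random((n_rows, n_cols)) < transmittance).astype(float)
        shots.append(cassi_shot(cube, code).ravel())
    return np.concatenate(shots)                  # vectorized form of all FPA shots

cube = np.random.default_rng(1).random((8, 8, 4))  # toy data cube
y = cassi_measurements(cube, transmittance=0.4, n_shots=2)
print(y.shape)
```

Each shot contributes 8 x 11 measurements here, so two shots sense 176 values for a cube of 256 voxels. With a fully open aperture the detector integrates all of the scene energy, which gives a simple sanity check for the shifting logic.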

Among the main limitations of the CASSI system are the mixture of spectral information with spatial information due to spectral shifting and the way in which the energy is integrated within the detector. In other architectures, only the spectral information is mixed [20]. In addition, the number of spectral bands available is limited by the size of the detector. However, this system is one of the most broadly studied compressive spectral architectures and has been used in several applications [19][21][22], which is why it was selected for this work.
This paper focuses on designing a target detection model that uses compressive measurements to find a sparse representation of image pixels from spectral information-based dictionaries. In addition, an algorithm is implemented that determines whether the evaluated pixel is a target pixel. The proposed algorithm is based on a joint sparsity model, where every pixel $f_i$ is approximately represented by a few training signatures from the entire training dictionary. This dictionary is composed of subdictionaries of the target and background signatures. The sparse vector, which selects the atoms in the training dictionary and their associated weights for each pixel, can be recovered from the CASSI compressive measurements by solving a sparsity-constrained optimization problem. This process is used to determine whether the observation pixel is a target pixel.
Spectral image target detection using a sparsity model
Traditional spectral target detection based on sparsity takes advantage of the fact that any pixel in an image can be sparsely represented using a trained dictionary composed of selected target and background pixels. Considering the existing spatial correlation between each pixel and its spatial neighborhood, we can model a linear problem as follows:
[equation 1.1]
where $F$ is the neighborhood consisting of $T$ pixels and $A$ is the sparse coefficient matrix of the $T$ pixels represented in the subspace spanned by $M$.
With a proper pixel-wise sparse representation, the goal of the detection task is to apply a detection function to each pixel in the image as follows:
[equation 1.2]
where $A_b$ consists of the first $N_b$ rows of the matrix $A$ corresponding to the background subdictionary $M_b$ and $A_t$ consists of the remaining rows in $A$ that correspond to the target subdictionary $M_t$. If the output $D(x)$ is greater than a fixed threshold, then the test sample is labeled a target; otherwise, it is labeled background. Further details on matrix estimation and sparse representation are presented in previous works [23] and [24].
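For orientation, the linear model and the detection function described in this section take the following form in the joint sparsity formulation of [23]. This is a sketch under the assumption that Eqs. (1.1) and (1.2) follow that formulation; the symbols are those defined in the text, and the threshold $\delta$ is introduced here only for illustration.

```latex
% Linear model for a neighborhood F of T pixels, with M = [M_b  M_t]
F = M A =
\begin{bmatrix} M_b & M_t \end{bmatrix}
\begin{bmatrix} A_b \\ A_t \end{bmatrix}

% Detector output: difference between background and target residuals
D(x) = \| F - M_b A_b \|_F - \| F - M_t A_t \|_F ,
\qquad
\text{target if } D(x) > \delta , \text{ background otherwise}
```

A large background residual together with a small target residual gives a large positive $D(x)$, which is why values above the threshold are labeled as target.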
Proposed model
A principal component transformation can be performed on the structured dictionary $M \in \mathbb{R}^{L \times N_T}$ such that an orthonormal basis $\Psi$ is obtained from the set of $N_T$ training vectors. From [19], we know that the orthonormal basis $\Psi$ can be formed by the set of eigenvectors of the correlation matrix $C \in \mathbb{R}^{L \times L}$ given by the following:
[equation 1.3]
such that $\Psi = V^T$. Principal component analysis (PCA) is one of the most frequently used approaches for hyperspectral dimensionality reduction and compression in HSI because it preserves the meaningful information of the image in a few of its components. Such a basis transformation was successfully performed for classification in [19]. Thus, the original sparse vector $\alpha$ representing the test pixel $f$ can be transformed into a new sparse vector $\theta$, which represents the pixel in the orthogonal basis $\Psi$. The observation pixel $f_i$ can now be expressed as
[equation 1.4]
Using the sparse representation of the pixels in the training basis $\Psi$, the compressive CASSI measurements can be rewritten as
[equation 1.5]
where $\bar{\Psi} = \Psi \otimes I$, $I$ is an identity matrix, and $\otimes$ is the Kronecker product operator. Additionally, $\theta$ collects the PCA coefficients of all pixels, where $\theta_{j,i}$ corresponds to the $j$-th PCA coefficient of the $i$-th pixel.
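The construction of the PCA basis and its Kronecker lifting can be checked numerically on toy data. The sketch below makes several assumptions that are not taken from the source: the correlation matrix is taken as M M^T / N_T, the eigenvectors are stored as the columns of `Psi` and used as the synthesis basis (the transpose convention in the text may differ), and the cube is vectorized band by band. All variable names are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
L, N_T, n_pix = 6, 20, 9           # bands, training signatures, pixels (toy sizes)

M = rng.random((L, N_T))           # hypothetical training dictionary, one signature per column
C = (M @ M.T) / N_T                # assumed L x L correlation matrix of the signatures
eigvals, V = np.linalg.eigh(C)     # C is symmetric, so eigh yields orthonormal eigenvectors
order = np.argsort(eigvals)[::-1]  # strongest principal component first
Psi = V[:, order]                  # columns of Psi are the PCA basis vectors

# Per-pixel model: each spectral pixel is f_i = Psi @ theta_i.
Theta = rng.standard_normal((L, n_pix))  # column i holds the PCA coefficients of pixel i
F = Psi @ Theta                          # L x n_pix matrix of spectral pixels

# Whole-cube model: vectorize band by band and lift the basis with a Kronecker product.
f = F.reshape(-1)                  # band 0 of every pixel, then band 1, and so on
theta = Theta.reshape(-1)          # coefficients in the same ordering
Psi_bar = np.kron(Psi, np.eye(n_pix))

print(np.allclose(Psi.T @ Psi, np.eye(L)))  # checks that the basis is orthonormal
print(np.allclose(Psi_bar @ theta, f))      # checks that both models give the same cube
```

The second check is the reason the identity matrix appears in the lifted basis: the same spectral basis is applied independently at every spatial location.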
In Eq. (1.5), $\omega$ is the noise of the system, and $H$ is the CASSI sensing matrix. The proposed algorithm first finds an estimate of the sparse vector $\theta$ directly from the FPA measurements by solving the sparsity-constrained optimization problem given by
[equation 1.6]
where the $\ell_1$ norm accounts for the sparsity constraint and the error norm finds the sparse vector closest to the optimal CASSI compressive measurements. A variety of algorithms have been used in the literature to solve problems similar to the one stated in Eq. (1.6), such as the $\ell_1$-regularized least squares solution via the interior point method [25] or, in this case, the gradient projection for sparse reconstruction (GPSR) [26]. Spatial interpixel correlation can be included in the optimization problem given in Eq. (1.6) by replacing the sparsifying basis $\Psi \otimes I$ with $\Psi \otimes W$, where $W$ is the 2D wavelet basis dictionary used in [21].
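To make the recovery step concrete, the sketch below solves a small $\ell_1$-regularized least squares problem. It does not implement GPSR [26]; it uses iterative soft thresholding (ISTA), a simpler method for the same kind of objective, chosen here only because it fits in a few lines. The random matrix `A` stands in for the product of the sensing matrix and the lifted basis, and the factor 0.5 in the objective, the value of `tau`, and the problem sizes are all assumptions of the example.

```python
import numpy as np

def objective(A, y, theta, tau):
    """0.5 * ||y - A theta||_2^2 + tau * ||theta||_1"""
    return 0.5 * np.sum((y - A @ theta) ** 2) + tau * np.sum(np.abs(theta))

def soft_threshold(x, t):
    """Shrink every entry of x toward zero by t."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def ista(A, y, tau, n_iter=2000):
    """Iterative soft thresholding for the objective above (a stand-in for GPSR)."""
    step = 1.0 / np.linalg.norm(A, 2) ** 2   # inverse Lipschitz constant of the gradient
    theta = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ theta - y)         # gradient of the data-fit term
        theta = soft_threshold(theta - step * grad, step * tau)
    return theta

rng = np.random.default_rng(0)
A = rng.standard_normal((40, 100)) / np.sqrt(40)  # toy stand-in for H @ Psi_bar
theta_true = np.zeros(100)
theta_true[[5, 37, 80]] = [1.5, -2.0, 1.0]        # three active coefficients
y = A @ theta_true                                # noiseless compressive measurements

theta_hat = ista(A, y, tau=0.01)
print("objective at zero:    ", objective(A, y, np.zeros(100), 0.01))
print("objective at estimate:", objective(A, y, theta_hat, 0.01))
print("largest entries at:   ", np.sort(np.argsort(np.abs(theta_hat))[-3:]))
```

The regularization constant trades data fidelity against sparsity: a larger `tau` drives more coefficients to exactly zero at the cost of a larger residual.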
The sparse target detection model proposed in this paper requires the detection algorithm introduced in Eq. (1.2) to operate over the subspace described in Eq. (1.5) and Eq. (1.6). This process is depicted in Algorithm 1, where $\bar{\Psi} = \Psi \otimes I$, $I$ is an identity matrix, $\otimes$ is the Kronecker product operator, $\hat{\theta}$ is the sparse vector obtained in Eq. (1.6), and $\hat{\Theta}$ is the sparse matrix of all spectral bands in the basis $\Psi$.

The target detector is based on a joint sparse model to extract the contextual information in HSI. In particular, it is assumed that pixels for the same material in a region share a common sparsity pattern. Thus, similar neighboring pixels can be sparsely represented by a linear combination of a few shared atoms:
[equation 1.7]
where $\Theta_i$ is the sparse representation of the pixels in a spatial neighborhood formed by the $T$ neighboring pixels of the $i$-th test pixel. From the estimated matrix $\hat{\Theta}_i$, the joint sparsity problem can be formulated as
[equation 1.8]
where $\hat{A}$ is a joint sparse matrix with only $K_0$ non-zero rows, $K_0$ denotes the sparsity level of $\hat{A}$, and $\|\cdot\|_F$ denotes the Frobenius norm. Once the sparse matrix $\hat{A}$ is obtained, the label of the test pixel $f_i$ is determined using the minimal total residual:
[equation 1.9]
where $\hat{A}_b$ and $\hat{A}_t$ consist of the $N_b$ and $N_t$ rows in $\hat{A}$ that are associated with the background and target subdictionaries $M_b$ and $M_t$, respectively.
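The joint sparsity step and the residual-based decision can be sketched as follows. This is an illustration under stated assumptions rather than the authors' Algorithm 1: it uses a plain simultaneous orthogonal matching pursuit, operates directly on a toy neighborhood matrix `X` instead of on PCA coefficients recovered from CASSI measurements, and uses a dictionary with orthonormal atoms so that the outcome is easy to verify. The names `somp` and `detector_output` are hypothetical.

```python
import numpy as np

def somp(D, X, k0):
    """Greedy joint sparse coding: select k0 atoms of D shared by all columns of X."""
    residual = X.copy()
    support = []
    for _ in range(k0):
        # Score each atom by its total correlation with the residuals of all pixels.
        scores = np.linalg.norm(D.T @ residual, axis=1)
        if support:
            scores[support] = -np.inf      # never select the same atom twice
        support.append(int(np.argmax(scores)))
        # Refit all selected atoms jointly by least squares.
        coeffs, *_ = np.linalg.lstsq(D[:, support], X, rcond=None)
        residual = X - D[:, support] @ coeffs
    A = np.zeros((D.shape[1], X.shape[1]))
    A[support, :] = coeffs                 # only k0 rows of A are non-zero
    return A

def detector_output(D, X, n_b, k0):
    """Background residual minus target residual; larger values favour the target class."""
    A = somp(D, X, k0)
    r_b = np.linalg.norm(X - D[:, :n_b] @ A[:n_b, :])  # Frobenius norm for 2-D input
    r_t = np.linalg.norm(X - D[:, n_b:] @ A[n_b:, :])
    return r_b - r_t

rng = np.random.default_rng(0)
n_dim, n_b, n_t, T = 30, 20, 5, 9
Q, _ = np.linalg.qr(rng.standard_normal((n_dim, n_dim)))
D = Q[:, :n_b + n_t]                       # toy dictionary [D_b | D_t], orthonormal atoms
weights = rng.uniform(0.5, 1.5, size=(2, T))
noise = 0.01 * rng.standard_normal((n_dim, T))
X_target = D[:, n_b:n_b + 2] @ weights + noise   # neighborhood built from two target atoms
X_background = D[:, :2] @ weights + noise        # neighborhood built from two background atoms

print(detector_output(D, X_target, n_b, k0=4) > 0)
print(detector_output(D, X_background, n_b, k0=4) < 0)
```

Sharing one support across the whole neighborhood is what encodes the assumption that nearby pixels of the same material use the same few training signatures, while each pixel keeps its own coefficients.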
Computer Simulations and Results
The performance of the proposed sparsity model in target detection is evaluated from compressive CASSI measurements obtained from two spectral images. The first image is the self-test dataset from the RIT-CIS-DIRT project [27], and the data of this image, as shown in Fig. 2, were collected as a component of a field experiment conducted in July 2006, near the small town of Cooke City, MT, USA. The hyperspectral imagery was collected using the HyMap sensor operated by HyVista, with approximately 3-meter ground resolution. The sensor generates 128 bands across the reflective solar wavelength region of 0.45-2.5 μm, with contiguous spectral coverage (except in the atmospheric water vapor bands) and bandwidths between 15 and 20 nm. A small fabric panel was used as the target, and its reflectance spectra were measured by a Cary 500 spectrophotometer in the laboratory.
The second image is the EO1H0070552014301110PF-SG1-01 spectral image, as shown in Fig. 3, collected by the EO-1 Hyperion sensor on October 28, 2014 in the region of Mogotes, Santander, Colombia. The sensor is capable of resolving 220 spectral bands (from 0.4 to 2.5 μm) with approximately 30-meter spatial resolution, and only a small patch of the whole image is used in the experiments. The image is composed mostly of cultivated and ready-to-cultivate fields. In both images, the model of the multishot CASSI system [16] is used to obtain a set of FPA compressive measurements, with the number of shots chosen to correspond to an approximate percentage of sensed information of the image.
The target detector proposed in Algorithm 1 is applied to the $\hat{\Theta}$ representation of the image. The sparse vector $\hat{\theta}$ that solves the problem formulated in Eq. (1.6) is obtained using the GPSR algorithm proposed in [26].
The sparsity-constrained problem in Eq. (1.8) is solved using the simultaneous orthogonal matching pursuit (SOMP) algorithm, with a fixed sparsity level $K_0 = 4$ and a joint sparsity neighborhood with $T = 9$. The estimated matrix $\hat{A}$ is later used to calculate the score matrix $D$, whose entries determine the probability that the pixel area is a target, as shown in Eq. (1.9).
The proposed algorithm is compared with three target detection algorithms for hyperspectral images, i.e., the Adaptive Matched Subspace Detector (AMSD), Orthogonal Subspace Projection (OSP), and Constrained Energy Minimization (CEM), which are available in the Matlab signal-processing toolbox. These algorithms were used without compression, using 100% of the spectral data.
The results are analyzed both visually and quantitatively using the receiver operating characteristic (ROC) curves, as shown in Graph 1 and Graph 2. The ROC curve describes the probability of detection (PD) as a function of the probability of false alarms (PFA). To calculate the ROC curve, we pick thousands of thresholds between the minimum and maximum of the detector output. The target or background labels for all pixels in the test region are determined at each threshold. The PFA is calculated as the number of false alarms (background pixels labeled as target) over the total number of pixels in the test region, and the PD is the ratio of the number of hits (target pixels labeled as target) to the total number of true target pixels.
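The ROC procedure described above can be sketched in a few lines. The function names `roc_curve` and `auc`, the number of thresholds, and the toy scores are illustrative assumptions; the normalization of PFA by the total number of test pixels follows the description in the text. The area is computed with the trapezoidal rule.

```python
import numpy as np

def roc_curve(scores, is_target, n_thresholds=1000):
    """Sweep thresholds between the minimum and maximum detector output."""
    scores = np.asarray(scores, dtype=float).ravel()
    is_target = np.asarray(is_target, dtype=bool).ravel()
    thresholds = np.linspace(scores.min(), scores.max(), n_thresholds)
    pd = np.empty(n_thresholds)
    pfa = np.empty(n_thresholds)
    for i, thr in enumerate(thresholds):
        detected = scores > thr
        # Hits over the number of true target pixels.
        pd[i] = np.sum(detected & is_target) / is_target.sum()
        # False alarms over ALL test pixels, as described in the text.
        pfa[i] = np.sum(detected & ~is_target) / scores.size
    return pfa, pd

def auc(pfa, pd):
    """Trapezoidal area under the curve; thresholds increase, so reverse both arrays."""
    x, y = pfa[::-1], pd[::-1]
    return float(np.sum(np.diff(x) * (y[1:] + y[:-1]) / 2.0))

scores = np.array([0.9, 0.8, 0.1, 0.2, 0.3])     # toy detector outputs
labels = np.array([1, 1, 0, 0, 0], dtype=bool)   # first two pixels are targets
pfa, pd = roc_curve(scores, labels)
print(pd[0], pfa[0])   # lowest threshold
print(auc(pfa, pd))
```

Because PFA is normalized by all test pixels rather than by the background pixels only, the curve ends at the background fraction of the test region instead of at 1, so areas computed this way are comparable only between detectors evaluated on the same test region.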
RIT-CIS-DIRT Dataset: Cooke City
The first spectral image used to test the performance of the classifier is the Cooke City self-test [27]. In the following simulations, the number of bands is reduced to 90 (3rd-46th, 49th, 51st-62nd, 66th, 69th-72nd, and 86th-122nd) by eliminating 38 absorption and low-SNR bands. This image has a spatial resolution of 3 m per pixel and a spatial dimension of 800×200 pixels.
The proposed algorithm was tested by varying two specific parameters: the compression level that we expect to achieve and the transmittance level of the coded apertures. The numerical results in Table 1 show the area under the curve (AUC) of the detector under the selected transmittance level and compression rate.

The detection results for the proposed algorithm and the comparative results from other detection algorithms are shown in Figure 3. For additional clarity, the ROC curves of the algorithms are displayed in Graph 1.

Based on these results, it is reasonable to conclude that the proposed method achieves a performance similar to that of the target detection algorithms used in traditional spectral imaging.
Hyperion Image
The spatial dimensions used for this spectral image are 32×32. A set of 10 target pixels was taken from the image for use as training samples. A set of 206 pixels was also taken from the image for use as test pixels to be assigned as either target or background. As in the previous experiments, the GPSR algorithm was used to solve Eq. (1.6).
The results of the simulations performed on this image are shown in Fig. 4 and Graph 2. As with the previous image, Fig. 4 shows a grayscale view of the processed image together with the ground truth of the selected target and the outcomes of the proposed algorithm using 20% of the data cube and 40% transmittance.
Similarly, Graph 2 shows the behavior of the proposed algorithm compared with traditional approaches using ROC curves. The AUC corresponds to the probability of detection of the algorithm. In addition, the numerical results are shown in Table 2. It can be observed that the proposed algorithm obtains a probability of detection near when using only 15% of the spectral information in the images with a well-defined target spatial location.


Conclusion
This work proposes a spectral image target detector that directly labels each spectral pixel as either target or background from a set of compressive CASSI measurements. This detector uses the sparsity of spectral pixels in a given training basis. The sparse vector representing each pixel in the training basis is recovered from the CASSI measurements and is then used to determine whether or not the test pixel is a target pixel. The interpixel correlation in HSI is incorporated using a joint sparsity model, where the pixels in a small neighborhood in the test image are represented by a linear combination of a few common training samples weighted with a different set of coefficients for each pixel. The resulting sparse representations are used directly for target detection. The proposed detection method achieves a probability of detection of 98.92% if only 40% of the spectral information is used. A transmittance level between 10% and 30% produces the most accurate results.
References
[1] J.-S. Pan, P.-W. Tsai, and H.-C. Huang, Advances in Intelligent Information Hiding and Multimedia Signal Processing. 2016.
[2] R. Singh, M. Vatsa, A. Majumdar, and A. Kumar, Machine Intelligence and Signal Processing. 2016.
[3] G. Hughes, “On the mean accuracy of statistical pattern recognizers,” IEEE Trans. Inf. Theory, vol. 14, no. 1, 1968.
[4] L. Zhang, W. Wei, Y. Zhang, C. Shen, A. Van Den Hengel, and Q. Shi, “Dictionary Learning for Promoting Structured Sparsity in Hyperspectral Compressive Sensing,” IEEE Trans. Geosci. Remote Sens., vol. PP, no. 99, pp. 7223–7235, 2016.
[5] A. M. Bruckstein, D. L. Donoho, and M. Elad, “From Sparse Solutions of Systems of Equations to Sparse Modeling of Signals and Images,” SIAM Rev., vol. 51, no. 1, pp. 34–81, 2009.
[6] M. F. Duarte and Y. C. Eldar, “Structured Compressed Sensing: From Theory to Applications,” IEEE Trans. Signal Process., vol. 59, no. 9, 2011.
[7] M. Ying Yang, S. Feng, H. Ackermann, and B. Rosenhahn, “Global and Local Sparse Subspace Optimization for Motion Segmentation,” ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci., vol. II-3/W5, pp. 475–482, 2015.
[8] Z. Wang, D. Liu, J. Yang, W. Han, and T. Huang, “Deep networks for image super-resolution with sparse prior,” Proc. IEEE Int. Conf. Comput. Vis., pp. 370–378, 2015.
[9] W. Dong, G. Shi, Y. Ma, and X. Li, “Image Restoration via Simultaneous Sparse Coding: Where Structured Sparsity Meets Gaussian Scale Mixture,” Int. J. Comput. Vis., pp. 1–16, 2015.
[10] X. Jiang and J. Lai, “Sparse and dense hybrid representation via dictionary decomposition for face recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 37, no. 5, pp. 1067–1079, 2015.
[11] J. K. Pillai, V. Patel, R. Chellappa, and N. Ratha, “Robust and Secure Iris Recognition,” in Handbook of Iris Recognition, 2016.
[12] S. D. S. Al-Shaikhli, M. Y. Yang, and B. Rosenhahn, “Brain tumor classification and segmentation using sparse coding and dictionary learning,” in Biomedical Engineering / Biomedizinische Technik, 2016.
[13] G. Camps-Valls, T. V Bandos Marsheva, and D. Zhou, “Semi-supervised graph-based hyperspectral image classification,” IEEE Trans. Geosci. Remote Sens., vol. 45, no. 10, pp. 3044–3054, 2007.
[14] L. Fang, S. Li, X. Kang, and J. A. Benediktsson, “Spectral–Spatial Hyperspectral Image Classification via Multiscale Adaptive Sparse Representation,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 12, pp. 7738–7749, 2014.
[15] W. Li, S. Prasad, and J. E. Fowler, “Classification and reconstruction from random projections for hyperspectral imagery,” IEEE Trans. Geosci. Remote Sens., vol. 51, no. 2, pp. 833–843, 2013.
[16] H. Arguello and G. R. Arce, “Colored coded aperture design by concentration of measure in compressive spectral imaging,” IEEE Trans. Image Process., vol. 23, no. 4, pp. 1896–1908, 2014.
[17] G. R. Arce, D. J. Brady, L. Carin, H. Arguello, and D. S. Kittle, “Compressive coded aperture spectral imaging: An introduction,” IEEE Signal Process. Mag., vol. 31, no. 1, pp. 105–115, 2014.
[18] H. Arguello and G. R. Arce, “Code aperture optimization for spectrally agile compressive imaging,” J. Opt. Soc. Am. A, vol. 28, no. 11, p. 2400, 2011.
[19] A. Ramirez, H. Arguello, G. R. Arce, and B. M. Sadler, “Spectral Image Classification From Optimal Coded-Aperture Compressive Measurements,” IEEE Trans. Geosci. Remote Sens., vol. 52, no. 6, pp. 3299–3309, 2014.
[20] X. Lin, Y. Liu, J. Wu, and Q. Dai, “Spatial-spectral encoded compressive hyperspectral imaging,” ACM Trans. Graph., vol. 33, no. 6, 2014.
[21] A. Ramirez, G. R. Arce, and B. M. Sadler, “Spectral Image Unmixing From Optimal Measurements,” IEEE Trans. Geosci. Remote Sens., vol. 53, no. 1, pp. 405–415, 2015.
[22] H. Rueda, H. Arguello, and G. R. Arce, “DMD-based implementation of patterned optical filter arrays for compressive spectral imaging,” J. Opt. Soc. Am. A, vol. 32, no. 1, p. 80, 2014.
[23] Y. Chen, N. M. Nasrabadi, and T. D. Tran, “Simultaneous joint sparsity model for target detection in hyperspectral imagery,” IEEE Geosci. Remote Sens. Lett., vol. 8, no. 4, pp. 676–680, 2011.
[24] Y. Chen, N. M. Nasrabadi, and T. D. Tran, “Sparse Representation for Target Detection in Hyperspectral Imagery,” Sel. Top. Signal Process. IEEE J., vol. 5, no. 3, pp. 629–640, 2011.
[25] S. Kim, “An Interior-Point Method for Large-Scale Logistic Regression,” J. Mach. Learn. Res., vol. 8, pp. 1519–1555, 2007.
[26] M. Figueiredo, R. Nowak, and S. J. Wright, “Gradient projection for sparse reconstruction: application to compressed sensing and other inverse problems,” IEEE J. Sel. Top. Signal Process., vol. 1, no. 4, pp. 586–597, 2007.
[27] D. Snyder, J. Kerekes, I. Fairweather, R. Crabtree, J. Shive, and S. Hager, “Development of a web-based application to evaluate target finding algorithms,” Int. Geosci. Remote Sens. Symp., vol. 2, no. 1, pp. 915–918, 2008.