Abstract: Hyperspectral imaging requires handling a large amount of multidimensional spectral information. Hyperspectral image acquisition, processing and storage are computationally and economically expensive and, in most cases, slow processes. In recent years, optical architectures have been developed for acquisition of spectral information in compressed form by using a small set of measurements coded by a spatial modulator. This work formulates a processing scheme that allows the measurements acquired by such compressive sampling systems to be used to perform spectral detection of targets by adapting traditional detection algorithms for use in the compressive sampling model and shows that the performance is comparable with that obtained by detection processes without compression.
Keywords:Hyperspectral ImagingHyperspectral Imaging, Compressive Sensing Compressive Sensing, Target Detection Target Detection, Sparsity Model Sparsity Model.
Resumen: La adquisición y procesamiento de imágenes espectrales involucra el manejo de grandes cantidades de información espectral multidimensional. Su adquisición, procesamiento y almacenamiento son costosos temporal, computacional y económicamente. En los últimos años se han desarrollado arquitecturas ópticas para la adquisición de información espectral de forma comprimida usando un conjunto reducido de mediciones codificadas por un modulador espacial. Este trabajo busca formular un esquema de procesamiento que permita utilizar las mediciones adquiridas por dichos sistemas de muestreo compresivo para efectuar detección espectral de objetivos, se adaptaran algoritmos de detección tradicionales para ser usados en el modelo de muestreo compresivo y se mostrara que su desempeño es comparable al obtenido en procesos de detección sin compresión. alabras Clave Imágenes hiperespectrales; Muestreo compresivo; Detección de objetivos; Modelo de escasez Introduction
Palabras clave: Imágenes hiperespectrales, Muestreo compresivo, Detección de objetivos, Modelo de escasez.
A sparsity-based approach for spectral image target detection from compressive measurements acquired by the CASSI architecture [1]
Un enfoque basado en escasez para la detección de objetivos en imágenes espectrales a partir de medidas compresivas adquiridas por la arquitectura CASSI [2]
Received: 04 July 2016
Accepted: 19 May 2017
Over the last several decades, the development of optical sensors has facilitated remote sensing analysis with rich spatial, spectral, and temporal information. The increase in the spectral resolution of hyperspectral images (HSI) and infrared sounders has led to new application domains and poses new methodological challenges in data analysis. HSI allows characterization of the objects of interest (e.g., land-cover classes) with unprecedented accuracy and aids in keeping inventories up to date. Furthermore, improvements in spectral resolution have necessitated advances in signal processing and exploitation algorithms [1].
Hyperspectral image classification and target detection are among the most important
problems in various scientific disciplines, such as machine learning [2] image processing, and computer vision. Several critical issues should be considered in the classification of hyperspectral data. For instance, the high number of spectral channels and low number of labeled training samples lead to the curse-of-dimensionality problem (i.e., the Hughes phenomenon [3]) and result in the risk of overfitting of the training data [4]. To alleviate the problems that come with the great dimensionality of data, the spatial variability of spectral information, and the high cost of true sample labeling and to enhance the numerical and stability, a variety of approaches have been proposed [5]. In general, these approaches take advantage of the inherent sparsity in a certain basis of the natural signals, whereby they can be approximately represented by a few coefficients that carry the most relevant information [6]. Applications of sparse representations in computer vision and pattern recognition can be found in various fields, including motion segmentation [7] image super-resolution [8], image restoration [9], and discriminative tasks such as face recognition [10], iris recognition [11], tumor classification [12], and HSI classification [13]. In these applications, the use of sparsity as a prior condition often leads to state-of-the-art performance. Furthermore, the sparse nature of spectral imagery can be exploited when classifying images that were acquired using compressive spectral imaging systems, which require fewer measurements than do systems attained with traditional hyperspectral imaging sensors [14][15].
The coded-aperture snapshot spectral imaging (CASSI) system depicted in Figure 1 is a compressive hyperspectral imager that is used to acquire compressive spectral measurements. The CASSI system simultaneously encodes spatial and spectral information of a scene in a small set of coded focal plane array (FPA) measurements [16].

The main elements of the CASSI system are the coded apertures, a dispersive element, and the sensor responsible for capturing the energy of the scene encoded. The coded apertures are matrix arrays composed of translucent optical elements that block or unblock the path of light through the system. The dispersive elements (usually prisms or gratings) are responsible for splitting the light into its wavelengths. The quality of the images acquired by the system depend on three main factors: the percentage of translucent elements that allow light into the apertures (commonly known as transmittance), the size of the data cube and the compression rate [17].
Mathematically, CASSI projections measured in the i-th shot can be treated as shown in Figure 2 and are described by y=Hf+
, where H is a N(N+L+1)x(N2L) matrix whose structure is determined by the coded aperture entries and the dispersive element effect. For spectrally rich or spatially detailed images, a single-shot FPA measurement is not sufficient to achieve proper quality reconstructions, and additional shots are required. The CASSI architecture is capable of admitting multiple snapshots, each with a different coded aperture pattern, thus yielding a less ill-posed inverse problem and improved signal reconstructions [18]. The set of
L FPA measurements is given by y=Hf+
, where y=[yT0,...,HTk-1]T is the one-dimensional vectorized form of all FPA shots,H=[HT0,...,HTk-1]T
RM(N+L+1)x(N2L) is the sensing matrix, and is the additive noise of the sensing system. The spectral data cube is reconstructed as
, is an S-sparse representation of f on the basis
and
is a regularization constant [19].

Among the main limitations of the CASSI system are the mixture of spectral information with spatial information due to spectral shifting and the way in which the energy is integrated within the detector. In other architectures, only the spectral information is mixed [20]. In addition, the number of spectral bands available is limited by the size of the detector, . However, this example is one of the most broadly studied compressive spectral architectures and has been used in several applications [19][21] [22], which is why it was selected for this work.
This paper focuses on designing a target detection model that uses compressive measurements to find a sparse representation of image pixels from spectral information-based dictionaries. In addition, an algorithm is implemented that determines whether the evaluated pixel is a target pixel. The proposed algorithm is based on a joint sparsity model, where every f i pixel is approximately represented by a few sets of training signatures among the entire training dictionary. This dictionary is composed of subdictionaries of the target and background signatures. The sparse vector represents the atoms in the training dictionary, and their associated weights for each pixel can be recovered from the CASSI compressive measurements by solving a sparsity-constrained optimization problem. This process is used to determine whether the observation pixel is a target pixel.
Traditional spectral target detection based on sparsity takes advantage of the fact that any pixel in an image can be sparsely represented using a trained dictionary composed of the selected target and the background pixels. Considering the existent spatial correlation between each pixel and its spatial neighborhood, we can model a linear problem as follows:
[equation 1.1]where F is the neighborhood consisting of T number of pixels and A is the sparse coefficient matrix of the
pixels represented in the subspace spanned by M.
With a proper pixel-wise sparse representation, the goal of the detection task is to apply a detection function to each pixel in the image as follows:
[equation 1.2]where Ab consists of the first Nb rows of the matrix A corresponding to the background subdictionary Mb and At consists of the remaining rows in that correspond to the target subdictionary Mt If the output D(x) is greater than a fixed threshold, then the test sample is labeled a target; otherwise, it is labeled background. Further details on matrix estimation and sparse representation are explicitly presented in previous works [23] and [24].
A principal component transformation can be performed on the structured dictionary M
RLxTsuch that an orthonormal basis
is obtained from the set of NT training vectors. From [19], we know that the orthonormal basis
can be formed by the set of eigenvectors of the correlation matrix C
RLxL given by the following:
[equation 1.3]such that =VT . Principal component analysis (PCA) is one of the most frequently used approaches for hyperspectral dimensionality reduction and compression in HIS because it preserves the meaningful information of the image in a few of its components. Such basis transformation was successfully performed for classification in [19]. Thus, the original sparse vector α representing the test pixel f can be transformed into a new sparse vector
, which represents the pixel in the orthogonal basis
. The observation pixel fi can now be expressed as
[equation 1.4]Using the sparse representation of the pixels in the training basis
, the compressive CASSI measurements can be rewritten as
[equation 1.5]where
, identity matrix and
is the Kronecker product operator. Additionally,
, where
corresponds to the i-th PCA coefficient of the i-th pixel.
In Eq. (1.5),
is the noise of the system, and H is the CASSI sensing matrix. The proposed algorithm first finds an estimate of the sparse vector
directly from the FPA measurements by solving the sparsity-constrained optimization problem given by
[equation 1.6]where the
norm accounts for the
sparsity constraint and the error norm finds the closest sparse vector to the optimal CASSI compressive measurements. A variety of algorithms have been used in the literature to solve problems similar to the one stated in Eq. (1.6), such as the
-regularized least square solution via the interior point method [25] or, in this case, the gradient projection for sparse reconstruction (GPSR) [26]. Spatial interpixel correlation can be included in the optimization problem given in Eq. (1.6), by replacing the sparsifying basis
with
, where
is the 2D wavelet basis dictionary used in [21].
The sparse target detection model proposed in this paper requires the detection algorithm introduced in Eq. (1.2) to operate over the subspace described in Eq. (1.5) and Eq. (1.6). This process is depicted in Algorithm 1, where
, is an identity matrix, ⊗ is the Kronecker product operator,
is the sparse vector obtained in Eq. (1.6)and
is the sparse matrix of all spectral bands in the basis
.

The target detector is based on a joint sparse model to extract the contextual information in HSI. In particular, it is assumed that pixels for the same material in a region share a common sparsity pattern. Thus, similar neighboring pixels can be sparsely represented by a linear combination of a few shared atoms:
[equation 1.7]where
is the sparse representation of pixels in a spatial neighborhood formed by T neighboring pixels of the i-th test pixel. From the estimated matrix
, the joint sparsity problem can be formulated as
[equation 1.8]where
is a joint sparse matrix with only K non-zero rows, K0 denotes the sparsity level of
and ||.||F denotes the Frobenius norm. Once the sparse matrix
is obtained, the label of the test pixel
is determined using the minimal total residual.
[equation 1.9]where
and
consist of the Nb and Nt rows in
that are associated with the background and target sub-dictionaries
and
, respectively.
The performance of the proposed sparsity model in target detection is evaluated from compressive CASSI measurements obtained from two spectral images. The first image is the self-test dataset from the RIT-CIS-DIRT project [27], and the data of this image, as shown in Fig 2, were collected as a component of a field experiment conducted in July 2006, near the small town of Cooke City, MT, USA. The hyperspectral imagery was collected using the HyMap sensor operated by HyVista, with approximately 3-meter ground resolution. The sensor generates 128 bands across the reflective solar wavelength region of 0.45-2.5 μm, with contiguous spectral coverage (except in the atmospheric water vapor bands) and bandwidths between 15 and 20 μm. A small fabric panel was used as target, and its reflectance spectra were measured by a Cary 500 spectrophotometer in the laboratory.
The second image is the EO1H0070552014301110PF-SG1-01 spectral image, as shown in Fig. 3, collected by the EO-1 Hyperion sensor on October 28, 2014 in the region of Mogotes, Santander, Colombia. The spectral imagery has approximately 30-meter resolution, and only a small patch of the whole image is used in the experiments. The image is composed mostly of cultivated and ready-to-cultivate fields. The sensor is capable of resolving 220 spectral bands (from 0.4 to 2.5 μm) with a 30-meter resolution. In both images, the model of the multishot CASSI system [16] is used to obtain a set of FPA compressive measurements using different numbers of shots corresponding to an approximate percentage of sensed information of the image.
The target detector proposed in Algorithm 1 is used over the
representation of the image. The sparse vector
that solves the problem formulated in Eq. (1.6) is obtained using the GPSR algorithm proposed in [26]
The sparsity-constrained problem in Eq. (1.8) is solved using the SOMP algorithm, with a fixed sparsity level K0=4 and a joint sparsity neighborhood with T=9. The estimated matrix
is later used to calculate the score matrix D, whose entries determine the probability that the pixel area is a target, as shown in Eq. (1.9)
The proposed algorithm is compared with three target detection algorithms for hyperspectral images, i.e., Adaptive Matched Subspace Detector (AMSD), Orthogonal Subspace Projection (OSP), and Constrained Energy Minimization (CEM), which are available in the Matlab signal-processing toolbox. These algorithms were used without compression using 100% of the spectral data
The results are analyzed both visually and quantitatively using the receiver operating characteristic (ROC) curves, as shown Graph 1 in and Graph 2. The ROC curve describes the probability of detection (PD) as a function of the probability of false alarms (PFA). To calculate the ROC curve, we pick thousands of thresholds between the minimum and maximum of the detector output. The target or background labels for all pixels in the test region are determined at each threshold. The PFA is calculated by using the number of false alarms (background pixels determined as target) over the total number of pixels in the test region, and the PD is the ratio of the number of hits (target pixels determined as target) and the total number of true target pixels.
The first spectral image used to test the performance of the classifier is the Cooke City self-test [27]. In the following simulations, the number of bands is reduced to 90 (3rd-46th, 49th, 51st-62nd, 66th, 69th-72nd, and 86th-122nd) by eliminating 38 absorption and low-SNR bands. This image has a spatial resolution of 3m per pixel and a spatial dimension of 800×200 pixels.
The proposed algorithm was tested by varying two specific parameters: the compression level that we expect to achieve and the transmittance level of the coded apertures. The numerical results in Table 1 show the area under the curve (AUC) of the detector under the selected transmittance level and compression rate.

The detection results for the proposed algorithm and the comparative results from other detection algorithms are shown in Figure 3. For additional clarity, the ROC curves of the algorithms are displayed in Graph 1.

Based on these results, it is reasonable to conclude that the proposed method achieves a performance similar to that of the target detection algorithms used in traditional spectral imaging.
The spatial dimensions used for this spectral image is 32×32. A set of 10 target pixels was taken from the image for use as training samples. A set of 206 pixels was also taken from the image for use as test pixels to be assigned as either target or background. As in the previous experiments, the GPRS algorithm was used to solve Eq. (1.6).
The results of the simulations performed in this image are shown in Fig. 4 and Graph. 2. As in the previous image, Fig 4 shows a grayscale view of the image processed together with the ground truth of the selected target and the outcomes of the proposed algorithm using 20% of the data cube and 40% transmittance.
Similarly, Graph 2 shows the behavior of the proposed algorithm compared with traditional approaches using ROC curves. The AUC corresponds to the probability of detection of the algorithm. In addition, the numerical results are shown in Table 2. It can be observed that the proposed algorithm obtains a probability of detection near when using only 15% of the spectral information in the images with a well-defined target spatial location.


This work proposes a spectral image target detector that directly labels each spectral pixel as either target or background from a set of compressive CASSI measurements. This detector uses the sparsity of spectral pixels in a given training basis. The sparse vector representing each pixel in the training basis is recovered from the CASSI measurements and is then used to determine whether or not the test pixel is a target pixel. The interpixel correlation in HSI is incorporated using a joint sparsity model, where the pixels in a small neighborhood in the test image are represented by a linear combination of a few common training samples weighted with a different set of coefficients for each pixel. The resulting sparse representations are used directly for target detection. The proposed detection method achieves a probability of detection of 98.92% if only 40% of the spectral information is used. A transmittance level between 10% and 30% produces the most accurate results.






