Articles

Recepción: 01 Septiembre 2021
Publicación: 15 Diciembre 2021
DOI: https://doi.org/10.15388/LMR.2021.25214
Abstract: In this article we focus on the problem of supervised classifying of the spatio- temporal Gaussian random field observation into one of two classes, specified by different mean parameters. The main distinctive feature of the proposed approach is allowing the class label to depend on spatial location as well as on time moment. It is assumed that the spatio-temporal covariance structure factors into a purely spatial component and a purely temporal component following AR(p) model. In numerical illustrations with simulated data, the influence of the values of spatial and temporal covariance parameters to the derived error rates for several prior probabilities models are studied.
Keywords: separable covariance function, AR(p) model, Bayes discriminant function.
Summary: Nagrinėjama tiesinė diskriminantinė gausinių erdvės-laiko duomenų analizė, naudojant separabilius kovariacijų modelius ir erdvėje bei laike kintančias klasės žymes. Laiko kovariacijai naudojamas AR(p) modelis, o erdvinei kovariacijai naudojamas eksponentinis modelis. Nagrinėjamas klasifika- vimo procedūros efektyvumas, kuris vertinamas panaudojant generuotus Gauso atsitiktinių laukų duomenis.
Keywords: separabili kovariacinė funkcija, AR(p) modelis, Bajeso diskriminantinė funkcija.
Introduction
Spatial supervised classification is a problem of labeling observations based on fea- ture information and information about spatial adjacency relationships with training sample. This problem has been studied by numerous authors (see e.g. Atkinson and Lewis [1]). However, these studies are usually based on the assumption of con- ditional independence of feature observations. Comprehensive overview of methods for statistical classification and discrimination of Gaussian spatial data is provided by Berrett and Calder [3]. The novel approach to classification of Gaussian Random Importar imagen Importar tabla Field (GRF) observation by avoiding the assumption of conditional independence is developed by Dučinskas and Dreižienė [5]. However, statistical discriminant analy- sis of spatio-temporal data has been rarely considered previously. Šaltytė-Benth and Dučinskas [8] considered classification of spatio-temporal data modeled by GRF in particular case when observation of feature at focal location is uncorrelated with the training sample.
In the present paper, avoiding independence restrictions, we focus on the classi- fication of data modeled by random fields with separable spatio-temporal covariance structures specified by geostatistical spatial margins and discrete temporal margins (see e.g. [4]). Separability of covariances was assumed for the sake of reduction of complexity due to interdependencies between features.
The main distinctive feature of the proposed approach is allowing the class label to depend on spatial location as well as on time moment. That essentially widens the application area of presented investigations.
For the performance of classifiers, the values of derived in Local Bayes error rates and empirical error rates are used. For numerical illustrations, the exponential isotropic models for spatial covariance and AR(p) model of temporal covariance are considered. This is the extension of AR(1) case explored in Karaliutė and Dučinskas [7]. Performance of proposed classification rule is compared for various parameters of pure spatial and temporal covariances and prior class probabilities models.
This paper is organized as follows: proposed spatio-temporal data models and conditional distributions are delivered in the next section; in Section 2 conditional Bayes classification rule and its error rates are presented; in Section 3 the numerical illustrations and simulations for various separable stationary spatio-temporal covari- ance and prior probabilities models are displayed, and finally, the conclusions are in the last section.
1 Spatio-temporal data models and conditional distributions
The main objective of this paper is to classify feature observations of , where s and t define spatial and temporal coordinates, respectively. Let be a random field that represents class label and takes only the value 0 or 1.
In this study, we assume that for the model of observation con- ditional on is , where deterministic spatio-temporal drift. The error term is assumed to be generated by the univariate zero-mean GRF , with the separable spatio-temporal covariance model for all and .
Here denotes pure spatial covariance between observations in locations and and , denotes pure temporal covariance between observations at time points and. Under this assumption, the spatio-temporal covariance structure factors into a purely spatial and a purely temporal component, which allows for computationally efficient estimation and inference.
In this study we follow the popular tradition in environmental and agricultural research when the data are recorded at regular time intervals (time lags) and at irregular stations (locations) in compact area (see e.g. [2, 6]).
Let be a set of locations where observations are taken at time At every moment of time the set is split into two classes, and . Denote nlt the number of locations (of n) at time that belong to class thus nlt is the number of points in the set , and for every. Hence a set of class labels at any time moment can differ in composition.
Joint training sample is stratified training sample, specified by ma- trix , where This structure of data presentation is motivated by a model that assumes multivariate (in space) time se- ries. Denote by and the realized value of and respectively
In this study we consider the linear parametric drift where is the vector of a spatial covariates, and is a q dimensional vector of parameters, , and.
Denote by the matrix where
Then the matrix model for conditional on is , where with and matrix of Gaussian errors Here is identity matrix, and vector of parameters. Under covariance separability assumption with denoting the matrix of pure temporal covariances and denoting the matrix of pure spatial covariances.
In present paper we are dealing with the problem of classification of the observa- tions into one of two classes with given joint training sample or, in other words, based on training sample information we want to predict label at an observed location at the time moment and the ith row of identity matrix .
Under spatio-temporal data model specification, we can conclude that in , the conditional distribution of given and is Gaussian,
where with .
In this study, we assume that the conditional distribution of label , given joint training sample depends only on class labels values, conditional distribution of is identical to conditional distribution of. Set P , and, for simplicity, call them prior class probabilities.
2 Conditional Bayes discriminant functions and its error rates
Under the assumption that the classes are completely specified, the conditional Bayes discriminant function (CBDF) minimizing the probability of misclassification is formed by the log-ratio of conditional likelihood of distribution specified in (1), that is
where ln.
Let us call the probability of misclassification for as Local Bayes error rate and denote it by . Also let us denote squared Mahalanobis distance between conditional distributions by
Specific attention is given to the Gaussian spatio-temporal model with pure spatial covariances belonging to the family of powered-exponential isotropic models and with pure temporal covariance of stationary AR(p) model.
Lemma 1. The Local Bayes error rate is
where is the standard normal cumulative distribution function and
Proof. It is known that for AR(p) model parameters quantify the temporal depen-dency and for (j), where is the variance of the temporal white noise.
Then and
By using the properties of multivariate Gaussian distribution and inserting the above expressions in formula (3) we complete the proof of Lemma 1. H
Error estimation is critical to classification because the validity of the resulting classifier model, composed of the classifier and its error estimate, is based on the accuracy of the error estimation procedure. Given a set of sample data, the data can be split between training and test data, with a classifier being designed on the training data and its error being validated on the test data. In this paper our focus is on using p temporal observations for training and the observations at m + 1th time moment is using for testing.
3 Numerical illustrations and simulations
For numerical illustrations of the proposed classifier performance, we considered the Gaussian spatio-temporal model with pure spatial exponential covariances and with

pure temporal covariance stationary AR(1) model. Then it is easy to derive that
In the study, the exponential isotropic nugetless spatial covariance is considered.
So , where denotes the spatial correlation matrix withHere is the so called range parameter that represents the spatial dependence.
This choice is based on the smoothness level of sample paths. Sample paths of a GRF with the exponential covariance function are not smooth, when the squared exponential covariance model has smooth sample paths.
Two methods for prior class probabilities is proposed.
First one is based on Temporal Weighted Moving Average (TWMA) method
Second one adds spatial correlations for weighting
where denotes the index of the nearest neighbor to . Denote this method by (STWMA). We have compared these four particular cases by calculating the for. Numerical illustrations performed on 20 locations in two dimensional Euclidean area with and for label indicators are depicted in Fig. 1.
The averages for two models of prior probabilities and for 2.5 and various at momentare presented in Fig. 2. As it might be seen from Fig. 2, incorporation of spatial correlation in class prior probabilities ( method STWMA) does not have advantage against method TWMA for almost all values of parameter.
4 Conclusions
In this paper we propose approach to classification of spatio-temporal data in the framework of Bayes discriminant analysis for separable spatio-temporal covariances.

Several simulation studies were conducted to estimate and compare empirically the classifiers for particular separable stationary spatio-temporal covariance and various prior class probabilities models. Numerical analysis showed that there is no reason to incorporate spatial correlation in the prior probabilities since it does not improve the performance of the proposed classifier.
References
[1] P.M. Atkinson, P. Lewis. Geostatistical classification for remote sensing: an introduction. Comput. Geosc., 26(4):361–371, 2000. https://doi.org/10.1016/S0098-3004(99)00117- X.
[2] G. Atluri, A. Karpatne, V. Kumar. Spatio-temporal data mining: a sur- vey of problems and methods. ACM Comput Surv: CSUR, 51(4):83, 2018. https://doi.org/10.1145/3161602.
[3] C. Berrett, C.A. Calder. Bayesian spatial binary classification. Spatial Stat., 16:72–102, 2016. https://doi.org/10.1016/j.spasta.2016.01.004.
[4] S.S. Demel, J. Du. Spatio-temporal models for some data sets in continuous space and discrete time. Stat. Sin., 25:81–98, 2015. https://doi.org/10.5705/ss.2013.223w.
[5] K. Dučinskas, L. Dreižienė. Risks of classification of the Gaussian Markov random field observations. J. Class., 35:422–436, 2018. https://doi.org/10.1007/s00357-018-9269-7.
[6] J. Haslett, A.E. Raftery. Space-time modelling with long memory depen- dence: Assessing Ireland’s wind power resourse. Appl. Stat., 38:1–50, 1989. https://doi.org/10.2307/2347679.
[7] M. Karaliutė, K. Dučinskas. Classification of gaussian spatio-temporal data with sta- tionary separable covariances. Nonlinear Anal. Model. Control, 26(2):363–374, 2021. https://doi.org/10.15388/namc.2021.26.22359.
[8] J. Šaltytė-Benth, K. Dučinskas. Linear discriminant analysis of multivariate spatial- temporal regressions. Scand. J. Stat., 32:281–294, 2005.