12
Spatial Statistics

12.1 Introduction

Spatial statistics is a part of applied statistics and is concerned with modelling and analysis of spatial data. By spatial data we mean data where, in addition to the (primary) phenomenon of interest, the relative spatial locations of observations are also recorded because these may be important for the interpretation of data. This is of primary importance in earth‐related sciences such as geography, geology, hydrology, ecology, and environmental sciences, but also in other scientific disciplines concerned with spatial variations and patterns such as astrophysics, economics, agriculture, forestry, and epidemiology, and, at a microscopic scale, medical and health research. Spatial statistics uses nearly all methods described in the first eleven chapters of this book and also multivariate analysis and Bayesian methods, neither of which are discussed in this book. We therefore restrict ourselves in this chapter to a few basic principles and give hints for further reading for other important methods. As a consequence of this, the list of references is relatively long.

We restrict our attention to continuous characteristics and to Gaussian distributions and analyse examples with the program package R as we did in other chapters of this book. Those who prefer SAS and understand a bit of German are referred to the procedures in 6/61 of Rasch et al. (2008).

  • 6/61/0000 Spatial Statistics – Introduction
  • 6/61/1010 Estimation of the covariance function of a random variable with constant trend
  • 6/61/1020 Estimation of the semi‐variogram of a random variable
  • 6/61/1021 Estimation of the parameter of an exponential semi‐variogram model
  • 6/61/1022 Estimation of the parameter of a spherical semi‐variogram model
  • 6/61/1030 Definition of increments and of the generalised covariance function for non‐steady state random variables
  • 6/61/1031 Estimation of the generalised covariance function for non‐steady state random variables
  • 6/61/1100 Modelling spatial dependencies between two variables
  • 6/61/2000 Spatial prediction – survey
  • 6/61/2010 Prediction of stationary random variables using the covariance function
  • 6/61/2020 Prediction of stationary random variables using the semi‐variogram
  • 6/61/2030 Prediction of non‐steady state random variables
  • 6/61/2040 Spatial prediction: Co‐kriging
  • 6/61/2050 Prediction of probabilities
  • 6/61/2051 Prediction of exceedance probabilities
  • 6/61/2052 Hermitean prediction.

We restrict ourselves to one‐ and two‐dimensional regions D ⊂ R1 and D ⊂ R2, respectively. However, D ⊂ R3 (oil and mineral prospection, 3D imaging) is also possible. In some fields such as Bayesian data analysis, design and simulation one even requires spaces D of dimension >3; this pertains, in particular, to the design and analysis of computer experiments with a moderate to large number of input variables.

Points in D ⊂ R2 are written as sT = (x1, x2) images and the coordinates x1, x2 are in geostatistics often the Gauss–Krüger coordinates on the earth based on the degrees (°) of the longitude meridional zone of the surface of the earth, see Krüger (1912). The surface is subdivided in meridional zones of a latitude of 3° running from the North Pole to the South Pole parallel to its central meridian. The degrees of the central meridian of each meridional zone counted from 0° eastwards are mapped to code numbers by dividing them by three as shown in Table 12.1.

Table 12.1 Gauss–Krüger code numbers.

Central meridian Western longitude Eastern longitude
Degree
Degree east from 0° 354° 357°
Code number 118 119 0 1 2

The meridional zone is conformally mapped on a cylinder barrel with the axis in the equatorial plane and a radius equal to the curvature radius of the meridian. Its origin is the intersection of the central meridian and the equator. From the origin the coordinates of the points on the surface of the earth are defined like in a usual Cartesian coordinate system, positive to the east by the so‐called easting (x1), and to the north by the so‐called grid north (x2). The coordinates on the earth can be transformed to the Gauss–Krüger coordinates via https://www.koordinaten‐umrechner.de. As an example, we give the Gauss–Krüger coordinates of the (first author's) house in Feldrain 73 in Rostock, Germany.

Degrees minutes seconds E 12° 11′ 41; N 54° 06′ 18

Easting: 4507572; grid north 59922353.

First we must select a spatial model for the observations (variables) at the points in D ⊂ R2; their realisations are our observations.

The general model is for observations y(s) at images

12.1equation

with side conditions for weak stationarity (second order stationarity):

12.2equation
12.3equation

Formula (12.3) leads for h = 0 to var(y(s))C(0).

If we further assume that C(h) = C(∥h∥) with the Euclidian norm h of h, then C is called isotropic, otherwise anisotropic.

The positions of observation sites s ∈ D can be fixed in advance (as, e.g. the position of wind power stations in an area) or may be random.

As examples where observation points occur randomly, we mention meteor strikes in a special area. To this situation also belongs the possibly oldest mapping of clusters of cholera cases in the London epidemic of 1854 (Snow 1855). Randomly means in this connection that we assume that a point is equally probable to occur at any location and that the position of a point is not affected by any other point.

Further, the observation points may not be fixed but determined by the scientist (monitoring). In this case, besides problems of analysis, design problems also exist by selecting the optimal observation points in an area. For this, readers are referred to Müller (2007).

Comprehensive treatments of the whole field of spatial statistics are given in Ripley (1988), Cressie (1993), and Gaetan and Guyon (2010).

Basically, there are four classes of problems which spatial statistics is concerned with: point pattern analysis, geostatistical data analysis, areal/lattice data analysis and spatial interaction analysis. These sub‐problems are treated in overview papers such as: Pilz (2010), Mase (2010), Kazianka and Pilz (2010a), Diggle (2010), and Spöck and Pilz (2010).We discuss mainly geostatistical data analysis with some hints to areal data analysis.

For a good overview on software for different problem areas of spatial data analysis with R we recommend the book by Bivand et al. (2013), for the important issue of simulation of spatial models we refer to Lantuéjoul (2002) and Gaetan and Guyon (2010). An overview of methodology and software for interfacing spatial data analysis and geographic information system (GIS) for visualising spatial data is given in Pilz (2009).

Due to the fact that in some fields of application of spatial statistics special methods have been applied, and further because the theory and applications are still in development, we cannot give a closed presentation of the field. We describe basic methods used in geostatistics using Euclidean distances and then give examples. For point pattern analysis, areal/lattice data analysis, and spatial interaction analysis the reader is referred to the books mentioned above.

12.2 Geostatistics

In geostatistics D is a continuous subspace of R2 or R3 and the random variable (field) is observed at n > 2 fixed sites s1, s2, … , sn ∈ D. Typical examples include rainfall data, data on soil characteristics (porosity, humidity, etc.), oil and mineral exploration data, air quality, and groundwater data. In this chapter only one characteristic observation variable is measured per observation point on a line or plane. Multivariate geostatistics dealing with observation vectors per observation point is described in detail in Wackernagel (2010).

The concept of stationarity is key in the analysis of spatial and/or temporal variation: roughly speaking, stationarity means that the statistical characteristics (e.g. mean and variance) of the random variable of interest do not change over the considered area. However, testing for stationarity is not possible. For spatial prediction the performance of a stationary and a non‐stationary model could be compared through assessment of the accuracy of predictions.

In this chapter we assume that the random vectors yT = [y(s1), y(s2), … , y(sn) ] follow an n‐dimensional normal (Gaussian) distribution for any collection of spatial locations {s1, s2, … , sn} ⊂ D and any n ≥ 1. In the literature the collection of random variables {y(s) : s ∈ D} is then usually termed a Gaussian random field (GRF) over D. For other types of random fields and a detailed explanation of their mathematical structure and most important properties we refer to Cressie and Wikle (2011), where also extensions to so‐called spatio‐temporal random fields are considered. Often non‐normal random variables may be transformed by a so‐called Box–Cox transformation as a generalisation of the normal case by including an additional parameter λ. This transformation is given by:

12.4equation

A GRF is completely determined by its expectation (trend function)

equation

and covariance function C(si − sj) = cov[y(si), y(sj)] : i, j = 1, … , n.

Contrary to traditional statistics, in a geostatistical setting we usually observe only one realisation of y at a finite number of locations images. Therefore, the distribution underlying the random field cannot be inferred without imposing further assumptions. The simplest assumption is that of (strict) stationarity, which means that the normal distributions do not change when all positions are translated by the same (lag) vector h and this implies that (12.2) and (12.3) are valid.

Often (in areal/lattice data analysis) measurements are not related to points but to areas Ai, i = 1, … n . In such cases in place of distances between points measures of spatial proximity wij between two areas Ai and Aj are used and represented in a square n × n matrix W = (wij).

According to Bailey and Gatrell (1995) some possible criteria for determining proximities might be:

  • wij = 1 if Aj shares a common boundary with Ai and wij = 0 else.
  • wij = 1 if the centroid of Aj is one of the k nearest centroids to that of Ai and wij = 0 else.
  • wij = images if the inter‐centroid distance dij < δ (δ > 0, γ < 0); and wij = 0 else.
  • wij = images, where lij is the length of the common boundary between Aiand Aj and li is the perimeter of Ai.

All diagonal elements wii are set to 0. Note that the spatial proximity matrix W must not necessarily be symmetric.

For more proximity measures we refer to Bailey and Gatrell (1995) and any other publications on areal spatial analysis like Anselin and Griffith (1988).

12.2.1 Semi‐variogram Function

From now on, we focus on geostatistics and assume second order stationarity, i.e. (12.2) and (12.3) hold, and additionally, we assume isotropy. In geostatistics it is common to use the so‐called semi‐variogram function

12.5equation

Observing that under the assumption of stationarity it holds that

12.6equation

we can simply use the empirical moment estimate to estimate the semi‐variogram according to

12.7equation

where images denotes the number of sampling locations separated from each other by the images.

We call images the estimated semi‐variogram function or the sample semi‐variogram function. Since a vector h is uniquely determined by its length images and its direction, it is customary to form lag distance classes along given directions, usually this is done for the four main directions 0°, 45°, 90°, and 135°.

Then

equation

This is called the variogram function. Dividing by two leads to the semi‐variogram function

12.9equation

as a special case of (12.5).

For ‘classical’ estimation methods for the trend function and variogram parameters see Mase (2010), for Bayesian approaches we refer to Banerjee et al. (2014), Kazianka and Pilz (2010a) and Pilz et al. (2012). For non‐stationary variogram modelling we refer to the review provided by Sampson et al. (2001) and Schabenberger and Gotway (2005).

Theoretically, at zero separation distance (lag = 0), the semi‐variogram value γ(0) is zero, because images . However, at an infinitesimally small separation distance, the semi‐variogram function often exhibits a so‐called nugget effect, which is some value greater than zero. For example, if the semi‐variogram model intercepts the x2‐axis at 3, then the nugget effect is 3. Furthermore, often images is an asymptotic value of the semi‐variogram model called sill. The nugget effect can be attributed to measurement errors or spatial sources of variation at distances smaller than the sampling interval (or both). Measurement error occurs because of the error inherent in measurement devices. Natural phenomena can vary spatially over a range of scales. Variation at microscales smaller than the sampling distances will appear as part of the nugget effect. Before collecting data, it is important to gain some understanding of the scales of spatial variation.

12.2.2 Semi‐variogram Parameter Estimation

The unknown parameters occurring in the semi‐variogram models (12.10)–(12.14) can be estimated by the least squares method of Section 8.2.1.1 or in the case of GRF the maximum likelihood method could be applied.

12.2.3 Kriging

In geostatistics kriging is a method of interpolation to predict values of the variable of interest in D at positions where no measurement has been made. It is based on a master thesis of Krige (1951) and the mathematical foundation by Matheron (1963). Observed values are modelled by a GRF. Under suitable assumptions, kriging gives the best linear unbiased prediction (BLUP) of values not observed in the corresponding area. The basic idea of kriging is to predict the value of a function at a given point by computing a weighted average or linear combination of the observed values of the function in the neighbourhood of the point. The methods used are a kind of regression analysis in two dimensions. Basically, we distinguish between ordinary and universal kriging: ordinary kriging assumes a constant trend function, E(y(s)) = const for all s in D, whereas universal kriging assumes a non‐constant trend. Placing the problem in a stochastic framework permits precision‐defining optimality for estimations of unknown parameters from the random variables for which the measurements are realisations. A criterion imposed is that the estimator be unbiased, or that in an average sense the difference between the predicted value and the actual value is zero. Another optimality criterion is that the prediction variance be minimised. This variance (kriging variance) is defined to be the expectation of the average squared difference between predicted and actual values. The kriging estimator minimises this variance. This minimisation is performed algebraically and results in a set of equations known as the kriging equations – in ordinary kriging we call them ordinary kriging equations (OK system).

For ordinary kriging, we must make the following assumptions already made above:

  • yi; i = 1, … , n are normally distributed.
  • Second order stationarity, meaning, in particular, that the yi; i = 1, … , n all have the same constant mean and variance.

We restrict ourselves to observation points in D ⊂ R2 and the univariate case with one character measured at each of n points in D. In the case that in (12.1) μ(s) is known we speak about simple kriging. This is in many practical situations an unrealistic assumption. Therefore, we describe the so‐called ordinary kriging where μ(s) is assumed to be unknown and constant.

The kriging prediction at an unobserved location s0 ∈ D is then given by a linear combination

12.16equation

where the weights in λT = (λ1, … , λn) are determined as solutions of the OK system of the observations images.

Here G = (γ(‖si − sj‖))i, j = 1, … , n is the (n × n)‐semi‐variogram matrix of the observations,

equation

is the vector of semi‐variogram values, and λ0 is the Lagrange multiplier, which is necessary due to the minimisation under the equality constraint images, which, in turn, results from the unbiasedness condition E(images) = μ. The kriging variance then becomes σ2(s0) = var(images) = λ0 + λTγ0. We illustrate this with the following toy example where n = 4.

12.2.4 Trans‐Gaussian Kriging

Now we consider the case of non‐normal random fields. It is then often possible to transform the observation such that the transformed data are realisations of a normal distribution. A striking example of this is to apply the Box–Cox transformation (12.4). This transformation depends on an additional parameter λ. We can find an appropriate value of λ with the function boxcox(.) in the package geoR. The function boxcox(.) gives a graphical summary of the log‐likelihood function depending on λ and indicates a realised 95% confidence interval for λ. For λ we use an easy to interpret value from the set {λ = 1 (no transformation), λ = 0.5 (square root transformation), λ = 0 (logarithmic transformation), λ = −0.5 (inverse square root transformation), λ = −1 (reciprocal transformation)}. Kriging with Box–Cox transformed data is usually called trans‐Gaussian kriging in the literature. This type of kriging and Bayesian extensions thereof were introduced by De Oliveira et al. (1997). Spöck and Pilz (2015) deal with the sampling design problem for optimal prediction with non‐normal random fields and give an example of the optimal placement of monitoring stations for an existing rainfall monitoring network in Upper Austria.

The R package gstat also implements trans‐Gaussian kriging using a boxcox(.) function, which may be found in the library (MASS).

For illustration we now use the Meuse Data meuse.all from the package gstat. This data set gives locations and top soil heavy metal concentrations (ppm), along with a number of soil and landscape variables, collected in a flood plain of the river Meuse, near the village Stein. Heavy metal concentrations are bulk sampled from an area of approximately 15 × 15 m.

12.3 Special Problems and Outlook

We now briefly outline more recent developments which go beyond the traditional kriging and trans‐Gaussian kriging considered in Section 12.2.3.

12.3.1 Generalised Linear Models in Geostatistics

In the same way as we had extended linear regression models to generalised linear regression models in Chapter 11, we may take the step from spatial linear models to so‐called generalised linear geostatistical models (GLGMs). This allows us to model spatial random variables [with observations (realisations)] following a distribution from the exponential family. Spatial modelling and prediction for such observations is implemented in the R package geoRglm and is well described in Diggle and Ribeiro (2007), where in particular worked‐through examples of binomially and Poisson distributed environmental and public health data are considered. Extensions to hierarchical Bayesian GLGMs can be found in Banerjee et al. (2014).

12.3.2 Copula Based Geostatistical Prediction

The GLGM framework does not allow distributions outside the exponential family. In particular we cannot use it for heavy‐tailed/extreme value distributions, which have a much slower decay of probability in the tails than the normal distribution. A prominent example of such a distribution is the generalised extreme value distribution with distribution function

equation

with location, scale, and shape parameter μ, σ, and τ, respectively. Observations from non‐Gaussian, skewed or heavy‐tailed distributions are dealt with in the library (intamap). A general framework of handling such data are copulas which are distribution functions on the unit cube [0, 1]n with uniformly distributed margins, introduced by Sklar (1959). Copulas are invariant under strictly increasing transformations of the marginals; thus, frequently applied data transformations (e.g. square root and log transformations) do not change the copula. The relation between two locations separated by the lag‐vector h is characterised by the bivariate distribution with distribution function

12.23equation

The copula Ch thus becomes a function of the separating vector h. Spatial copulas have been introduced by Bardossy (2006) and Kazianka and Pilz (2010b). Spatial copulas describe the spatial dependence over the whole range of quantiles for a given separating vector h, not only the mean dependence, as the variogram does. The main difference to GLGMs is that these only model the means μi = E(y(si)), i = 1, … , n through some link function images, where Z(·) is a stationary GRF with mean zero and given covariance function. The spatial copula, however, allows us to build a complete multivariate distribution of the random variables whose realisations are the observed values [y(s1), … , y(sn)]. Recently, so‐called vine copula have been developed which extend the concept of the bivariate copula Ch in (12.23) to higher dimensions – see e.g. Gräler (2014). The corresponding R packages copula, spcopula and VineCopula allow flexible spatial data modelling. A Matlab toolbox for copula‐based spatial analysis is given in Kazianka (2013).

References

  1. Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In: 2nd International Symposium on Information Theory, Tsaghkadzor, Armenia, USSR, September 2–8, 1971 (ed. B.N. Petrov and F. Csáki), 267–281. Budapest: Akadémiai Kiadó.
  2. Anselin, L. and Griffith, D.A. (1988). Do spatial effects really matter in regression analysis? Reg. Sci. 65: 11–34.
  3. Bailey, T.C. and Gatrell, T. (1995). Interactive Spatial Data Analysis. London: Longman Scientific & Technical.
  4. Banerjee, S., Carlin, B.P., and Gelfand, A.E. (2014). Hierarchical Modeling and Analysis for Spatial Data, 2e. Boca Raton, Florida: CRC Press/Chapman & Hall.
  5. Bardossy, A. (2006). Copula‐based geostatistical models for groundwater quality parameters. Water Resour. Res. 42: W11416.
  6. Bivand, R.S., Pebesma, E.J., and Gomez‐Rubio, V. (2013). Applied Spatial Data Analysis with R, 2e. Berlin: Springer.
  7. Cressie, N.A.C. (1993). Statistics for Spatial Data. New York: Wiley.
  8. Cressie, N.A.C. and Wikle, C.K. (2011). Statistics of Spatio‐Temporal Data. New York: Wiley.
  9. Davis, J.C. (2002). Statistics and Data Analysis in Geology, 3e. New York: Wiley.
  10. De Oliveira, V., Kedem, B., and Short, D.A. (1997). Bayesian prediction of transformed Gaussian random fields. J. Am. Stat. Assoc. 92: 1422–1433.
  11. Diggle, P.J. (2010). Spatial point pattern. In: International Encyclopedia of Statistical Science, Volumes I, II, III (ed. M. Lovric), 1361–1363. Berlin: Springer.
  12. Diggle, P. and Ribeiro, P. (2007). Model‐Based Geostatistics. New York: Springer.
  13. Gaetan, C. and Guyon, H. (2010). Spatial Statistics and Modeling. New York: Springer.
  14. Gräler, B. (2014). Modelling skewed spatial random fields through the spatial vine copula. Spatial Stat. 10: 87–102.
  15. Kazianka, H. (2013). spatialCopula: a Matlab toolbox for copula‐based spatial analysis. Stochastic Environ. Res. Risk Assess. 27: 121–135.
  16. Kazianka, H. and Pilz, J. (2010a). Model‐based Geostatistics. In: International Encyclopedia of Statistical Science, Volumes I, II, III (ed. M. Lovric), 833–836. Berlin: Springer.
  17. Kazianka, H. and Pilz, J. (2010b). Copula‐based geostatistical modeling of continuous and discrete data including covariates. Stochastic Environ. Res. Risk Assess. 24: 661–673.
  18. Krige, D.G, (1951) A statistical approach to some mine valuations and allied problems at the Witwatersrand, Master's thesis of the University of Witwatersrand, Johannesburg, S.A..
  19. Krüger, L., (1912) Konforme Abbildung des Erdellipsoids in die Ebene, In: Veröff. Kgl. Preuß. Geod. Inst. Nr. 51.
  20. Lantuéjoul, C. (2002). Geostatistical Simulation. Models and Algorithms. Berlin: Springer.
  21. Mase, S. (2010). Geostatistics and kriging predictors. In: International Encyclopedia of Statistical Science, Volumes I, II, III (ed. M. Lovric), 609–612. Berlin: Springer.
  22. Matheron, G. (1963). Principles of Geostatistics. Econ. Geol. 58: 1246–1266.
  23. Müller, W. (2007). Collecting Spatial Data, 3e. Heidelberg: Springer.
  24. Pilz, J. (ed.) (2009). Interfacing Geostatistics and GIS. Berlin‐Heidelberg: Springer.
  25. Pilz, J. (2010). Spatial statistics. In: International Encyclopedia of Statistical Science, Volumes I, II ,III (ed. M. Lovric), 1363–1368. Berlin: Springer.
  26. Pilz, J., Kazianka, H., and Spöck, G. (2012). Some advances in Bayesian spatial prediction and sampling design. Spatial Stat. 1: 65–81.
  27. Rasch, D., Herrendörfer, G., Bock, J. et al. (eds.) (2008). Verfahrensbibliothek Versuchsplanung und ‐ auswertung, 2e. München Wien: R. Oldenbourg.
  28. Ripley, B.D. (1988). Statistical Inference for Spatial Processes. Cambridge, UK: Cambridge University Press.
  29. Sampson, P.D., Damien, D., and Guttorp, P. (2001). Advances in Modelling and Inference. London: Academic Press.
  30. Schabenberger, O. and Gotway, C.A. (2005). Statistical Methods for Spatial Data Analysis. Boca Raton: Chapman & Hall/CRC Press.
  31. Schwarz, G.E. (1978). Estimating the dimension of a model. Ann. Stat. 6: 461–464.
  32. Sklar, A. (1959). Fonctions de repartition á n dimensions et leurs marges. Publ. Inst. Statistique Univ. Paris 8: 229–231.
  33. Snow, J. (1855). On the Mode of Communication of Cholera. London: John Churchill.
  34. Spöck, G. and Pilz, J. (2010). Analysis of areal and spatial interaction data. In: International Encyclopedia of Statistical Science, Volumes I, II, III (ed. M. Lovric), 35–39. Berlin: Springer.
  35. Spöck, G. and Pilz, J. (2015). Taking account of covariance estimation uncertainty in spatial sampling design for prediction with trans‐Gaussian random fields. Front. Environ. Sci. 3: 39, 1–39, 22.
  36. Wackernagel, H. (2010). Multivariate Geostatistics. Heidelberg‐Berlin: Springer.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset
3.145.76.250