The VARIOGRAM Procedure

Characteristics of Semivariogram Models

When you obtain a valid empirical estimate of the theoretical semivariance, it is then necessary to choose a type of theoretical semivariogram model based on that estimate. Commonly used theoretical semivariogram shapes rise monotonically as a function of distance. The shape is typically characterized in terms of particular parameters; these are the range a 0, the sill (or scale) c 0, and the nugget effect c Subscript n. Figure 18 displays a theoretical semivariogram of a spherical semivariance model and points out the semivariogram characteristics.

Figure 18: A Theoretical Semivariogram of Spherical Type and Its Characteristics

 A Theoretical Semivariogram of Spherical Type and Its Characteristics


Specifically, the sill is the semivariogram upper bound. The range a 0 denotes the distance at which the semivariogram reaches the sill. When the semivariogram increases asymptotically toward its sill value, as occurs in the exponential and Gaussian semivariogram models, the term effective (or practical) range is also used. The effective range r Subscript epsilon is defined as the distance at which there is essentially no lingering spatial correlation, which is usually considered to be the distance at which the semivariance value achieves 95% of the sill. This is based on the interpretation of effective range as provided in Banerjee, Carlin, and Gelfand (2014, pp. 26–27). In particular, for these models the relationship between the range and effective range is r Subscript epsilon Baseline equals 3 a 0 (exponential model) and r Subscript epsilon Baseline equals StartRoot 3 EndRoot a 0 (Gaussian model).

The nugget effect c Subscript n represents a discontinuity of the semivariogram that can be present at the origin. It is typically attributed to microscale effects or measurement errors. The semivariance is always 0 at distance bold-italic h equals 0; hence, the nugget effect demonstrates itself as a jump in the semivariance as soon as bold-italic h greater-than 0 (note in Figure 18 the discontinuity of the function at bold-italic h equals 0 in the presence of a nugget effect).

The sill c 0 consists of the nugget effect, if present, and the partial sill sigma 0 squared; that is, c 0 equals c Subscript n Baseline plus sigma 0 squared. If the SRF upper Z left-parenthesis bold-italic s right-parenthesis is second-order stationary (see the section Stationarity), the estimate of the sill is an estimate of the constant variance normal upper V normal a normal r left-bracket upper Z left-parenthesis bold-italic s right-parenthesis right-bracket of the field. Nonstationary processes have variances that depend on the location bold-italic s. Their semivariance increases with distance; hence their semivariograms have no sill.

Not every function is a suitable candidate for a theoretical semivariogram model. The semivariance function gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis, as defined in the following section, is a so-called conditionally negative-definite function that satisfies (Cressie 1993, p. 60)

sigma-summation Underscript i equals 1 Overscript m Endscripts sigma-summation Underscript j equals i Overscript m Endscripts q Subscript i Baseline q Subscript j Baseline gamma Subscript z Baseline left-parenthesis bold-italic s Subscript i Baseline minus bold-italic s Subscript j Baseline right-parenthesis less-than-or-equal-to 0

for any number m of locations bold-italic s Subscript i, bold-italic s Subscript j in script upper R squared with bold-italic h equals bold-italic s Subscript i Baseline minus bold-italic s Subscript j, and any real numbers q Subscript i such that sigma-summation Underscript i equals 1 Overscript m Endscripts q Subscript i Baseline equals 0. PROC VARIOGRAM can use a variety of permissible theoretical semivariogram models. Specifically, Table 4 shows a list of such models that you can use for fitting in the MODEL statement of the VARIOGRAM procedure.

Table 4: Permissible Theoretical Semivariogram Models (a 0 greater-than 0, unless noted otherwise)

Model Type Semivariance
Exponential gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket 1 minus exp left-parenthesis minus StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis right-bracket 2nd Column if 0 less-than bar bold-italic h bar EndLayout
Gaussian gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket 1 minus exp left-parenthesis minus StartFraction bar bold-italic h bar Over a 0 squared EndFraction right-parenthesis right-bracket 2nd Column if 0 less-than bar bold-italic h bar EndLayout
Power gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared bold-italic h Superscript a 0 Baseline 2nd Column if 0 less-than bar bold-italic h bar comma 0 less-than-or-equal-to a 0 less-than 2 EndLayout
Spherical gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket three-halves StartFraction bar bold-italic h bar Over a 0 EndFraction minus one-half left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis cubed right-bracket 2nd Column if 0 less-than bar bold-italic h bar less-than-or-equal-to a 0 3rd Row 1st Column c 0 2nd Column if a 0 less-than bar bold-italic h bar EndLayout
Cubic gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket 7 left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis squared minus StartFraction 35 Over 4 EndFraction left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis cubed plus seven-halves left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis Superscript 5 Baseline minus three-fourths left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis Superscript 7 Baseline right-bracket 2nd Column if 0 less-than bar bold-italic h bar less-than-or-equal-to a 0 3rd Row 1st Column c 0 2nd Column if a 0 less-than bar bold-italic h bar EndLayout
Pentaspherical gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket StartFraction 15 Over 8 EndFraction StartFraction bar bold-italic h bar Over a 0 EndFraction minus five-fourths left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis cubed plus three-eighths left-parenthesis StartFraction bar bold-italic h bar Over a 0 EndFraction right-parenthesis Superscript 5 Baseline right-bracket 2nd Column if 0 less-than bar bold-italic h bar less-than-or-equal-to a 0 3rd Row 1st Column c 0 2nd Column if a 0 less-than bar bold-italic h bar EndLayout
Sine hole effect gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket 1 minus StartFraction sine left-parenthesis pi bar bold-italic h bar slash a 0 right-parenthesis Over pi bar bold-italic h bar slash a 0 EndFraction right-bracket 2nd Column if 0 less-than bar bold-italic h bar EndLayout
Matérn class gamma Subscript z Baseline left-parenthesis bold-italic h right-parenthesis equals StartLayout Enlarged left-brace 1st Row 1st Column 0 2nd Column if bar bold-italic h bar equals 0 2nd Row 1st Column c Subscript n Baseline plus sigma 0 squared left-bracket 1 minus StartFraction 2 Over normal upper Gamma left-parenthesis nu right-parenthesis EndFraction left-parenthesis StartFraction bar bold-italic h bar StartRoot nu EndRoot Over a 0 EndFraction right-parenthesis Superscript nu Baseline upper K Subscript nu Baseline left-parenthesis 2 StartFraction bar bold-italic h bar StartRoot nu EndRoot Over a 0 EndFraction right-parenthesis right-bracket 2nd Column if 0 less-than bar bold-italic h bar comma nu greater-than 0 EndLayout


All of these models, except for the power model, are transitive. A transitive model characterizes a random process whose variation reaches the sill value c 0 within a specific range from any location in the field.

The power model is nontransitive and applies to processes whose variance increases with distance. It has no scale and range; instead, it quantifies the process variation by using a positive slope parameter and a dimensionless power exponent alpha that indicate how fast the variance increases. The expression for the power model is a valid semivariogram only when the exponent parameter ranges within 0 less-than-or-equal-to alpha less-than 2. For convenience, PROC VARIOGRAM registers the power model slope parameter under the SCALE= option parameters in the MODEL statement. For the same reason, the scale and power slope parameters are represented with the common symbol sigma 0 squared in Table 4. Also for convenience, PROC VARIOGRAM registers the power model exponent parameter under the RANGE= option parameters. The range and the power exponent parameters are represented with the common symbol a 0 in Table 4.

The power model is a generalized case of the linear model, which is not included explicitly in the model set of PROC VARIOGRAM. The linear model is derived from the power model when you specify the exponent alpha equals 1.

Among the models displayed in Table 4, the Matérn (or K-Bessel) class is a class of semivariance models that distinguish from each other by means of the positive smoothing parameter nu. Different values of nu correspond to different correlation models. Most notably, for nu equals 0.5 the Matérn semivariance is equivalent to the exponential model, whereas nu right-arrow normal infinity gives the Gaussian model. Also, Table 4 shows that the Matérn semivariance computations use the gamma function normal upper Gamma left-parenthesis nu right-parenthesis and the second kind Bessel function upper K Subscript nu.

In PROC VARIOGRAM, you can input the model parameter values either explicitly as arguments of options, or as lists of values. In the latter case, you are expected to provide the values in the order the models are specified in the SAS statements, and furthermore in the sequential order of the scale, range, and smoothing parameter for each model as appropriate, and always starting with the nugget effect. If the parameter values are specified through an input file, then the total of n parameters should be provided either as one variable named Estimate or as many variables with the respective names Parm1Parmn.

You can review in further detail the models shown in Table 4 in the section Theoretical Semivariogram Models in Chapter 74, The KRIGE2D Procedure.

The theoretical semivariogram models are used to describe the spatial structure of random processes. Based on their shape and characteristics, the semivariograms of these models can provide a plethora of information (Christakos 1992, section 7.3):

  • Examination of the semivariogram variation in different directions provides information about the isotropy of the random process. (See also the discussion about isotropy in the following section.)

  • The semivariogram range determines the zone of influence that extends from any given location. Values at surrounding locations within this zone are correlated with the value at the specific location by means of the particular semivariogram.

  • The semivariogram behavior at large distances indicates the degree of stationarity of the process. In particular, an asymptotic behavior suggests a stationary process, whereas either a linear increase and slow convergence to the sill or a fast increase is an indicator of nonstationarity.

  • The semivariogram behavior close to the origin indicates the degree of regularity of the process variation. Specifically, a parabolic behavior at the origin implies a very regular spatial variation, whereas a linear behavior characterizes a nonsmooth process. The presence of a nugget effect is additional evidence of irregularity in the process.

  • The semivariogram behavior within the range provides description of potential periodicities or anomalies in the spatial process.

A brief note on terminology: In some fields (for example, geostatistics) the term homogeneity is sometimes used instead of stationarity in spatial analysis; however, in statistics homogeneity is defined differently (Banerjee, Carlin, and Gelfand 2004, section 2.1.3). In particular, the alternative terminology characterizes as homogeneous the stationary SRF in script upper R Superscript n Baseline comma n greater-than 1, whereas it retains the term stationary for such SRF in script upper R Superscript 1 (SRF in script upper R Superscript 1 are also known as random processes). Often, studies in a single dimension refer to temporal processes; hence, you might see time-stationary random processes called "temporally stationary" or simply stationary, and stationary SRF in script upper R Superscript n Baseline comma n greater-than 1, characterized as "spatially homogeneous" or simply homogeneous. This distinction made by the alternative nomenclature is more evident in spatiotemporal random fields (S/TRF), where the different terms clarify whether stationarity applies in the spatial or the temporal part of the S/TRF.

Last updated: December 09, 2022