The Variogram: a spatial decomposition of variance
Suppose that we have measured the variables $z$ at $N$ locations, whose
geographical positions are denoted $\underline{x}_i$. Consider all pairs of
data locations, of which there are $N_p=\frac{N(N-1)}{2}$.
Now each pair of data locations is placed into a "lag class",
determined by a vector $\underline{h}$ (the "lag"), which means that
the two points are separated by (roughly) $\underline{h}$. Define
$N_c$ classes, each described by a set $P_{\underline{h}}$ of pairs of
indices, such that
$
(i,j) \in P_{\underline{h}}
\iff
\underline{x}_i-\underline{x}_j \approx \underline{h}.
$
The univariate variogram is defined as
$
\gamma(\underline{h}) = {\frac{1}{2}}E \left [(z
(\underline{x}+\underline{h})-z(\underline{x}))^2 \right ]
$
and the estimator of the variogram function for lag $\underline{h}$ is
$
\gamma(\underline{h})={\frac{1}{2N_{\underline{h}}}}\, \, \, \sum_{(i,j)\in P_{\underline{h}}}^{N_{\underline{h}}}(z_i-z_j)^2,
$
where $N_{\underline{h}}$ is the number of distinct pairs of data values,
placed in the set $P_{\underline{h}}$ (pairs displaced by the vector
$\underline{h}$). $\gamma(\underline{h})$ is sometimes called the
semi-variogram, because of the factor of 1/2 in its definition.
Now the sample variance, $s^2$, can be written as a weighted sum
$
s^2 = \sum_{c=1}^{N_c} \left({\frac{N_{\underline{h}}}{N_p}}\right) \left( {\frac{1}{2N_{\underline{h}}}}\, \, \,
\sum_{(i,j)\in P_{\underline{h}}}^{N_{\underline{h}}}
(z_{i}-z_j)^2 \right),
$
or
$
s^2 = \sum_{c=1}^{N_c} \gamma(\underline{h}) \left (\frac{N_{\underline{h}}}{N_p}}\right).
$
It is this equation which leads me to declare that
The sample variogram is the spatial decomposition of the sample variance.