Ripley's K function

Ripley's K function is a statistic useful for point pattern analysis. Here are more details on the use of Ripley's K, and other more general functions, while this document summarizes the information I used to create the test in GeoMed.

Ripley's K function for the data is compared with the expected line y=x under Poisson distribution of the points (actually, a scaled version of K is used, called L: L is the square root of the quotient of K and pi). In addition, simulated Poisson fields of the same N for the same size study area are generated, as are corresponding Ripley's K functions; these give some idea of how extreme the result of the data's Ripley's K function is.

Example: towns data, the sample data for the use of Ripley's K function in Splus.

In this data, a distance of 10 is about a quarter of the way across the point space in either direction.

We begin with a plot of the data:

When I look at this pattern does not look significantly different from a Poisson pattern; when someone with a little more experience (like Dr. Art Getis) looks at it, they find that the pattern appears roughly evenly spaced in a roughly homogeneous plane (i.e. more regular than a Poisson process would produce).
This first plot is produced in R; the envelope shown is for 100 Poisson simulations (only 10 simulations are actually shown).
This second plot is the related GeoMed output, where we show the same information (although the blue envelope is for the 10 simulations shown). The actual results from the data are given by the black function, with the points added on to show the 100 subintervals for which the function is plotted. The green and red in the center represent the average simulation and the function f(x)=x, respectively.
The third plot was created in Xlispstat using the output of the web version of Dr. Getis's PPA software. PPA doesn't allow any adjustment of the region, but uses the minimum bounding rectangle (i.e., uses as rectangular boundaries the max and min x, y points). This was a convenience for the programmers, and Dr. Getis notes that one should use the actual study boundaries. In R one of the inputs is the bounding rectangular region for the point data; I've included this option in the GeoMed software.
What conclusion might we draw? The fact that the K function of the data seems low by comparison with the Poisson simulated plots for small values of h might be interpreted as follows: towns are not likely to be located too close to each other, for, as Dr. Getis says, "the competitive economies of towns, especially in agricultural areas, requires that they be dispersed rather than random or clustered."

Example: Diggle's data

an additional sample data set (provided by Peter Diggle for entirely other purposes!). In this case, a distance of 10 is about half way across the point space in either direction.

Again, we begin with a plot of the data:

As you can see, this pattern is significantly different from a Poisson pattern, with a very high intensity of cases close together.
What conclusion might we draw? Obviously the Diggle data is significantly non-Poisson; the data cluster much more than expected under Complete Spatial Randomness, at least for scales below 10.
Dr. Getis makes the following observation: "the distance at which the L(t) observation is above the random expectation (the dashed line) defines the extent of the clustering. In this case, I roughly make it out to be at a distance of 2. When the slope of the observed function becomes less than the random expectation, there is no longer a tendency for clustering. Notice that the clusters are about 2 in diameter."

Acknowledgement: Thanks to Dr. Art Getis for taking a critical look at this analysis, and making many useful suggestions.

Dr. Getis, of San Diego State University's Department of Geography and an expert and author in the field of spatial statistics, suggests that another comparison plot that is of interest is the comparison of two plots, e.g., a disease distribution and a susceptibles distribution.

Website maintained by Andy Long. Comments appreciated.

longa@nku.edu