Geostatistics is a branch of statistics focusing on spatial or spatiotemporal datasets. Developed originally to predict probability distributions of ore grades for mining operations,[1] it is currently applied in diverse disciplines including petroleum geology, hydrogeology, hydrology, meteorology, oceanography, geochemistry, geometallurgy, geography, forestry, environmental control, landscape ecology, soil science, and agriculture (esp. in precision farming). Geostatistics is applied in varied branches of geography, particularly those involving the spread of diseases (epidemiology), the practice of commerce and military planning (logistics), and the development of efficient spatial networks. Geostatistical algorithms are incorporated in many places, including geographic information systems (GIS).
Geostatistics is intimately related to interpolation methods, but extends far beyond simple interpolation problems. Geostatistical techniques rely on statistical models that are based on random function (or random variable) theory to model the uncertainty associated with spatial estimation and simulation.
A number of simpler interpolation methods/algorithms, such as inverse distance weighting, bilinear interpolation and nearest-neighbor interpolation, were already well known before geostatistics.[2] Geostatistics goes beyond the interpolation problem by considering the studied phenomenon at unknown locations as a set of correlated random variables.
By applying a single spatial model on an entire domain, one makes the assumption that Z is a stationary process. It means that the same statistical properties are applicable on the entire domain. Several geostatistical methods provide ways of relaxing this stationarity assumption.
A number of methods exist for both geostatistical estimation and multiple realizations approaches. Several reference books provide a comprehensive overview of the discipline.[6][2][7][8][9][10][11][12][13][14][15]
Kriging is a group of geostatistical techniques to interpolate the value of a random field (e.g., the elevation, z, of the landscape as a function of the geographic location) at an unobserved location from observations of its value at nearby locations.
Bayesian inference is a method of statistical inference in which Bayes' theorem is used to update a probability model as more evidence or information becomes available. Bayesian inference is playing an increasingly important role in Geostatistics.[16] Bayesian estimation implements kriging through a spatial process, most commonly a Gaussian process, and updates the process using Bayes' Theorem to calculate its posterior. High-dimensional Bayesian Geostatistics [17]
Considering the principle of conservation of probability, recurrent difference equations (finite difference equations) were used in conjunction with lattices to compute probabilities quantifying uncertainty about the geological structures. This procedure is a numerical alternative method to Markov chains and Bayesian models.[18]
Knowledge of ore grades and ore reserves as well as error estimation of these values, is fundamental for mining engineers and mining geologists. Until now no appropriate scientific approach to those estimation problems has existed: geostatistics, the principles of which are summarized in this paper, constitutes a new science leading to such an approach. The author criticizes classical statistical methods still in use, and shows some of the main results given by geostatistics. Any ore deposit evaluation as well as proper decision of starting mining operations should be preceded by a geostatistical investigation which may avoid economic failures.
Geostatistics provides many different tools for specific problems in spatial prediction. These tools come from many fields of application and are documented in many forms; however, there are few places for the geomodeler to find specific advice on important decisions in geostatistics. Geostatistics Lessons is a collection of brief lessons providing guidance in geostatistical modeling.
It is not possible to consider all geological situations, problem settings, project goals and data. Nevertheless, there is value in experienced geostatisticians publishing their belief of what constitutes best practice. Those beliefs will change as experience is gained; Geostatistics Lessons will be updated as new lessons are authored, revised and reviewed.
Following the successful conferences in Biarritz (France) in 2015 and Florence (Italy) in 2019, we are proud to announce the 5th edition of the EAGE Conference on Petroleum Geostatistics. This conference focuses on new methods and applications in the field of geostatistics for the petroleum industry.
With this ediition, we are looking to spark new waves of geostatistics capabilities addressing the challenges related to carbon neutrality and the energy transition, while leveraging on the strong expertise in petroleum geostatistics.
We welcome new topics, including CO2/H2 geostorage, geothermal applications, near-surface applications, and other energy resources, but also expect to see the recent advancements in petroleum geostatistics.
Where do statistics, spatial statistics, and geostatistics fit in GIS projects? Dr. Lauren Scott, a product engineer on Esri's geoprocessing team and an expert in the use of statistics in a geospatial context, answers that question and others in an interview conducted by Matt Artz, Esri's GIS and science marketing manager and editor of the GISandScience.com blog.
At Esri, Scott is responsible for software support, education, documentation, and development of spatial statistics tools in ArcGIS. She received her Ph.D. in 1999 from the Joint Doctoral Program at San Diego State University and the University of California, Santa Barbara. She holds an M.A. and a B.A. in geography from California State University, Fullerton.
Scott: Traditional or non-spatial statistics are typically used in two different ways. In the first case, we have a large set of data values that we want to understand, and we can use descriptive statistics to try to summarize them. In the second case, we may have a set of samples and we want to know how reflective those samples are of the broader population.
Scott: Spatial statistics were designed specifically for use with spatial data-with geographic data. These methods actually use space-area, length, proximity, direction, orientation, or some notion of how the features in a dataset interact with each other-right in the mathematics. That's really what makes spatial statistics different from traditional statistical methods.
Scott: Yes, there are many different types. There are descriptive spatial statistics similar to descriptive traditional statistics. For example, if we have lots of points on the map, we might want to know where the center of those points is located. (The equivalent traditional statistic would involve computing the mean for a set of data values.) We might also want to know how spread out those points are around the center. (This is similar to computing the standard deviation for a set of values.)
Other statistical methods involve spatial pattern analysis: We try to identify if there is any structure to the data we're looking at-for example, are features clustered? Are they dispersed? Are high values all found together? Are there "hot spots" in the data? Spatial pattern analysis tools can help us to identify anomalous or unusual spending patterns, find unexpected areas with high disease rates, crime, or fire incidents, or track diffusion of some environmental contaminant. There are really lots of applications.
Then there are spatial statistics concerned with identifying and measuring spatial relationships. Imagine we are looking at a hot spot map for 911 calls. We might be curious about why we are seeing so many calls, or hot spots, in certain locations. We can use regression and spatial regression analysis to examine relationships and to identify the factors promoting the spatial pattern we're observing-factors that would help us explain why 911 rates are so high.
Scott: Geostatistics are a type of spatial statistics. Kriging, for example, is a very powerful geostatistical technique that goes beyond interpolation, looking not only at nearby features to predict values where you don't have sample data, but actually utilizing spatial relationships to give you stronger, more accurate predictions.
Traditionally, geostatistics have been used to analyze geologic and environmental data-for example, rainfall, or elevation-the goal being to create a surface from sampled data points. These methods are widely used in the petroleum and mining industries. But geostatistics are ideal for analyzing and predicting the values associated with nearly any kind of spatially continuous phenomena.
Scott: Many people have probably heard of the ArcGIS Geostatistical Analyst extension, a specialized set of geostatistical tools. It's most useful if you're working with sample data taken from a continuous phenomenon such as rainfall, temperature, geology, or soils and your goal is to create a surface-a probability surface, a prediction surface, or an error surface. However, as the product has been enhanced over the years, its capabilities now extend beyond creating surfaces and the tools are valuable for a large variety of applications.
All ArcGIS users also get the Spatial Statistics Toolbox with tools for analyzing spatial distributions, patterns, processes, and relationships as part of the core software at all license levels. These statistical tools let you do a number of things, including determining central tendency or identifying the overarching directional trend, identifying hot and cold spots or spatial outliers, assessing overall patterns of clustering or dispersion, and modeling spatial relationships. I'm so happy with how many people now use these tools! When I first started developing the Spatial Statistics Toolbox as a set of sample scripts, I didn't really envision how successful they would become.
795a8134c1