1. Geospatial Data#

1.1. Topics#

  • Geospatial data

  • Measurement and reference systems

  • Mapping and measurement frameworks

  • Map as stored function

  • Types of maps

1.2. Geospatial/Geographic Data#

  • Data about things on or close to the Earth’s surface

  • Terminology: Spatial, geospatial, or geographic data

  • Scale: Operates at an Earth or planet scale

    • Spatial location

    • Spatial > geospatial

    • Geospatial = geographical

  • Three components

    • Attribute (what)

    • Location (where)

    • Time (when)

1.3. Why Do We Need Geospatial Data?#

  • Life is spatial

    • Life itself (matter) takes up space

    • It distinguishes itself from its environment

    • It interacts with the environment

    • It lives in space

  • Spatial intelligence

    • One of Howard Gardner’s multiple intelligences

    • Involves immediate sensory perception

  • Geographical intelligence

    • Insight beyond the line-of-sight

    • Insight beyond sensory perception

[Image of Howard Gardner’s multiple intelligences theory]

1.4. Life and Information#

  • What is life?

    • The basis of life is information processing

    • Life collects, stores, and uses information

    • Applies to biological and artificial life, organizations, and the economy

  • What is information?

    • Something that allows an observer to make predictions about the state of another system that are better than chance

  • Information vs. Data

    • Data becomes information when it enables better-than-chance predictions about another system

1.5. Measurement and Reference Systems#

  • Measurement

    • Classical physicist perspective: Provides a numerical relationship between a standard object and the object being measured

    • Representationalists perspective: Assignment of numbers to objects according to certain rules

    • Useful data includes both the numbers and the standard objects or rules used

  • Measurement reference systems

    • A set of rules for measurement or standard objects

    • Provides a means to compare a particular number to others measured under the same set of rules

1.6. Measurement Reference Systems#

  • Reference systems exist for attribute, time, and space

  • Temporal reference systems

    • Local vs. global time

    • Gregorian vs. lunar calendars

  • Attribute reference systems

    • Units (standard objects)

    • Levels of measurement: Nominal, Ordinal, Interval, and Ratio

  • Running race example

    • A number to wear (Nominal)

    • The order of finishing the race (Ordinal)

    • Arrival time (Interval)

    • Elapsed running time (Ratio)

Su

Mo

Tu

We

Th

Fr

Sa

31

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

1

2

3

1.7. Spatial Reference Systems#

  • Location distinguishes geospatial data from other types of data

    • Specifying location on the Earth’s surface is essential

  • Geospatial coordinate systems are designed to:

    • Define locations on or close to the Earth’s irregular curved surface

    • Ensure each location is uniquely represented

    • Allow calculations of relationships such as distance and direction

1.8. Geographic Coordinate Systems (GCS)#

  • Based on a spheroid model of the Earth

  • Uses two angular measurements (latitude and longitude) to specify a location

  • Components of a GCS:

Geographic Coordinate System

Datum (Sphere or Ellipsoid)

Unit of Measure (DMS, DD, or Radian)

Prime Meridian

[Image of geographic coordinate system latitude and longitude]

1.9. (Horizontal) Datum#

  • A datum specifies a sphere or ellipsoid used to approximate the Earth’s shape and size

  • It defines how that model is aligned with the Earth

  • Latitude and longitude are defined based on specific datums

  • The same location may have different coordinates depending on the datum used

    • Earth-centered (WGS84) datum

    • Local (NAD27) datum

1.10. Projected Coordinate Systems (PCS)#

  • A PCS consists of:

Projected Coordinate System

A linear unit of measure (usually meters or feet)

A map projection with specific parameters

An underlying Geographic Coordinate System

1.11. GCS vs. PCS#

  • PCS is suitable for small areas

  • PCS always contains some form of distortion

  • GCS calculations for distance and area are more accurate but more complex

    • Uses spherical geometry

  • Most GIS analytical functions are implemented based on PCS

  • Use PCS primarily for printing or display on flat media

1.12. Height and Vertical Datum#

  • The vertical datum is the zero surface to which elevation refers

  • Geoids (equipotential surfaces) are standard zero surfaces

  • Ellipsoids can also be used, such as in GPS readings

  • The same location may have different elevation values based on different vertical datums

1.13. Mapping—Creating Geographic Data#

  • The process involves turning things on or near the Earth’s surface into data

    • Representing reality as numbers in a computer

  • What is measured:

    • Objects and fields on or close to the Earth’s surface

    • Location, attribute, and time components

  • Considerations and constraints:

    • Purpose and resources/cost

    • Resolution and precision

    • Limitations of finite computing systems

1.14. Mapping and Measurement Frameworks#

  • Geospatial data consists of location, attribute, and time components

    • All three components cannot be measured simultaneously

  • Measurement frameworks provide rules for ignoring or controlling some components to measure one

    • One component must be fixed (irrelevant)

    • Another component serves as control (imposes discrete divisions)

  • Discrete divisions constrain the resolution and accuracy of the measurement

1.15. Common Mapping Measurement Frameworks#

  • Time is fixed: Measured at a specific point in time or during a period

  • Location as control: Space is divided into cells, polygons, or administrative boundaries

    • Attributes are then measured for these divisions

  • Attribute as control: Specific attributes (e.g., land cover types or elevations) are chosen

    • Locations are then measured for these attributes

1.16. Mapping Surface Elevation—Raster Map#

  • Location serves as the control

    • Space is systematically divided into cells

  • Attributes are measured at each cell

    • Rules like min, max, or mean may be applied

  • Cell values are typically stored as a matrix

1.17. Mapping Surface Elevation—Vector Map#

  • Attribute serves as the control

    • Focuses on discrete elevations (e.g., every 100m)

  • Locations are measured for each elevation (e.g., contour lines)

  • Location-value pairs are stored as rows in a table

ID

Location/Geometry

Elevation

1

(x1,y1), (x2,y2) … (xn, yn)

100

2

(x1,y1), (x2,y2) … (xm, ym)

200

10

(x1,y1), (x2,y2) … (xk, yk)

1000

1.18. Vector Census Map#

  • Location serves as the control

    • Uses census blocks or aggregated boundaries

  • Population is counted based on set rules

1.19. Function and Map#

  • A function associates each element x of a domain to a single element y of a codomain

  • Analogy for maps: Location represents the domain, and the measured attribute represents the codomain

1.20. Stored Function#

  • Relations are explicitly stored when they cannot be represented mathematically

  • A stored function acts as a lookup table

  • Locations must be discretized (mapping units) due to finite computing power

1.21. Map as Stored Function#

  • Relation between location and attribute: \(Attribute = f(location)\)

  • Domain: Location on Earth

  • Codomain: Any measured attributes/variables

  • Map is a stored function because location-value relations usually lack mathematical formulas

1.22. Raster Map#

  • Matrix serves as a stored function

    • Each cell is located by its row and column

  • Each cell represents a location-value relation

  • Usually, the upper-left corner cell is georeferenced; other locations are derived

1.23. Zone Raster Map#

  • Cells with the same value form a “zone”

  • The zone serves as the mapping unit location

  • Typically includes an attribute table with counts/histograms

Zone ID

Count

Value

0

50

200

1

50

325.6

1.24. Predicting Attribute Value at a Location#

  • Map as a function: Prediction is a fundamental capability (\(v = m(l)\))

  • Estimation/Interpolation is necessary when the target mapping unit differs from stored units

1.25. Interpolating Attribute Value#

  • Land cover interpolation: Uses majority types for target polygons

  • Population interpolation: Uses areal interpolation

  • Attribute types:

    • Intensive: Magnitude independent of size (e.g., density)

    • Extensive: Magnitude is additive (e.g., mass, population)

  • Spatially intensive: Value is the same everywhere within a mapping unit

  • Spatially extensive: Value changes if the mapping unit is modified

1.26. Multi-valued Vector Map#

  • Results from combining multiple single-valued maps (e.g., Soil + Land Cover)

  • Each mapping unit represents a set of homogeneous attributes

Map

Attributes

Soil

A, B

Land cover

C, D

Combined

(A, C), (A, D), (B, C), (B, D)

1.27. Multi-Valued Raster#

  • Each cell contains a set of attribute values (e.g., multi-band remote sensing)

  • Storage methods: Band sequential (BSQ), Band interleaved by line (BIL), and Band interleaved by pixel (BIP)

1.28. Meta-map: Map of Spatial Relationships#

  • Spatial relationship involves the exchange of matter, energy, and information between locations

    • Examples include watersheds and viewsheds

  • Types of map relations:

    • Single-valued: {Location, value}

    • Multi-valued: {Location, {values}}

    • Meta-map: {Location, {(location, {values})}} (a map of maps)

1.29. Flood Inundation Relations#

  • Floodplain pixels (FPP) can be flooded by flood source pixels (FSP) through backfill and spillover

  • Depth to flood (DTF): The minimum depth at an FSP required to flood a specific FPP

  • The FLDPLN model identifies these relations and DTFs through an iterative process