Introduction to GIS and Spatial Thinking
Geographic Information Systems (GIS) and spatial thinking form the foundation of modern water resource management. This section provides a comprehensive, deep-dive into what GIS is, its core structural components, how it compares to other analytical software, the principles of spatial thinking, and how these concepts are applied to solve real-world river basin challenges at the Water and Energy Commission Secretariat (WECS).
Presentation Slides
You can download or view the lecture slides for this topic: Spatial_Hydrology_Framework.pdf
1. What is GIS? A Deep-Dive Framework
At its core, a Geographic Information System (GIS) is an integrated suite of database, graphic, and analytical tools designed to capture, store, manipulate, analyze, manage, and present all types of geographical and spatial data.
To truly understand GIS, we must differentiate between Spatial Data (where something is) and Aspatial/Attribute Data (what something is). In traditional spreadsheets or databases, data is stored as disconnected rows and columns. In a GIS, every tabular record is explicitly tied to a coordinate geometry in physical space.
The Core Concept: Layering the World
The defining mechanism of GIS is the overlay concept (originally popularized by landscape architect Ian McHarg). Instead of representing the real world as a single, complex map, GIS breaks the environment down into thematic spatial layers. Each layer represents a specific class of features (e.g., rivers, soils, elevation, roads, administrative boundaries). By stacking these layers on top of one another using a common coordinate reference system, we can analyze the relationships between them.
graph TD
subgraph Real World Landscape
RW["Physical Terrain, Rivers,<br/>Land Use, Roads"]
end
subgraph GIS Layer Stack
L1["Roads Layer<br/>(Vector Line)"]
L2["Land Use Layer<br/>(Vector Polygon)"]
L3["Elevation Layer<br/>(Raster DEM)"]
L4["Hydrology Layer<br/>(Vector/Raster)"]
end
RW --> L4
RW --> L3
RW --> L2
RW --> L1
L1 --> OverlayAnalysis["Spatial Overlay &<br/>Intersection Analysis"]
L2 --> OverlayAnalysis
L3 --> OverlayAnalysis
L4 --> OverlayAnalysis
style RW fill:#f9f9f9,stroke:#333,stroke-width:2px
style OverlayAnalysis fill:#ffecd2,stroke:#fcb045,stroke-width:2px
Tobler's First Law of Geography
All spatial thinking and modeling in GIS are governed by Tobler's First Law of Geography:
"Everything is related to everything else, but near things are more related than distant things."
In hydrology, this law manifests continuously:
-
Rainfall in one headwater sub-catchment impacts the streamflow of adjacent streams more than streams in a separate river basin.
-
Runoff coefficients are highly correlated within continuous land-cover blocks.
-
Elevation values at a given point are highly dependent on the elevation values of neighboring pixels (spatial autocorrelation).
Spatial Representation: Discrete Objects vs. Continuous Fields
GIS represents spatial phenomena in two fundamental ways:
-
Discrete Object Model (Vector): Represents the world as distinct objects with clear boundaries (e.g., a water quality monitoring station as a point, a river channel as a line, or a lake boundary as a polygon).
-
Continuous Field Model (Raster): Represents phenomena that vary continuously across space without distinct boundaries (e.g., terrain elevation, soil moisture, surface temperature, or precipitation grids).
2. The Five Key Components of GIS
A GIS is not merely "mapping software." It is a dynamic system composed of five critical parts:
graph TD
GIS["Geographic Information System"]
GIS --> HW["1. Hardware"]
GIS --> SW["2. Software"]
GIS --> DT["3. Data"]
GIS --> MD["4. Methods"]
GIS --> PP["5. People"]
HW --> HW_D["CPUs/GPUs, Mobile GPS,<br/>Storage & Cloud Servers"]
SW --> SW_D["QGIS, ArcGIS, GRASS,<br/>SAGA, GDAL, PostGIS"]
DT --> DT_D["Vector/Raster Files,<br/>Databases & Tables"]
MD --> MD_D["Delineation, Projections,<br/>Spatial Interpolation"]
PP --> PP_D["Field Staff, Analysts,<br/>Hydrologists & Managers"]
style GIS fill:#caebea,stroke:#333,stroke-width:2px,color:#fff
style DT fill:#f4f9ff,stroke:#0066cc,stroke-width:1px
style MD fill:#f4f9ff,stroke:#0066cc,stroke-width:1px
1. Hardware
The computational demands of GIS require specialized hardware:
-
Processing Power (CPU/GPU): High-speed multi-core processors are necessary for running heavy spatial algorithms (like raster interpolation, viewshed analysis, and hydrodynamic routing). GPUs are increasingly used to accelerate parallel spatial computations.
-
Storage Systems: Spatial datasets (such as multi-temporal satellite imagery, high-resolution LiDAR DEMs, and global climate grids) consume gigabytes or terabytes of space. Fast solid-state drives (SSDs) and network-attached storage (NAS) are essential.
-
Data Capture Devices: GPS/GNSS receivers, drone sensors, and digitizing tablets used to collect coordinates in the field.
2. Software
GIS software provides the tools to store, analyze, and display spatial data:
-
Desktop GIS: Software like QGIS (Free and Open Source) or ArcGIS Pro (Proprietary). These provide graphical interfaces for spatial editing, styling, layout design, and processing.
-
Spatial Database Engines: Databases like PostgreSQL with the PostGIS extension. They allow writing spatial SQL queries (e.g., finding all water wells within 500 meters of a polluted river stretch).
-
Geospatial Libraries: Back-end libraries like GDAL/OGR (for raster/vector translation), PROJ (for coordinate transformations), and GEOS (for topology operations). These libraries power QGIS and custom python scripts.
3. Data
Geospatial data is divided into two primary types, which will be covered in detail on subsequent days:
-
Vector Data: Uses coordinates (\(X, Y\) and sometimes \(Z\)) to define points, lines, and polygons.
-
Points: Coordinate pairs representing zero-dimensional features (e.g., rain gauges, spring locations).

-
Lines: Connected sequences of points representing one-dimensional features (e.g., streams, roads).

-
Polygons: Closed loops representing two-dimensional areas (e.g., reservoirs, soil zones, watersheds).

-
-
Raster Data: Consists of a matrix of cells (or pixels) organized into a grid. Each cell contains a value representing the phenomenon at that location (e.g., elevation, land cover class, daily rainfall in millimeters).

4. Methods and Workflows
The analytical workflows, equations, and steps applied to solve spatial problems. For example:
-
Spatial Interpolation: Calculating rainfall values at unsampled locations using data from a limited number of weather stations (e.g., using Inverse Distance Weighting or Kriging).
-
Overlay Analysis: Intersecting a slope map, soil map, and land-use map to find optimal locations for groundwater recharge basins.
-
Terrain Routing: Algorithms that calculate how water flows from cell to cell based on slopes.
5. People
The most critical component. Even the most advanced hardware and software are useless without:
-
GIS Analysts & Developers: Who build the databases, write Python automation scripts, and run complex models.
-
Domain Experts (Hydrologists/Engineers): Who understand physical processes (like stream discharge or soil saturation) and translate physical models into GIS workflows.
-
Decision Makers: Managers and policymakers who interpret maps to make choices regarding water allocation, flood response, and infrastructure development.
3. Tool Comparison: GIS vs. CAD vs. MIS vs. Excel
Water resource projects often involve multiple teams using different tools. Understanding the strengths and boundaries of each system is critical for seamless integration.
| Technical Capabilities | Geographic Information System (GIS) | Computer-Aided Design (CAD) | Management Information System (MIS) | Spreadsheet (Excel) |
|---|---|---|---|---|
| Primary Data Type | Georeferenced features (Raster/Vector) tied to relational tables. | High-precision geometry vectors (lines, arcs, solids). | Structured relational tables (alphanumeric). | Flat tabular grids (rows, columns). |
| Spatial Reference Systems | Native support for projections (UTM, State Plane) and datums (WGS 84, NAD 83). | Typically local cartesian grid (\(0,0\) origin or local engineering benchmark). | None natively. Spatial data must be converted to text/IDs. | None. Coordinates are treated as plain text or numbers. |
| Spatial Topology | High. Enforces rules like "polygons must not overlap" or "lines must connect at nodes." | Low. Geometries exist independently; lines can cross without forming a node. | None. Relational integrity is logical, not spatial. | None. |
| Database Coupling | Tight. Every geographic feature is linked to a database row. | Loose. Attributes are often text annotations or basic layers. | Maximum. Relies heavily on SQL relational constraints. | Weak. Lacks relational integrity checks between separate sheets. |
| Spatial Analysis Tools | Buffer, overlay, viewshed, watershed routing, network analysis. | Dimensioning, volume calculation, geometric offsets. | Querying, grouping, business intelligence, aggregation. | Numerical calculations, sorting, charting. |
| Scale & Generalization | Designed for regional, basin, and global-scale models. | Designed for site-scale, high-precision engineering models (millimeter scale). | Organizational, departmental, or enterprise-scale database. | Personal or project-scale calculator. |
Practical Workflow Scenario: Building a Hydropower Reservoir
To understand how these tools interact, consider the workflow for planning a new reservoir:
-
Spreadsheets (Excel): Hydrologists compile 30 years of daily river gauge records to calculate flow duration curves and average monthly discharge.
-
GIS: Planners overlay regional elevation grids (DEMs) to locate the narrowest valley point, delineate the upstream catchment basin, calculate the reservoir's volume at various fill heights, and map the communities and forests that will be inundated.
-
CAD: Structural engineers import the reservoir boundary from GIS and design the concrete gravity dam, spillways, and intake tunnels in high-precision 3D environments.
-
MIS: The project management team tracks construction budgets, equipment inventory, labor logs, and land-compensation payments in a relational database system.
4. Spatial Thinking in Hydrology
Spatial thinking goes beyond identifying coordinates on a map. It involves conceptualizing how physical objects occupy space and how processes (like the water cycle) operate across space. In hydrology, water flow is determined by spatial relationships:
graph TD
Elev["1. Elevation Surface<br/>(DEM)"] --> Slope["2. Slope & Aspect<br/>Calculation"]
Slope --> FlowDir["3. Flow Direction<br/>(D8 Algorithm)"]
FlowDir --> FlowAcc["4. Flow Accumulation<br/>Grid"]
FlowAcc --> StreamNet["5. Stream Network<br/>Extraction"]
StreamNet --> BasinDel["6. Watershed & Basin<br/>Delineation"]
style Elev fill:#f9f9f9,stroke:#333
style BasinDel fill:#d2f8d2,stroke:#2b8a2b,stroke-width:2px
1. Spatial Adjacency and Connectivity
In a river basin, cells or polygons are spatially adjacent. Water flows from one cell to its neighbor based on gravity.
-
D8 Flow Model: A common raster method that inspects a pixel and its eight surrounding neighbors. Water is assumed to flow in the direction of the steepest downward slope.
-
Hydrological Connectivity: This connectivity determines how pollutants or sediments run off from farm fields, travel through small streams, and accumulate in larger rivers.
2. Terrain Derivatives: Slope, Aspect, and Flow Path
-
Slope: The steepness or rate of change of elevation. Steep slopes generate fast surface runoff; flat slopes encourage water ponding and infiltration.
-
Aspect: The compass direction that a slope faces. In mountainous regions (like the Himalayas), south-facing slopes receive more solar radiation, leading to faster snowmelt and different evaporation rates than north-facing slopes.
-
Flow Accumulation: A calculation that counts how many upstream cells drain into each cell. Cells with high flow accumulation represent stream channels; cells with zero accumulation represent ridge lines or watershed divides.
3. Spatial Scale and Resolution
Hydrological processes must be analyzed at appropriate spatial scales:
-
Micro-scale: Analyzing soil water movement in a single field requires high-resolution raster datasets (e.g., \(1\text{ m}\) grids from LiDAR).
-
Macro-scale: Modeling water balance for the entire Koshi or Karnali river basins requires regional-scale datasets (\(30\text{ m}\) to \(90\text{ m}\) grids from satellite missions like SRTM).
-
The Scale Effect: If a grid is too coarse (e.g., \(1\text{ km}\) pixels), narrow river channels disappear, causing flow path calculations to fail.
5. Real-World Applications in River Basin Management (WECS)
For the Water and Energy Commission Secretariat (WECS), GIS is an essential tool for evidence-based decision-making. Key applications include:
1. Automated Watershed Delineation
Manual delineation of watershed boundaries from paper maps is time-consuming and prone to errors. GIS automates this process using digital elevation datasets. Hydrologists can input any point along a river network (such as a planned hydropower intake site) and the GIS will automatically calculate the boundary of the entire catchment area draining to that specific point.
[!TIP] This allows rapid calculation of watershed areas, average basin slopes, and catchment-wide average rainfall—all of which are inputs for hydrologic design.
2. Stream Order Classification
GIS can automatically analyze river network geometries to calculate stream order based on classification systems:
-
Strahler Stream Order: Intersection of two first-order streams creates a second-order stream. Intersection of two second-order streams creates a third-order stream, and so on. If a lower-order stream intersects a higher-order stream, the order does not change.
-
Shreve Stream Order (Stream Magnitude): Additive system. When two streams intersect, their magnitudes are added together. This is highly correlated with cumulative river flow volumes.
graph TD
subgraph Strahler Stream Ordering
S1["Order 1"] --> S2["Order 2"]
S1_2["Order 1"] --> S2
S2 --> S3["Order 3"]
S2_2["Order 2"] --> S3
S1_3["Order 1"] --> S3_no_change["Order 3 (No change)"]
S3 --> S3_no_change
end
3. Soil Erosion Hotspot Analysis
By combining multiple spatial datasets within a GIS, planners can run the Revised Universal Soil Loss Equation (RUSLE): $\(A = R \times K \times LS \times C \times P\)$ Where:
-
\(A\) = Annual Soil Loss (Raster output)
-
\(R\) = Rainfall Erosivity (Raster interpolated from weather stations)
-
\(K\) = Soil Erodibility (Vector soil map converted to raster)
-
\(LS\) = Slope Length and Steepness (Calculated from DEM)
-
\(C\) = Crop/Vegetation Cover (Calculated from NDVI satellite data)
-
\(P\) = Support Practice Factor (Conservation measures map)
The output map identifies erosion hotspots, allowing WECS to allocate conservation budgets to the areas with the highest soil degradation rates.
4. Flood Inundation and Vulnerability Mapping
GIS intersects physical flood hazard zones with demographic datasets:
-
Hazard Layer: A raster output from hydraulic models (e.g., HEC-RAS) showing water depth and velocity during a 100-year flood event.
-
Vulnerability Layer: Vector polygons containing population density, building footprints, and location of critical infrastructure (hospitals, power grids).
-
Risk Mapping: By intersecting these layers, GIS identifies high-risk zones, helping planners design flood warning systems and select evacuation routes.