Equitable Flood Risk Assessment System

Flood Risk Assessment Hero Image

Field	Details
Domain	Environmental Science
Assurance Goal	Fairness

Overview

The Thames Valley Water Authority has deployed an AI-powered flood risk assessment system to inform infrastructure investment decisions across the region. The system analyses multiple data sources to generate flood risk scores for over 2,000 communities, directly influencing how a £50 million annual budget is allocated for flood defences, drainage improvements, and emergency preparedness.

Following concerns raised by community advocates that historically underserved areas may be systematically disadvantaged by algorithmic risk assessment, the Authority has commissioned an assurance case to demonstrate that the system operates fairly across all communities.

System Description

What the System Does

The Flood Risk Assessment System (FRAS) produces quarterly risk scores for each community in the Thames Valley region. These scores range from 1 (minimal risk) to 10 (critical risk) and are calculated based on factors including terrain elevation, proximity to water bodies, drainage infrastructure capacity, and historical flood frequency. The scores are used to:

Prioritise capital investment in flood defence infrastructure
Allocate emergency response resources and equipment
Determine eligibility for government flood resilience grants
Inform insurance risk assessments shared with partner insurers

How It Works

The system integrates data from multiple sources:

Satellite Imagery Analysis: A convolutional neural network (a type of AI model, commonly used to analyse visual imagery) processes monthly satellite imagery to detect changes in land use, vegetation cover, and surface water accumulation
Hydrological Modelling: Physics-based simulations of water flow patterns using terrain elevation data and historical rainfall records
Infrastructure Data: Information about existing drainage systems, flood barriers, and pumping stations
Historical Flood Records: 50 years of flood event data including severity, duration, and affected areas
Socioeconomic Data: Population density, property values, and critical infrastructure locations

An ensemble model (a system that combines multiple prediction methods) integrates these inputs to generate the final risk score, with a confidence interval and key contributing factors.

Key Technical Details

Aspect	Details
Model Architecture	Ensemble combining CNN (for satellite imagery analysis), gradient boosting (a decision-tree algorithm for structured data), and physics-based hydrological simulation
Training Data	15 years of historical flood events (2008-2023), satellite imagery archive, national terrain models
Input Features	27 features including terrain elevation, land use classification, drainage capacity, historical flood frequency, and proximity to water bodies
Output	Risk score (1-10), confidence interval, top 5 contributing factors from sensitivity analysis, trend indicator (improving/stable/worsening)
Validation	Quarterly back-testing against observed flood events; annual external audit
Explainability	SHAP values (a method for explaining which factors most influenced a prediction); natural language summaries from LLM agent for planners

Deployment Context

Coverage: 2,147 communities across Oxfordshire, Berkshire, and Buckinghamshire
Update Frequency: Risk scores updated quarterly; emergency re-assessment capability within 48 hours
Users: Strategic planners, emergency response coordinators, grant administrators, and partner insurers
Operational Since: March 2023

Stakeholders

Stakeholder	Interest	Concern
Community Residents	Receive appropriate flood protection	May be disadvantaged by biased risk assessment
Local Councils	Allocate resources effectively	Need confidence in risk score accuracy, transparency, and fairness
Strategic Planners	Make evidence-based investment decisions	Require transparent, defensible methodology
Insurance Partners	Set appropriate premiums	Need reliable, non-discriminatory risk data
Community Advocates	Ensure equitable treatment	Worry about historical bias affecting vulnerable communities
Environmental Regulators	Meet statutory flood protection duties	Require demonstrable fairness in resource allocation

Regulatory Context

The system operates within several regulatory frameworks:

Flood and Water Management Act 2010: Establishes duties for flood risk management and requires consideration of all communities at risk
Equality Act 2010: Prohibits discrimination in service provision; relevant where flood protection is considered a public service
UK GDPR: Governs processing of location and demographic data; Article 22 rights regarding automated decision-making
Environment Agency Guidance: National standards for flood risk assessment methodologies
Public Sector Equality Duty: Requires public authorities to consider equality implications of their decisions

Fairness Considerations

Several aspects of this system raise fairness concerns:

Historical Bias

Past flood protection investments may have favoured wealthier areas, meaning historical flood data could reflect protective infrastructure rather than inherent risk. Communities that experienced fewer floods due to underinvestment (poor drainage leading to gradual waterlogging rather than dramatic floods) may be scored as lower risk.

Data Quality Disparities

The 50 years of historical flood records vary significantly in quality and completeness across the region. Rural and less-monitored areas may have sparser data, with flood events going unrecorded or underreported. Similarly, satellite imagery resolution varies, potentially affecting risk score accuracy for different communities.

Proxy Discrimination

Features such as property values, infrastructure age, and population density may correlate with socioeconomic status, potentially encoding historical inequalities into risk assessments.

Feedback Loops

Communities that receive investment see improved infrastructure, potentially lowering future risk scores and creating self-reinforcing cycles of investment inequality.

Assurance Focus

The assurance case should demonstrate that:

The Flood Risk Assessment System fairly evaluates flood risk across all communities, without systematic disadvantage to historically underserved or socioeconomically vulnerable areas.

Deliberative Prompts

What does fairness mean when assessing risk? Equal accuracy, equal outcomes, or something else?
How should a system account for historical underinvestment that has shaped the very data it learns from?
When risk correlates with socioeconomic factors, how do you distinguish legitimate risk signals from discriminatory proxies?
Who should have standing to challenge a community’s risk assessment, and what evidence should they need?
How might optimising for overall prediction accuracy conflict with equitable treatment of all communities?

Suggested Strategies

When developing your assurance case, consider these potential approaches:

Strategy 1: Bias Detection and Mitigation

Identify and address sources of historical and structural bias in training data and model features, ensuring that past inequalities in flood protection investment do not perpetuate future disadvantage.

Strategy 2: Transparent Methodology

Enable communities and oversight bodies to understand how risk scores are calculated, which factors contribute most to their assessment, and how the methodology accounts for known limitations.

Strategy 3: Meaningful Oversight and Appeals

Establish processes through which communities can challenge their risk assessments and access independent review, with community representation in governance structures.

Strategy 4: Ongoing Fairness Monitoring

Track investment allocation patterns and risk score distributions over time to detect emerging disparities and ensure the system’s fairness properties are maintained as conditions change.

Recommended Techniques for Evidence

The following techniques from the TEA Techniques library may be useful when gathering evidence for this assurance case:

Demographic Parity Assessment - Evaluate whether risk scores and resulting investment allocations are distributed equitably across community demographics
Counterfactual Fairness Assessment - Test whether a community’s risk score would change if its socioeconomic characteristics were different while physical flood risk factors remained the same
Sensitivity Analysis for Fairness - Assess how sensitive the model’s predictions are to features that may act as proxies for protected characteristics