Mortenson Center — Global Health for Engineers
Title
Title Slide Course Structure Evan Thomas
Current State of Global Health
Overview USAID Shutdown WHO Withdrawal EPA & Domestic Rollbacks Implications for Engineers
Part 1 - Overview
Intro to Global Health Part 1 - Overview What is Global Health? Disease & Determinants of Health Multi-sectoral Dimension of the Deter... Governance & Armed Conflict
Part 2 – Global Burden of Disease
Intro to Global Health Part 2 – Globa... Explore the Data Health & Income Child Mortality by Cause Extreme Poverty CO2 Emissions Burden of Disease Map Undernourishment Child Mortality Communicable Diseases CD- Modes of transmission Impact of Communicable Diseases Malaria Malaria Control Malaria Geographic Distribution Global HIV Burden HIV/AIDS Air Quality and Public Health Water and Public Health Sanitation and Public Health
Part 3 - Examples
Intro to Global Health Part 3 - Examples Water Quality & Dose-Response
Part 3 – Epidemiology
Intro to Global Health Part 3 – Epide... Epidemiology — Defined Epidemiology Key Terms Measurement of Health Status Solving Health Problems Scenario: Unexplained Pneumonia Epidemiology Study Types Confounding Measuring the association between exp... Epidemiological Formulas Worked Example: Odds Ratio Expressing RRs as Percentages
Part 4 – Interventions
When does the odds ratio approximate ... Surveillance & Public Health Appr... Approaches to Interventions Behavior Change Randomized Controlled Trial (RCT)
Rwanda Case Study
Rwanda Tubeho Neza Water Filter &... Programme Field & Regional Programme Key Technology & Scale & Published
Research Methodology
Research Methodology Tubeho Neza Program Biased Approaches Three Threats to Validity Confounding, Selection & Info Bias Recognizing Bias Study Design Hierarchy Counterfactual & Potential Outcomes Why Randomization Works Cluster Randomization Randomization vs. Sampling Stratification & Quasi-Experimental Reliability & Validity R & V Interaction Incidence & Prevalence Measures of Association Self-Report vs. Sensor Statistical Inference Hypothesis Testing P-Values & Confidence Intervals Choosing Statistical Tests Regression & Survival Decision Tree Rwanda Primary Analysis Clustering Ignoring Clustering & Sample Size Analytic Solutions: Clustering Missing Data Mechanisms Analytic Solutions: Missing Data External Validity Effect Modification Efficacy to Scale
GBD Lab Assignment
Global Burden of Disease Lab

Global Health for Engineers

Mortenson Center in Global Engineering

University of Colorado Boulder

1
Mortenson Center Overview
2

Course Structure

An introduction to the professional field of Global Health, particularly focused on the areas of global health that engineers often contribute to – i.e. community environmental health. In a short course, we won't have time to touch on things like health systems organization but there are many resources for additional learning.

We will be reading published peer reviewed studies of global health interventions – learning how to search for, read and analyze these kinds of studies is fundamental to being conversant in the field of global health. Many of these papers have engineers as co-authors.

This class is also an opportunity for group and 1:1 discussion. I am happy to meet with each of you individually to discuss career objectives and networking.

Class will be heavy on case studies, with student-led facilitation.

All readings posted in Canvas. Textbook is self-paced with online Canvas quizzes.

See Canvas Syllabus for link to class schedule.

3

Evan Thomas

Professor

  • Environmental Engineering Program
  • Civil, Environmental and Architectural Engineering Dept.
  • Aerospace Engineering Sciences Department

Mortenson Endowed Chair in Global Engineering

Director, Mortenson Center in Global Engineering

University of Colorado Boulder

  • PhD, Aerospace Engineering Sciences, 2009
  • MS, Aerospace Engineering Sciences, 2006
  • BS, Aerospace Engineering Sciences, 2005
  • BS, Broadcast Journalism, 2005

Oregon Health and Science University

  • MPH, Master in Public Health, 2014

Fletcher School at Tufts University

  • Global MBA, 2022

NASA Johnson Space Center, Aerospace Engineer, 2004-2010

Portland State University, Assistant/Associate Professor, 2010-2018

Virridy Inc, Founder and CEO, 2012 – Present

Manna Energy Limited / DelAgua Health, Founder, COO, 2007-2016

~80 journal articles, 10 patents, professional work in 16 countries

Slide 3 Slide 3 Slide 3 Slide 3
4
Context

The Current State of Global Health

2025–2026: A period of unprecedented disruption to global health infrastructure.

5

USAID Shutdown

Timeline

  • Jan 20, 2025: Executive Order freezing all foreign assistance funds for 90 days, including PEPFAR
  • March 2025: Termination of 83% of USAID's 6,300 global initiatives
  • Staff cut from 10,000 to 15 personnel
  • Feb 2026: Congress passed $50B foreign aid bill to begin reinvestment

Projected Impact

  • 9.4 million additional deaths projected by 2030 (The Lancet)
  • 500,000–1,000,000 lives lost in 2025 alone (Center for Global Development)
  • 2.3 million people on antiretroviral treatment lost support
  • 12.5–17.9 million additional malaria cases projected
  • 2.4 million people in Yemen lost food assistance
6

US Withdrawal from the WHO

What Happened

  • Jan 20, 2025: Executive order giving one-year notice of withdrawal
  • Jan 22, 2026: US formally withdrew from the WHO
  • US historically contributed ~15% of WHO's total budget (~$1.28 billion annually)
  • WHO cutting ~2,371 staff (25% reduction) by mid-2026

Consequences for Global Health

  • Global Influenza Surveillance and Response System (GISRS) disrupted — seasonal flu vaccine formulation at risk
  • 50-country disease surveillance network dismantled
  • Emergency outbreak response time reverted from <48 hours to >2 weeks
  • Pandemic preparedness infrastructure weakened
7

EPA Rollbacks, CDC Cuts & Domestic Health

EPA Regulatory Rollbacks

  • 31 climate, air, and water pollution regulations rolled back
  • Emissions reporting requirements for CO2, methane, and other GHGs eliminated
  • Coal-fired power plant emission restrictions dismantled
  • Air pollution-related deaths projected to increase by tens of thousands per year

CDC Budget & Workforce Cuts

  • Proposed 53% budget reduction for FY 2026
  • ~25% of CDC workforce cut by end of 2025
  • 42,000 jobs lost nationwide if proposed cuts adopted
  • 60+ CDC programs eliminated, including global HIV/AIDS prevention and global immunization
  • Morbidity and Mortality Weekly Report (MMWR) staff fired

Surveillance & Preparedness

  • 12+ health tracking programs eliminated (deaths, disease trends)
  • Disease detectives, outbreak forecasters, and data offices cut
  • Public Health Emergency Preparedness funding cut 52%
  • Weakened capacity for early detection, outbreak investigation, and pandemic preparedness
8

Implications for Global Health Engineers

Funding Landscape Shift

Traditional USAID-funded pathways for global engineering projects are disrupted. New funding models needed — carbon credits, private sector, multilateral alternatives.

Surveillance Gaps

Engineers working in water, sanitation, and environmental monitoring face a world with weaker disease surveillance and slower outbreak response.

Domestic & Global Nexus

EPA rollbacks demonstrate that environmental health is not just a developing-country issue. Air and water quality challenges affect communities everywhere.

Local Capacity Building

With reduced external support, building local technical and institutional capacity is more critical than ever for sustainable health outcomes.

Data & Evidence

Loss of federal data collection increases the importance of independent monitoring, sensor networks, and digital MRV systems.

Discussion

How should the global health engineering community respond to these changes? What role can universities and the private sector play?

9
Section

Intro to Global Health
Part 1 - Overview

10

What is Global Health?

"An area for study, research, and practice that places a priority on improving health and achieving equity in health for all people worldwide. Global health emphasizes transnational health issues, determinants, and solutions; involves many disciplines within and beyond the health sciences and promotes interdisciplinary collaboration; and is a synthesis of population-based prevention with individual-level clinical care."

— Consortium of Universities for Global Health

Overview Objectives

Define the determinants, including social and economic, that impact health.

Highlight the differences in disease and life expectancy between high- and low-income countries.

Identify some of the dynamics in developing countries that impact health trends.

Key Concepts

The determinants of health

The measurement of health status

The importance of culture to health

The global burden of disease

Key risk factors for health problems

Organisation of health systems

Disciplines: Public Health · Public Policy · Medicine · Social Sciences · Behavioural Sciences · Law · Economics · History · Engineering · Biomedical Sciences · Environmental Sciences · Anthropology

11

Disease & Determinants of Health

Examples of disease that disproportionately impact developing countries

  • Malaria
  • Diarrhea
  • Pneumonia
  • HIV / Aids

99% of the children under the age of 5 who die every year lived in developing countries.

Determinants of Health

  • Genetics
  • Age
  • Gender
  • Lifestyle choices
  • Community influences
  • Income status
  • Geographical location
  • Urbanization
  • Climate Change
  • Governance
  • Culture
  • Environmental factors
  • Work conditions
  • Education
  • Access to health services
12

Multi-sectoral Dimension of the Determinants of Health

Malnutrition

More susceptible to disease and less likely to recover

Cooking with wood and coal

Lung diseases

Poor sanitation

More intestinal infections

Poor life circumstances

Commercial sex work and STIs, HIV/AIDS

Advertising tobacco and alcohol

Addiction and related diseases

Rapid growth in vehicular traffic

Road traffic accidents

13

Governance & Armed Conflict

Governance

Governance has a direct impact on socioeconomic status, health inequalities, and development

  • Allocation of Resources
  • Control of policy
  • Trade agreements
  • Regional politics
  • Abuse of power
  • Education

Armed Conflict

The Results:

  • Disparities over resources and power
  • Broken relationship with neighboring countries
  • Lack of development
  • Inequality along race/gender lines
  • Resources diverted from health care to support conflict
  • Displacement

Health Implications:

  • Malnutrition
  • Diarrhea
  • Respiratory infections
  • AIDS
  • Extreme poverty
  • Negative long-term effects on health
14
Section

Intro to Global Health
Part 2 – Global Burden of Disease

15

Explore the Data

Interactive tools for exploring the Global Burden of Disease: GBD Compare | SDG Visualizations | Our World in Data

Countries Health and Wealth 2021
15

Health & Income of Nations

Life expectancy vs. GDP per capita. Bubble size = population, color = region. Source: World Bank / UN, 2023.

16

Global Mortality Among Children Under 5

Total deaths by cause, 1990–2021. Sources: IHME GBD 2021, UNICEF IGME 2024, WHO Global Health Estimates.

17

Child Mortality in Sub-Saharan Africa

Total under-5 deaths by cause, 1990–2021. Sources: IHME GBD 2021, WHO Africa Region, UNICEF IGME 2024.

18

Extreme Poverty by Region

People living below $2.15/day (2017 PPP). Sub-Saharan Africa is the only region where absolute numbers are rising. Source: World Bank, 2024. By 2030, 9 in 10 people in extreme poverty will live in Sub-Saharan Africa.

19

Per Capita CO2 Emissions

High-income countries have the highest per capita emissions, while low-income countries with the greatest disease burden contribute the least.

20

Global Burden of Disease

DALYs per 100,000 from all causes. Sub-Saharan Africa and South Asia bear the highest disease burden.

21

Undernourishment

Share of population with insufficient caloric intake. Malnutrition is a key determinant of susceptibility to disease.

22

Child Mortality

4.9 million children under age five died in 2022, 13,000 every day.

99% of children who die under the age of 5 are in low and middle income countries.

Child mortality
23

Communicable Diseases

Defined as

    • “any condition which is transmitted directly or indirectly to a person from an infected person or animal through the agency of an intermediate animal, host, or vector, or through the inanimate environment”.

Transmission is facilitated by the following (IOM)

  • more frequent human contact due to
    • Increase in the volume and means of transportation (affordable international air travel),
    • globalization (increased trade and contact)
  • Microbial adaptation and change
  • Breakdown of public health capacity at various levels
  • Change in human demographics and behavior
  • Economic development and land use patterns
24

CD- Modes of transmission

Direct

    • Blood-borne or sexual – HIV, Hepatitis B,C
    • Inhalation – Tuberculosis, influenza, anthrax
    • Food-borne – E.coli, Salmonella,
    • Contaminated water- Cholera, rotavirus, Hepatitis A

Indirect

    • Vector-borne- malaria, onchocerciasis, trypanosomiasis
    • Formites

Zoonotic diseases – animal handling and feeding practices (Mad cow disease, Avian Influenza)

25

Impact of Communicable Diseases

Disease Burden

CDs account for about 30% of the global BoD and 60% of the BoD in Africa.

CDs typically affect LIC and MICs disproportionately.

    • Account for 40% of the disease burden in low and middle income countries

Most communicable diseases are preventable or treatable.

Social Impact

Disruption of family and social networks

  • Child-headed households, social exclusion

Widespread stigma and discrimination

  • TB, HIV/AIDS, Leprosy
  • Discrimination in employment, schools, migration policies

Orphans and vulnerable children

  • Loss of primary care givers
  • Susceptibility to exploitation and trafficking

Interventions such as quarantine measures may aggravate the social disruption

Economic Impact

At the macro level

  • Reduction in revenue for the country (e.g. tourism)
    • Drop in international travel to affected countries by 50-70%
    • Malaria causes an average loss of 1.3% annual GDP in countries with intense transmission

At the household level

  • Poorer households are disproportionately affected
  • Substantial loss in productivity and income for the infirmed and caregiver
  • Catastrophic costs of treating illness
26

Malaria

In 2022, there were an estimated 249 million malaria cases worldwide

    • causes 30% of Low birth weight in newborns Globally.

In 2022, malaria killed an estimated 608,000 people. Malaria kills a child under 5 approximately every minute

40% of the world's population is at risk of malaria. Most cases and deaths occur in SSA.

Malaria is the 9th leading cause of death in LICs and MICs

    • 11% of childhood deaths worldwide attributable to malaria
    • SSA children account for 82% of malaria deaths worldwide

Source: WHO World Malaria Report 2023

Global malaria prevalence
27

Malaria Control

Malaria control

  • Early diagnosis and prompt treatment to cure patients and reduce parasite reservoir
  • Vector control:
    • Indoor residual spraying
    • Long lasting insecticide treated bed nets
  • Intermittent preventive treatment of pregnant women

Challenges in malaria control

  • Widespread resistance to conventional anti-malaria drugs
  • Malaria and HIV
  • Health Systems Constraints
    • Access to services
    • Coverage of prevention interventions
28

Malaria Geographic Distribution

Slide 38
29

Global HIV Burden

39.9 million people living with HIV (2023)

Slide 39
30

HIV/AIDS

In 2023, 39.9 million people worldwide were living with HIV, of which 67% live in SSA

  • 4.1 million people worldwide became newly infected
  • 2.8 million people lost their lives to AIDS

New infections occur predominantly among the 15-24 age group.

First identified in the early 1980s. Has affected over 85 million people since the start of the epidemic.

Source: UNAIDS Global AIDS Update 2023

HIV Co-infections

Impact of TB on HIV

  • TB considerably shortens the survival of people with HIV/AIDS.
  • TB kills up to half of all AIDS patients worldwide.
  • TB bacteria accelerate the progress of AIDS infection in the patient

HIV and Malaria

  • Diseases of poverty
  • HIV infected adults are at risk of developing severe malaria
  • Acute malaria episodes temporarily increase HIV viral load
  • Adults with low CD4 count more susceptible to treatment failure
31

Air Quality and Public Health

Slide 44
32

Water and Public Health

“No other single intervention in the history of medicine has saved as many lives and reduced as much suffering as the provisioning of uncontaminated water,” - Paul Edward

One billion people in the world lack access to clean drinking water

  • A leading cause of death worldwide
  • An estimated 1.4 million people die each year from inadequate water, sanitation and hygiene

Source: WHO, 2023

Slide 46
33

Sanitation and Public Health

Minimum Standards

  • Safe disposal excreta and sullage (greywater)
    • Avoid disposal within 15 meters of any water source
  • Provision of drainage
  • Disposal of waste
  • Control of insect and rodents
Slide 47 Slide 47
34
Section

Intro to Global Health
Part 3 - Examples

35

Water Quality & Dose-Response

Adherence to Clean Drinking Water Consumption

Adherence

Dose-Response

ehp.niehs.nih.gov/doi/10.1289/ehp.1206429

Dose-Response
36
Section

Intro to Global Health
Part 3 – Epidemiology

37

Epidemiology — Defined

Adapted from: Last JM, ed. A dictionary of epidemiology. 2nd ed. Toronto, Canada: Oxford University Press; 1988.

Study of the distribution and determinants of health-related states among specified populations and the application of that study to the control of health problems

Purposes in Public Health Practice

Discover the agent, host, and environmental factors that affect health

Determine the relative importance of causes of illness, disability, and death

Identify those segments of the population that have the greatest risk from specific causes of ill health

Evaluate the effectiveness of health programs and services in improving population health

Epidemiology
38

Epidemiology Key Terms

epidemic or outbreak: disease occurrence among a population that is in excess of what is expected in a given time and place.

cluster: group of cases in a specific time and place that might be more than expected.

endemic: disease or condition present among a population at all times.

pandemic: a disease or condition that spreads across regions.

rate: number of cases occurring during a specific period; always dependent on the size of the population during that period.

39

Measurement of Health Status

Cause of death

  • Obtained from death certification but limited because of incomplete coverage

Life expectancy at birth

  • The average number of years a new-borns baby could expect to live if current trends in mortality were to continue for the rest of the new-born's life

Maternal mortality rate

  • The number of women who die as a result of childbirth and pregnancy related complications per 100,000 live births in a given year

Infant mortality rate

  • The number of deaths in infants under 1 year per 1,000 live births for a given year

Neonatal mortality rate

  • The number of deaths among infants under 28 days in a given year per 1,000 live births in that year

Child mortality rate

  • The probability that a new-born will die before reaching the age of five years, expressed as a number per 1,000 live births
40

Solving Health Problems

Step 1

Data

collection

Action

Solving health problems

Assessment

Hypothesis

testing

Action

Step 2

Step 3

Step 4

Step 1 -

Surveillance; determine time, place, and person

Inference

Determine how and why

Intervention

Step 1 -

Step 2

Step 3

Step 4

Slide 62
41

Scenario: Unexplained Pneumonia

July 21–24

July 26–Aug 1

August 2

(Morning)

August 2

(Evening)

American Legion Convention,

Philadelphia, Pennsylvania

18 deaths reported among conventioneers

Health care provider at a veterans’ hospital in Philadelphia calls CDC to report cases of severe respiratory illness among attendees of the American Legion Convention

71 additional cases reported

Fraser DW, Tsai, T, Orenstein W, et al. Legionnaires’ disease: description of an epidemic of pneumonia. New Engl J Med 1977;297:1189–97.

42

Epidemiology Study Types

Epidemiology study

types

Experimental

Observational

Descriptive

Analytic

43

Confounding

Occurs when an extrinsic factor is associated with a disease outcome and, independent of that association, is also associated with the exposure

Exposure Outcome

Confounder

44

Measuring the association between exposure and outcome variables

The appropriate measure of association to use depends on the nature of the data

When exposure and outcome variables are dichotomous (two-level nominal data)

  • Odds ratio – use with case-control study (observational)
  • Risk ratio – use with cohort study (controlled)
  • Rate ratio – use with cohort study (controlled)

“Risk” refers to the probability of occurrence of an event or outcome.

“Odds” refers to the probability of occurrence of an event/probability of the event not occurring.

45

Epidemiological Formulas

Rate Formula

To calculate a rate, we first need to determine the frequency of disease, which includes:

the number of cases of the illness or condition

the size of the population at risk

the period during which we are calculating the rate

Rate formula

Equations Based on 2x2 Table

2x2 Table
46

Worked Example: Odds Ratio

OR =

Odds Ratio example

Example

OR =

Smokers

Non-smokers

OR calculation
47

Expressing RRs as Percentages

We can also express these RRs as percent change

  • RR > 1 % Increase Change = (RR – 1) × 100
  • RR < 1 % Decrease Change = (1 – RR) × 100
48

When does the odds ratio approximate the risk ratio?

For health-related states or events that are rare (i.e., affecting less than 10% of the population), a + b can be approximated by b, and c + d can be approximated by d

49

Surveillance & Public Health Approach

Goal of Intervention

Develop strategies for particular groups to engage in change toward health

Provide a support system to initiate change and sustain positive behaviors

A Public Health Approach

Surveillance → Risk Factor Identification → Intervention → Implementation → Evaluation

Surveillance Risk factors Intervention Implementation Evaluation
50

Approaches to Interventions

Personal responsibility and action

Utilitarian Approaches – "Greatest good for the greatest number"

  • Including non Health Systems Interventions.

Regulations and Laws

Partnerships and Collaboration

Enlightened Self Interest

Education

Develop favorable attitudes towards the behavior

Training (i.e. Community health workers)

Participatory engagement

Provide sustainable access

Utilizing underlying skills of the local people

Supportive Environment

  • Utilizing families, local organizations, community leaders, policy makers
51

Behavior Change

Why Do People NOT Change Behavior?

Do they understand the message?

Do they see themselves as vulnerable?

Do they trust the ones who present the message?

Do they think the benefits of change are worth the long-term benefits?

Is it too costly?

Does change contradict with their religious beliefs?

Health Belief Model

Perceived susceptibility – risk of acquiring the disease

Perceived severity – perception on the risk of acquiring the disease

Perceived benefits – is it worth the change?

Perceived barriers – obstacles to achieving health change

Cue to action – what will trigger this change?

Self-efficacy – how confident is the person to successfully perform a behavioral change?

52

Randomized Controlled Trial (RCT)

Slide 86
53
Case Study

Rwanda Tubeho Neza
Water Filter & Cookstove Programme

54

Programme Overview

The Tubeho Neza ("Live Well") programme was one of the world's first carbon-credit-financed health interventions, distributing water filters and improved cookstoves across Western Province, Rwanda.

  • First-ever UN CDM and Gold Standard programmes earning carbon credits for water treatment
  • Carbon credit revenue funded distribution, training, and ongoing monitoring
  • Implemented through Rwanda's existing Community-Based Environmental Health Promotion Programme (CBEHPP)
  • Community health workers (CHWs) served as the primary delivery and behavior change channel
  • Cluster-randomized controlled trial design to rigorously evaluate health impacts

Evan A. Thomas, PhD, PE, MPH — Director, Professor, University of Colorado Boulder

Rwanda programme
55

Field & Regional Context

Field context 1 Field context 2 Field context 3 DR Congo context
56

Programme Implementation & Monitoring

Implementation 1 Implementation 2 Implementation 3 Implementation 4 Monitoring Data collection
57

Key Results

29%
Reduction in diarrhea
92%
Increase in clean water access
90
Averted childhood deaths/yr
7,500
Averted DALYs annually
  • 73% reduction in indoor air pollution among outdoor cooks
  • 38% reduction in cryptosporidium exposure seroconversion
  • 97.5% reduction in fecal contamination of drinking water (RCT finding)
  • 48% reduction in cooking area air pollution
  • 25% reduction in acute respiratory infections in children under 5
  • Over 90% adoption rate maintained through CHW-delivered behavior change
Results chart 1 Results chart 2 Results chart 3

Sources: Thomas et al., Lancet Planetary Health (2018); Kirby et al., PLOS ONE (2014)

58

Technology & Digital Monitoring

Water Filters

  • LifeStraw Family 2.0 household water filters
  • Significant microbiological effectiveness reducing E. coli contamination
  • Free distribution with carbon waiver for credit generation

Remote Sensing Innovation

  • Electronic sensors remotely transmitting usage data
  • Sensor-reported use was substantially lower than self-reported use
  • Demonstrated critical value of objective digital monitoring
  • Published in ACS Environmental Science & Technology

Carbon Credit Model

  • Pay-for-performance model funded by voluntary carbon credits
  • Health, livelihood, and environmental benefits substantially outweighed costs
  • Fuel savings and averted healthcare costs = largest economic gains
Technology Monitoring tech
59

Scale & Lessons Learned

Scale 1 Scale 2 Scale 3

Key Lessons

  • Integration with existing community health infrastructure (CBEHPP) improved scalability
  • Geographic accessibility and CHW engagement were key determinants of sustained adoption
  • Objective sensor monitoring revealed gap between self-reported and actual use
  • Carbon credit financing created sustainable revenue for operations and maintenance
  • Combining hardware distribution with behavior change messaging achieved >90% uptake
60

Published Research (13 Papers)

TitleJournalLink
Health, livelihood, and environmental impacts of the Tubeho Neza programmeThe Lancet Planetary HealthOpen
Effects of adding household water filters to Rwanda's CBEHPPNature — npj Clean WaterOpen
Assessing Impact of Water Filters and Cookstoves: A Randomised Controlled TrialPLOS ONEOpen
Designing and Piloting a Program to Provide Water Filters and CookstovesPLOS ONEOpen
Cost-benefit analysis of livelihood, environmental and health benefitsScienceDirectOpen
Use, microbiological effectiveness and health impact of a household water filterScienceDirectOpen
Study design of a cluster-randomized controlled trialScienceDirectOpen
Process evaluation and assessment of useBMC Public HealthOpen
Use of Remotely Reporting Electronic SensorsACS Env. Sci. & Tech.Open
Integration of Household Water Filters with Community-Based SanitationMDPI SustainabilityOpen
Geospatial-temporal, demographic, and programmatic adoption characteristicsCogent EngineeringOpen
Assessing use, exposure, and health impacts (Dissertation)Semantic ScholarOpen
Lessons from Rwanda on tackling unsafe drinking water and air pollutionThe ConversationOpen
61
Section

Research Methodology

Methodological Considerations in Evaluating Global Health Interventions

62

The Tubeho Neza Program

Central Case Study: Tubeho Neza ("Live Well") Trial — Large-scale distribution of water filters and cookstoves to 101,000+ households in Western Province, Rwanda

The Intervention

  • Free distribution + promotion of LifeStraw Family 2.0 water filter and EcoZoom Dura rocket cookstove
  • To 101,000+ households (poorest 25%) in Western Province, Rwanda
  • Delivered by community health workers

The Evaluation Questions

  • Did this program actually reduce diarrhea in children under 5? Acute respiratory infections (ARI)?
  • Did it improve household water quality? Personal air quality (PM2.5)?

Published: Kirby, Nagel et al. (2019) PLoS Medicine  |  Design: Nagel, Kirby et al. (2016) CCTC

63

Biased Approaches to Evaluation

Biased Approach #1: Before vs. After

Diarrhea prevalence drops from 15% before distribution to 9% after. Tempting conclusion: a 40% reduction.

  • Seasonality: The rainy season ended between measurements — diarrhea is seasonal
  • Concurrent changes: Rotavirus vaccine campaigns, economic growth, other WASH programs
  • Regression to mean: Starting during a disease spike means prevalence naturally returns to average

Before/after comparisons confound the intervention effect with all temporal changes.

Biased Approach #2: Users vs. Non-Users

Filter users have 40% less diarrhea — but users are more educated, practice handwashing, have better healthcare access.

This is selection bias: the comparison group is not exchangeable with the treatment group.

Rwanda Trial: Baseline diarrhea was 15.3% (intervention) vs 13.7% (control) — already different before intervention.

64

The Three Systematic Threats to Validity

Every epidemiological study faces two kinds of error. Random error shrinks with larger samples. Systematic error — bias — does not.

C — Confounding

A third variable distorts the exposure-outcome relationship because it is associated with both.

e.g., Wealthier households use filters AND have less diarrhea

S — Selection Bias

The study sample is not representative, or loss to follow-up differs systematically between groups.

e.g., Analyzing only clinic visitors misses healthy children

I — Information Bias

Systematic errors in how exposure, outcome, or covariates are measured.

e.g., Caregivers over-report filter use (social desirability)

65

Confounding, Selection Bias & Information Bias

Confounding: When a Third Variable Creates a False Association

Three criteria for a confounder: (1) associated with the exposure, (2) independently associated with the outcome, (3) not on the causal pathway.

Control methods: Design: randomization, restriction, matching. Analysis: stratification, regression, propensity scores.

Selection Bias: When Who Gets Studied Distorts Findings

  • Loss to follow-up: Differential attrition biases the comparison
  • Collider bias: Conditioning on a common effect creates spurious associations
  • Volunteer/self-selection: Study volunteers differ systematically from non-volunteers

Information Bias: When Measurement Is Systematically Wrong

  • Differential misclassification: Error differs between groups (recall bias, courtesy bias)
  • Non-differential misclassification: Equal error across groups — biases toward the null
67%
Self-reported filter use
37%
Sensor-detected filter use
84%
Self-reported stove use
37%
Sensor-detected stove use
66

Recognizing Bias & Direction of Effects

StageCritical QuestionBias Addressed
DesignIs treatment assignment independent of confounders?Confounding
DesignCan participants be blinded to assignment?Information
EnrollmentIs the sample representative of the target population?Selection
Follow-UpIs attrition differential between groups?Selection
MeasurementAre outcomes assessed identically in all groups?Information
AnalysisHave confounders been identified via DAG and adjusted?Confounding

Direction of Bias

  • Non-differential misclassification: Toward null — dilutes the true effect
  • Uncontrolled confounding: Either direction — depends on the confounder
  • Differential misclassification: Either direction — unpredictable (recall bias, courtesy bias)
67

Study Design Hierarchy for Causal Inference

Each step down the hierarchy requires stronger assumptions to claim causation.

Randomized Controlled Trial (RCT)

Randomization creates exchangeable groups. Gold standard for causal inference.

Quasi-Experimental Designs

Exploit natural variation (DiD, RDD, ITS). No random assignment but clever design.

Cohort Study

Follow exposed/unexposed over time. Temporal ordering established.

Case-Control Study

Identify cases, select controls, look back. Efficient for rare outcomes.

Cross-sectional study: Snapshot — exposure and outcome measured simultaneously. Weakest for causation.

68

The Counterfactual & Potential Outcomes

Six months after distribution, 7-day diarrhea prevalence among children under 5 is 8.7%. Is that good or bad?

You cannot answer without knowing what the prevalence would have been without the intervention. This unobservable quantity is the counterfactual.

If counterfactual = 13%

29% reduction. Scale nationally.

If counterfactual = 8%

No effect. Don't waste resources.

The Potential Outcomes Framework (Neyman-Rubin)

For each child i: Yi(1) = outcome if treated; Yi(0) = outcome if not treated.

Causal effect = Yi(1) − Yi(0). The fundamental problem: we can only observe ONE potential outcome per child.

Average Treatment Effect (ATE) = E[Yi(1)] − E[Yi(0)]. Randomization makes the control group an unbiased estimate of the treated group's counterfactual.

69

Why Randomization Solves Confounding

Randomization ensures that, on average, treatment and control groups are identical in every way except the intervention — including unmeasured confounders.

What Randomization Gives You

Exchangeability (no confounding) • Unbiased ATE estimation • Valid statistical inference • Transparent, pre-specified analysis

What It Does NOT Give You

Protection from attrition bias • Protection from measurement bias • External validity / generalizability

Intention to Treat (ITT)

The ATE of being assigned to treatment, regardless of compliance. In Rwanda: the effect of living in a sector that received filters — not the effect of actually using the filter. ITT is the primary estimand for RCTs.

Rwanda Trial: Randomization at the sector level balanced observed covariates (age, sex, water source, sanitation type).

70

Individual vs. Cluster Randomization

Individual Randomization

Each person randomly assigned. Gold standard for independence. Problem: impractical for community-level interventions.

Cluster Randomization

Groups (villages, sectors) randomized. Prevents contamination. Cost: fewer independent units = less power. Requires GEE or mixed models.

Rwanda Trial: 96 sectors cluster-randomized 3:1. Sectors contain ~40 villages each. Stratified by district (7 districts). 3:1 ratio driven by programmatic goal of reaching 75% of the province in year 1.

Rwanda Trial: Two Studies in One

Sector-Level Study (Population Scale)

All ~82,000 children < 5 in 96 sectors. Data from health facility records and CHW reports.

Village-Level Sub-Study (Intensive Data)

174 village clusters, 1,582 households. 1:1 ratio via PPES sampling. Surveys, water sampling, PM2.5 monitoring, sensors.

Design lesson: When feasible, nest an intensive sub-study within a larger pragmatic trial.

71

Randomization vs. Random Sampling

Two distinct uses of randomness serving fundamentally different purposes.

Random Sampling

Purpose: Select who enters the study
Protects: External validity (generalizability)
Threat if absent: Results may not generalize

Randomization (Random Allocation)

Purpose: Assign who receives treatment
Protects: Internal validity (causal inference)
Threat if absent: Confounding

With RandomizationWithout Randomization
With SamplingStrongest: causal + generalizable (rare)Generalizable, not causal (e.g., DHS)
Without SamplingCausal, limited scope (most RCTs, e.g., Tubeho Neza)Weakest: neither causal nor generalizable

Key insight: Most trials have randomization without random sampling — establishing causation within a specific context.

72

Stratification & Quasi-Experimental Designs

Stratified Randomization

Ensures treatment/control balance on key variables. With only 96 clusters, pure randomization could produce imbalanced groups. Stratifying by district (7 districts) ensures proportional representation. Restricted randomization: generate many allocations, reject those with poor covariate balance.

Difference-in-Differences (DiD)

Compare change over time in treated vs. untreated groups. ΔTreated − ΔControl = causal effect. Key assumption: parallel trends — both groups would have followed the same trajectory without intervention.

Other Quasi-Experimental Designs

  • Regression Discontinuity (RDD): Exploit an eligibility cutoff. Households just above/below the Ubudehe threshold are similar but one gets the intervention. Estimates local effect at the cutoff.
  • Interrupted Time Series (ITS): Track outcomes over time; look for level/slope change at intervention point. Requires long pre-intervention series (8+ data points).
73

Reliability & Validity of Measurement

Any measurement instrument must demonstrate both before its data can be trusted.

Reliability (Consistency)

A reliable instrument yields the same result repeatedly. Necessary but not sufficient — consistent measurement can still be consistently wrong.

Types: Test-retest (ICC, Pearson r) • Inter-rater (Cohen’s κ) • Intra-rater (ICC) • Internal consistency (Cronbach’s α)

Validity (Accuracy)

A valid instrument measures the construct it claims to measure. Without validity, reliability is meaningless.

Types: Content (expert panels) • Criterion (sensitivity/specificity, AUC-ROC) • Construct (CFA, convergent/discriminant)

Key Metrics

MetricRangePoorModerateGoodExcellent
ICC0–1< 0.500.50–0.750.75–0.90> 0.90
Cohen’s κ−1 to 1< 0.200.21–0.600.61–0.80> 0.80
Cronbach’s α0–1< 0.600.60–0.700.70–0.90> 0.90
74

How Reliability & Validity Interact

Low R, Low V

Scattered, off-center. Needs complete redesign.

High R, Low V

Tightly clustered, off-center. Most deceptive: looks precise but systematically wrong.

Low R, High V

Scattered around center. On average correct but too noisy. Increase sample size.

High R, High V

Tightly clustered on center. The gold standard.

Pitfalls: Assuming reliability implies validity • Validating in one population but applying in another • Reporting only Cronbach’s α without factor structure

Low reliability, low validity High reliability, low validity Low reliability, high validity High reliability, high validity
75

Disease Frequency: Incidence & Prevalence

Incidence: New Cases Over Time

  • Cumulative incidence (risk): New cases / Population at risk at start. e.g., 50/500 = 10% over 6 months
  • Incidence rate: New cases / Person-time at risk. e.g., 50/250 person-years = 0.20/person-year. Accounts for variable follow-up.

Person-time: A child observed 6 months contributes 0.5 person-years.

Prevalence: Proportion with Disease

  • Point prevalence: Cases at one time / Total population. A snapshot mixing new and ongoing cases.
  • Period prevalence: Cases during a period / Population. "Has your child had diarrhea in the past 7 days?"

Prevalence = Incidence × Duration. Treatment that shortens episodes reduces prevalence without changing incidence.

Rwanda Trial: Primary outcome was 7-day period prevalence of caregiver-reported diarrhea and ARI. Repeated cross-sectional visits at ~4-month intervals.

76

Measures of Association: Relative & Absolute

Relative Measures

Rwanda Trial primary estimand: Prevalence Ratio from log-binomial GEE.

PR 0.71
Diarrhea (29% reduction)
PR 0.75
ARI (25% reduction)

Values < 1 indicate a protective effect; > 1 indicates increased risk.

Absolute Measures

  • Risk/Prevalence Difference: RD = Risk(exposed) − Risk(unexposed). Rwanda diarrhea: 8.7% − 12.9% = −4.2 percentage points
  • Number Needed to Treat: NNT = 1/|RD|. Rwanda: 1/0.042 ≈ 24 households per case prevented in a 7-day window

Relative measures tell you how strong the effect is. Absolute measures tell you how much disease you actually prevent. Always report both.

77

Self-Report vs. Sensor & Evaluation Design

67%
Reported filter use
37%
Sensor-confirmed use
~2×
Overestimation factor

Filter use declined: 75.5% → 67.6% → 64.8%. Traditional stove use increased: 24.1% → 49.4% ("stove stacking").

Implications for Evaluation Design

  • Triangulate: Self-report + observation + sensor + biomarker. Avoid reliance on a single method.
  • Negative controls: Include outcomes that shouldn't be affected (Rwanda: toothache). If treatment reduces toothache, you have a bias problem.
  • Blinding matters: Unblinded trials with subjective outcomes overestimate effects. Pooled masked HWT trials found no significant effect on diarrhea.
  • Plan measurement validation: Budget for sensor sub-studies and spot-checks — these are essential, not extras.

The best study design in the world is undermined by poor measurement.

78

The Core Problem of Statistical Inference

We want to learn about a population, but we can only observe a sample.

Population

Parameters (fixed, unknown): True mean diarrhea prevalence across all children under 5 in Western Province, Rwanda

Sample

Statistics (variable, calculated): Observed prevalence of 8.7% from a survey of 2,000 households in 30 randomly selected villages

Key Concepts

  • Sampling distribution: Distribution of a statistic across all possible samples of the same size
  • Central Limit Theorem: Sampling distribution of the mean approaches normal as n increases
  • Standard error: SD of the sampling distribution — measures estimate precision. Larger samples → smaller SEs

Key intuition: Probability distributions let us quantify uncertainty — which results would be common vs. surprising given a particular assumption.

79

The Logic of Hypothesis Testing

1. Assume No Effect

H0: Filter distribution has no effect on diarrhea prevalence.

2. Calculate Expected

Under H0, how much would prevalence vary between groups by chance?

3. Compare Observed

You observed a 4.3 pp difference. Test statistic t = 3.7.

4. Quantify Surprise

p = 0.002. Only 0.2% of random samples would show this large a difference.

Proof by contradiction: assume no effect exists, show the data would be very unlikely under that assumption, conclude the effect is probably real.

Two Ways to Be Wrong

Reality: H0 TRUEReality: H0 FALSE
Reject H0Type I Error (α) — false positiveCorrect (Power = 1 − β)
Fail to rejectCorrect (true negative)Type II Error (β) — false negative
80

P-Values & Confidence Intervals

What the P-Value IS

The probability of observing data at least as extreme as your result, assuming H0 is true. p = 0.03 means: if there were truly no effect, you'd see this result only 3% of the time.

What the P-Value is NOT

  • NOT the probability that H0 is true
  • NOT the probability of a false positive
  • NOT a measure of effect size

α = 0.05 is a convention, not a law. Some fields use 0.01 or 0.10. Set before analyzing data.

Confidence Intervals: Beyond Yes/No

If you repeated your study 100 times and computed a 95% CI each time, approximately 95 of those intervals would contain the true parameter. CIs provide the range of plausible effect sizes.

Confidence interval illustration
81

Choosing the Right Statistical Test

Before choosing a test, answer three questions: (1) What is your outcome variable? (2) How many groups? (3) Are assumptions met?

OutcomeScenarioParametricNon-Parametric
Continuous1 sample vs. known valueOne-sample t-testWilcoxon signed-rank
2 independent groupsIndependent t-testMann-Whitney U
3+ groupsOne-way ANOVAKruskal-Wallis
Categorical2×2 independentChi-square test (Fisher’s exact if cell < 5)
Paired before/afterMcNemar’s test
Multiple predictorsLogistic regression
Time-to-eventEstimate/compare curvesKaplan-Meier / Log-rank test
Model with covariatesCox proportional hazards regression

Go non-parametric when: n < 30 and non-normal • Skewed data (income, costs) • Ordinal scales (Likert, staging)

82

Correlation, Regression & Survival Analysis

Correlation

  • Pearson’s r: Both continuous and normal. Linear relationship. e.g., income vs. water quality
  • Spearman’s rho: Ordinal or non-normal. Monotonic relationship. e.g., education level vs. health knowledge rank

Regression

  • Linear regression: Continuous outcome. Y = b0 + b1X + e. e.g., predict child BMI from diet and activity
  • Logistic regression: Binary outcome. Log-odds as linear function. e.g., predict diarrhea Y/N from WASH variables
  • Poisson / Negative binomial: Count outcome. Rate of events per unit time. e.g., clinic visits per month

Survival Analysis

  • Kaplan-Meier: Estimate survival curves. Handles censoring
  • Log-Rank test: Compare survival between 2+ groups
  • Cox regression: Model hazard with covariates. Produces hazard ratios
83

Statistical Test Decision Tree

What is your outcome variable?

Continuous

1 sample → t-test / Wilcoxon
2 groups → t-test / Mann-Whitney
3+ groups → ANOVA / Kruskal-Wallis
Paired → Paired t / Wilcoxon

Categorical

Independent → Chi-square / Fisher’s
Paired → McNemar’s test
Model → Logistic regression

Time-to-Event

Estimate → Kaplan-Meier
Compare → Log-rank test
Model → Cox regression

Quick Rules of Thumb

  • Continuous outcome + 2 groups → start with a t-test
  • Binary outcome → chi-square for comparison, logistic regression for modeling
  • Censored follow-up time → survival analysis (Kaplan-Meier, log-rank, Cox)
  • Clustered data → mixed models or GEE, not standard tests
84

Rwanda Trial: Primary Analysis in Detail

The Analytic Pipeline

  • Outcome: Binary (diarrhea yes/no in past 7 days)
  • Model: Log-binomial regression → directly estimates prevalence ratio
  • Clustering: GEE with exchangeable working correlation, robust (sandwich) SEs at village level
  • Covariates: Child age (months) and sex only — minimal adjustment since randomization handles confounding
  • Weights: Sampling weights to account for PPES design in village-level sub-study
PR 0.71
Diarrhea (p=0.001)
PR 0.75
ARI (p=0.009)
PR 0.62
Detectable TTC (p<0.001)
β −0.09
Cook PM2.5 (p=0.486)

Water quality improved significantly. Air quality did not — consistent with stove stacking behavior detected by sensors.

85

The Issue of Clustering

Children within the same village share water sources, sanitation, disease ecology, climate, economic conditions, CHW quality, and health facility access. Their health outcomes are correlated — not independent.

Intraclass Correlation & Design Effect

ICC = σ²(between) / [σ²(between) + σ²(within)] — proportion of variance due to between-cluster differences.

Design Effect (DEFF) = 1 + (m − 1) × ICC, where m = average cluster size. Effective sample size = n / DEFF.

LevelOutcomeICCDesign Effect
Within-villageDiarrhea0.021.18 (m=10)
Within-villageARI0.041.36 (m=10)
Sector-levelDiarrhea9.5
Within-child (repeated)Diarrhea0.09
86

Ignoring Clustering & Sample Size

What Happens When You Ignore Clustering

↑ Type I
False positive rate increases
↓ CIs
Intervals too narrow
↓ p-vals
Artificially small

If you analyze 5,000 children from 10 clusters as independent observations, your nominal 5% significance level may actually be 20–30%. This is one of the most common methodological errors in global health research.

Sample Size Under Clustering

The number of clusters matters more than individuals per cluster.

Clusters needed per arm: k = (Zα + Zβ)² × DEFF × 2σ² / δ². Adding more people per cluster gives diminishing returns once m × ICC > 1.

Better to have 20 clusters of 50 than 10 clusters of 100. Always calculate cluster-adjusted sample sizes.

87

Analytic Solutions for Clustering

Cluster-Robust SEs

"Sandwich" standard errors. Inflate SEs to account for within-cluster correlation. Requires ≥30 clusters. Simplest solution.

GEE

Generalized Estimating Equations. Specify working correlation structure. Robust SEs protect against misspecification. Population-averaged. Used for Rwanda primary analysis.

Mixed Models

Random effects / multilevel models. Estimate cluster-specific parameters. Handle multiple nesting levels. Cluster-specific interpretation.

For policy questions ("what’s the average effect?"), GEE is preferred. For prediction ("what will happen in this village?"), mixed models are preferred.

Rwanda Trial: GEE with exchangeable working correlation and robust SEs, clustering at village level.

88

Missing Data Mechanisms

Rubin’s classification defines why data are missing — and determines which analytic methods are valid.

MCAR — Least Concern

Missingness unrelated to any data. Loss of power but no bias. Complete-case analysis valid.

e.g., Lab machine randomly malfunctions

MAR — Moderate Concern

Missingness depends on observed data. Biased if ignored. Recoverable with MI or ML methods.

e.g., Younger participants miss follow-up

MNAR — Greatest Concern

Missingness depends on the unobserved value itself. Biased under all standard methods. Requires sensitivity analyses.

e.g., Depressed patients don’t complete depression scale

89

Analytic Solutions for Missing Data

MethodValid UnderHow It Works
Complete-Case AnalysisMCARDelete cases with missing values. Simple but wasteful. Biased unless MCAR.
Single ImputationMCARReplace with mean/median/LOCF. Underestimates variance. Rarely recommended.
Multiple Imputation (MI)MARCreate m complete datasets (20–50) with plausible imputed values. Pool results via Rubin’s rules.
Maximum Likelihood (ML)MAREstimate parameters directly from observed data likelihood (EM, FIML). No imputed datasets needed.
Inverse Probability WeightingMARWeight complete cases by inverse probability of being observed. Combines with MI for doubly robust estimation.
Sensitivity AnalysisMNARVary assumptions about missing values. Pattern-mixture models, selection models, tipping-point analyses.
90

External Validity: Not All Water Treatment Works

Randomized trials of chlorination in Bangladesh, Kenya, and Zimbabwe found NO significant effect on childhood diarrhea.

Pathogen Profile

Chlorine is ineffective against Cryptosporidium. LifeStraw removes it through physical ultrafiltration. Intervention type matters.

Measurement & Blinding

Chlorination trials with blinded outcomes (sham chlorine) found no effect. Unblinded self-reported outcomes may overestimate effects.

The generalizability question isn’t "does water treatment work?" It’s "which treatment, against which pathogens, in which context, with what delivery model?"

Internal vs. External Validity

The Rwanda trial has strong internal validity (cluster-RCT, pre-registered, large sample). But results are specific to: Western Province, Rwanda | Poorest quartile | Rural | LifeStraw + EcoZoom | CHW-delivered with carbon credit financing.

91

Effect Modification: Why Effects Vary

Treatment effects are not universal constants. They vary across subgroups and contexts.

Biological Modifiers
  • Age: children < 2 more susceptible
  • Nutritional status
  • HIV/immune status
  • Pathogen profile: filter vs. chlorine
Contextual Modifiers
  • Baseline water quality
  • Baseline disease burden
  • Competing transmission routes
Implementation Modifiers
  • CHW network quality
  • Supply chain reliability
  • Cultural acceptability
  • Government support

Always report subgroup analyses and discuss which modifiers might limit transportability. Rwanda’s strong CHW system is unusual — replication elsewhere may yield smaller effects.

92

Efficacy → Effectiveness → Scale

97.5%
Pilot TTC reduction (supervised)
38%
Scaled TTC reduction (real-world)
~2.5×
Efficacy-effectiveness gap

Why Effects Shrink at Scale

  • Adherence: Self-reported filter use declined 75.5% → 64.8%. Sensor data: actual use ~37%.
  • Stove stacking: Traditional fire use increased 24.1% → 49.4%. No exclusive adoption → no air quality improvement.
  • Implementation variation: Hundreds of CHWs with varying training, supply chain delays, seasonal access challenges.
Efficacy-effectiveness gap
93
Lab Assignment

Global Burden of Disease Lab

Water quality QMRA & air quality cost-effectiveness analysis using R, GBD Compare, and HAPIT.

Open Full Lab Assignment →
1 / 92