Global Health for Engineers
Mortenson Center in Global Engineering
University of Colorado Boulder
Course Structure
An introduction to the professional field of Global Health, particularly focused on the areas of global health that engineers often contribute to – i.e. community environmental health. In a short course, we won’t have time to touch on things like health systems organization but there are many resources for additional learning.
We will be reading published peer reviewed studies of global health interventions – learning how to search for, read and analyze these kinds of studies is fundamental to being conversant in the field of global health. Many of these papers have engineers as co-authors.
This class is also an opportunity for group and 1:1 discussion. I am happy to meet with each of you individually to discuss career objectives and networking
Class will be heavy on case studies, with student-led facilitation
All readings posted in Canvas. Textbook is self-paced with online Canvas quizzes
See Canvas Syllabus for link to class schedule
Evan Thomas
Professor
- Environmental Engineering Program
- Civil, Environmental and Architectural Engineering Dept.
- Aerospace Engineering Sciences Department
Mortenson Endowed Chair in Global Engineering
Director, Mortenson Center in Global Engineering
University of Colorado Boulder
- PhD, Aerospace Engineering Sciences, 2009
- MS, Aerospace Engineering Sciences, 2006
- BS, Aerospace Engineering Sciences, 2005
- BS, Broadcast Journalism, 2005
Oregon Health and Science University
MPH, Master in Public Health, 2014
Fletcher School at Tufts University
Global MBA, 2022
NASA Johnson Space Center, Aerospace Engineer, 2004-2010
Portland State University, Assistant/Associate Professor, 2010-2018
Virridy Inc, Founder and CEO, 2012 – Present
Manna Energy Limited / DelAgua Health, Founder, COO, 2007-2016
~80 journal articles, 10 patents, professional work in 16 countries
Intro to Global Health
Part 1 - Overview
What is Global Health?
"An area for study, research, and practice that places a priority on improving health and achieving equity in health for all people worldwide. Global health emphasizes transnational health issues, determinants, and solutions; involves many disciplines within and beyond the health sciences and promotes interdisciplinary collaboration; and is a synthesis of population-based prevention with individual-level clinical care."
— Consortium of Universities for Global Health
Overview Objectives
Define the determinants, including social and economic, that impact health.
Highlight the differences in disease and life expectancy between high- and low-income countries.
Identify some of the dynamics in developing countries that impact health trends.
Key Concepts
The determinants of health
The measurement of health status
The importance of culture to health
The global burden of disease
Key risk factors for health problems
Organisation of health systems
Disciplines: Public Health · Public Policy · Medicine · Social Sciences · Behavioural Sciences · Law · Economics · History · Engineering · Biomedical Sciences · Environmental Sciences · Anthropology
Disease & Determinants of Health
Examples of disease that disproportionately impact developing countries
Malaria
Diarrhea
Pneumonia
HIV / Aids
99% of the children under the age of 5 who die every year lived in developing countries.
Determinants of Health
Genetics
Age
Gender
Lifestyle choices
Community influences
Income status
Geographical location
Urbanization
Climate Change
Governance
Culture
Environmental factors
Work conditions
Education
Access to health services
Source: Dahlgren G. and Whitehead M. 1991
Multi-sectoral Dimension of the Determinants of Health
Malnutrition –
- more susceptible to disease and less likely to recover
Cooking with wood and coal –
- lung diseases
Poor sanitation –
- more intestinal infections
Poor life circumstances –
- commercial sex work and STIs, HIV/AIDS
Advertising tobacco and alcohol –
- addiction and related diseases
Rapid growth in vehicular traffic often with untrained drivers on unsafe roads-
- road traffic accidents
Governance & Armed Conflict
Governance
Governance has a direct impact on socioeconomic status, health inequalities, and development
Allocation of Resources
Control of policy
Trade agreements
Regional politics
Abuse of power
Education
Image: Lvakurwa, 2009
Armed Conflict
The Results:
Disparities over resources and power
Broken relationship with neighboring countries
Lack of development
Inequality along race/gender lines
Resources diverted from health care to support conflict
Displacement
Health Implications:
Malnutrition
Diarrhea
Respiratory infections
AIDS
Extreme poverty
Negative long-term effects on health
Climate Change, Disasters & Globalization
Flooding/Natural Disasters
Indonesia: 2004 Tsunami
Image: DigitalGlobe, 2004
Haiti Earthquake, 2010
Image: Daily Mail, 2010
Globalization
”the way in which nations, businesses, and people are becoming more connected and interdependent through economic integration, communication, cultural diffusion, and travel” (Labonte & Schrecker, 2006, p. 3).
Means: moving goods and services, capital, technology, labor…
Princeton.edu INA, 2003

Intro to Global Health
Part 2 – Global Burden of Disease
Explore the Data
Interactive tools for exploring the Global Burden of Disease:

Child Mortality
4.9 million children under age five died in 2022, 13,000 every day.
99% of children who die under the age of 5 are in low and middle income countries.
Source: UNICEF/WHO, 2023

Communicable Diseases
Defined as
- “any condition which is transmitted directly or indirectly to a person from an infected person or animal through the agency of an intermediate animal, host, or vector, or through the inanimate environment”.
Transmission is facilitated by the following (IOM)
- more frequent human contact due to
- Increase in the volume and means of transportation (affordable international air travel),
- globalization (increased trade and contact)
- Microbial adaptation and change
- Breakdown of public health capacity at various levels
- Change in human demographics and behavior
- Economic development and land use patterns
CD- Modes of transmission
Direct
- Blood-borne or sexual – HIV, Hepatitis B,C
- Inhalation – Tuberculosis, influenza, anthrax
- Food-borne – E.coli, Salmonella,
- Contaminated water- Cholera, rotavirus, Hepatitis A
Indirect
- Vector-borne- malaria, onchocerciasis, trypanosomiasis
- Formites
Zoonotic diseases – animal handling and feeding practices (Mad cow disease, Avian Influenza)
Impact of Communicable Diseases
Disease Burden
CDs account for about 30% of the global BoD and 60% of the BoD in Africa.
CDs typically affect LIC and MICs disproportionately.
- Account for 40% of the disease burden in low and middle income countries
Most communicable diseases are preventable or treatable.
Social Impact
Disruption of family and social networks
- Child-headed households, social exclusion
Widespread stigma and discrimination
- TB, HIV/AIDS, Leprosy
- Discrimination in employment, schools, migration policies
Orphans and vulnerable children
- Loss of primary care givers
- Susceptibility to exploitation and trafficking
Interventions such as quarantine measures may aggravate the social disruption
Economic Impact
At the macro level
- Reduction in revenue for the country (e.g. tourism)
- Drop in international travel to affected countries by 50-70%
- Malaria causes an average loss of 1.3% annual GDP in countries with intense transmission
At the household level
- Poorer households are disproportionately affected
- Substantial loss in productivity and income for the infirmed and caregiver
- Catastrophic costs of treating illness
Malaria
In 2022, there were an estimated 249 million malaria cases worldwide
- causes 30% of Low birth weight in newborns Globally.
In 2022, malaria killed an estimated 608,000 people. Malaria kills a child under 5 approximately every minute
40% of the world's population is at risk of malaria. Most cases and deaths occur in SSA.
Malaria is the 9th leading cause of death in LICs and MICs
- 11% of childhood deaths worldwide attributable to malaria
- SSA children account for 82% of malaria deaths worldwide
Source: WHO World Malaria Report 2023

Malaria Control
Malaria control
- Early diagnosis and prompt treatment to cure patients and reduce parasite reservoir
- Vector control:
- Indoor residual spraying
- Long lasting insecticide treated bed nets
- Intermittent preventive treatment of pregnant women
Challenges in malaria control
- Widespread resistance to conventional anti-malaria drugs
- Malaria and HIV
- Health Systems Constraints
- Access to services
- Coverage of prevention interventions
Malaria Geographic Distribution

Global HIV Burden
39.9 million people living with HIV (2023)

HIV/AIDS
In 2023, 39.9 million people worldwide were living with HIV, of which 67% live in SSA
- 4.1 million people worldwide became newly infected
- 2.8 million people lost their lives to AIDS
New infections occur predominantly among the 15-24 age group.
First identified in the early 1980s. Has affected over 85 million people since the start of the epidemic.
Source: UNAIDS Global AIDS Update 2023
HIV Co-infections
Impact of TB on HIV
- TB considerably shortens the survival of people with HIV/AIDS.
- TB kills up to half of all AIDS patients worldwide.
- TB bacteria accelerate the progress of AIDS infection in the patient
HIV and Malaria
- Diseases of poverty
- HIV infected adults are at risk of developing severe malaria
- Acute malaria episodes temporarily increase HIV viral load
- Adults with low CD4 count more susceptible to treatment failure
Air Quality and Public Health

Water and Public Health
“No other single intervention in the history of medicine has saved as many lives and reduced as much suffering as the provisioning of uncontaminated water,” - Paul Edward
One billion people in the world lack access to clean drinking water
- A leading cause of death worldwide
- An estimated 1.4 million people die each year from inadequate water, sanitation and hygiene
Source: WHO, 2023

Sanitation and Public Health
Minimum Standards
- Safe disposal excreta and sullage (greywater)
- Avoid disposal within 15 meters of any water source
- Provision of drainage
- Disposal of waste
- Control of insect and rodents
Intro to Global Health
Part 3 - Examples
Water Quality & Dose-Response
Adherence to Clean Drinking Water Consumption

Dose-Response

Intro to Global Health
Part 3 – Epidemiology
Epidemiology — Defined
Adapted from: Last JM, ed. A dictionary of epidemiology. 2nd ed. Toronto, Canada: Oxford University Press; 1988.
Study of the distribution and determinants of health-related states among specified populations and the application of that study to the control of health problems
Purposes in Public Health Practice
Discover the agent, host, and environmental factors that affect health
Determine the relative importance of causes of illness, disability, and death
Identify those segments of the population that have the greatest risk from specific causes of ill health
Evaluate the effectiveness of health programs and services in improving population health

Epidemiology Key Terms
epidemic or outbreak: disease occurrence among a population that is in excess of what is expected in a given time and place.
cluster: group of cases in a specific time and place that might be more than expected.
endemic: disease or condition present among a population at all times.
pandemic: a disease or condition that spreads across regions.
rate: number of cases occurring during a specific period; always dependent on the size of the population during that period.
Measurement of Health Status
Cause of death
- Obtained from death certification but limited because of incomplete coverage
Life expectancy at birth
- The average number of years a new-borns baby could expect to live if current trends in mortality were to continue for the rest of the new-born's life
Maternal mortality rate
- The number of women who die as a result of childbirth and pregnancy related complications per 100,000 live births in a given year
Infant mortality rate
- The number of deaths in infants under 1 year per 1,000 live births for a given year
Neonatal mortality rate
- The number of deaths among infants under 28 days in a given year per 1,000 live births in that year
Child mortality rate
- The probability that a new-born will die before reaching the age of five years, expressed as a number per 1,000 live births
Solving Health Problems
Step 1
Data
collection
Action
Solving health problems
Assessment
Hypothesis
testing
Action
Step 2
Step 3
Step 4
Step 1 -
Surveillance; determine time, place, and person
Inference
Determine how and why
Intervention
Step 1 -
Step 2
Step 3
Step 4

Scenario: Unexplained Pneumonia
July 21–24
July 26–Aug 1
August 2
(Morning)
August 2
(Evening)
American Legion Convention,
Philadelphia, Pennsylvania
18 deaths reported among conventioneers
Health care provider at a veterans’ hospital in Philadelphia calls CDC to report casesof severe respiratory illness among attendees of the American Legion Convention
71 additional cases reported
Fraser DW, Tsai, T, Orenstein W, et al. Legionnaires’ disease: description of an epidemic of pneumonia. New Engl J Med 1977;297:1189–97.
Epidemiology Study Types
Epidemiology study
types
Experimental
Observational
Descriptive
Analytic
Confounding
Occurs when an extrinsic factor is associated with a disease outcome and, independent of that association, is also associated with the exposure
Exposure Outcome
Confounder
Measuring the association between exposure and outcome variables
The appropriate measure of association to use depends on the nature of the data
When exposure and outcome variables are dichotomous (two-level nominal data)
- Odds ratio – use with case-control study (observational)
- Risk ratio – use with cohort study (controlled)
- Rate ratio – use with cohort study (controlled)
“Risk” refers to the probability of occurrence of an event or outcome.
“Odds” refers to the probability of occurrence of an event/probability of the event not occurring.
Epidemiological Formulas
Rate Formula
To calculate a rate, we first need to determine the frequency of disease, which includes:
the number of cases of the illness or condition
the size of the population at risk
the period during which we are calculating the rate

Equations Based on 2x2 Table

Worked Example: Odds Ratio
OR =

Example
OR =
Smokers
Non-smokers

Expressing RRs as Percentages
We can also express these RRs as percent change
- RR > 1 % Increase Change = (RR – 1) × 100
- RR < 1 % Decrease Change = (1 – RR) × 100
When does the odds ratio approximate the risk ratio?
For health-related states or events that are rare (i.e., affecting less than 10% of the population),a + b can be approximated by b, and c + d can be approximated by d
Surveillance & Public Health Approach
Goal of Intervention
Develop strategies for particular groups to engage in change toward health
Provide a support system to initiate change and sustain positive behaviors
A Public Health Approach
Surveillance → Risk Factor Identification → Intervention → Implementation → Evaluation
Approaches to Interventions
Personal responsibility and action
Utilitarian Approaches – "Greatest good for the greatest number"
- Including non Health Systems Interventions.
Regulations and Laws
Partnerships and Collaboration
Enlightened Self Interest
Education
Develop favorable attitudes towards the behavior
Training (i.e. Community health workers)
Participatory engagement
Provide sustainable access
Utilizing underlying skills of the local people
Supportive Environment
- Utilizing families, local organizations, community leaders, policy makers
Behavior Change
Why Do People NOT Change Behavior?
Do they understand the message?
Do they see themselves as vulnerable?
Do they trust the ones who present the message?
Do they think the benefits of change are worth the long-term benefits?
Is it too costly?
Does change contradict with their religious beliefs?
Health Belief Model
Perceived susceptibility – risk of acquiring the disease
Perceived severity – perception on the risk of acquiring the disease
Perceived benefits – is it worth the change?
Perceived barriers – obstacles to achieving health change
Cue to action – what will trigger this change?
Self-efficacy – how confident is the person to successfully perform a behavioral change?
Randomized Controlled Trial (RCT)

Rwanda Tubeho Neza
Water Filter & Cookstove Programme
Programme Overview
The Tubeho Neza ("Live Well") programme was one of the world's first carbon-credit-financed health interventions, distributing water filters and improved cookstoves across Western Province, Rwanda.
- First-ever UN CDM and Gold Standard programmes earning carbon credits for water treatment
- Carbon credit revenue funded distribution, training, and ongoing monitoring
- Implemented through Rwanda's existing Community-Based Environmental Health Promotion Programme (CBEHPP)
- Community health workers (CHWs) served as the primary delivery and behavior change channel
- Cluster-randomized controlled trial design to rigorously evaluate health impacts
Evan A. Thomas, PhD, PE, MPH — Director, Professor, University of Colorado Boulder

Field & Regional Context
Programme Implementation & Monitoring
Key Results
- 73% reduction in indoor air pollution among outdoor cooks
- 38% reduction in cryptosporidium exposure seroconversion
- 97.5% reduction in fecal contamination of drinking water (RCT finding)
- 48% reduction in cooking area air pollution
- 25% reduction in acute respiratory infections in children under 5
- Over 90% adoption rate maintained through CHW-delivered behavior change
Sources: Thomas et al., Lancet Planetary Health (2018); Kirby et al., PLOS ONE (2014)
Technology & Digital Monitoring
Water Filters
- LifeStraw Family 2.0 household water filters
- Significant microbiological effectiveness reducing E. coli contamination
- Free distribution with carbon waiver for credit generation
Remote Sensing Innovation
- Electronic sensors remotely transmitting usage data
- Sensor-reported use was substantially lower than self-reported use
- Demonstrated critical value of objective digital monitoring
- Published in ACS Environmental Science & Technology
Carbon Credit Model
- Pay-for-performance model funded by voluntary carbon credits
- Health, livelihood, and environmental benefits substantially outweighed costs
- Fuel savings and averted healthcare costs = largest economic gains
Scale & Lessons Learned
Key Lessons
- Integration with existing community health infrastructure (CBEHPP) improved scalability
- Geographic accessibility and CHW engagement were key determinants of sustained adoption
- Objective sensor monitoring revealed gap between self-reported and actual use
- Carbon credit financing created sustainable revenue for operations and maintenance
- Combining hardware distribution with behavior change messaging achieved >90% uptake
Published Research (13 Papers)
| Title | Journal | Link |
|---|---|---|
| Health, livelihood, and environmental impacts of the Tubeho Neza programme | The Lancet Planetary Health | Open |
| Effects of adding household water filters to Rwanda's CBEHPP | Nature — npj Clean Water | Open |
| Assessing Impact of Water Filters and Cookstoves: A Randomised Controlled Trial | PLOS ONE | Open |
| Designing and Piloting a Program to Provide Water Filters and Cookstoves | PLOS ONE | Open |
| Cost-benefit analysis of livelihood, environmental and health benefits | ScienceDirect | Open |
| Use, microbiological effectiveness and health impact of a household water filter | ScienceDirect | Open |
| Study design of a cluster-randomized controlled trial | ScienceDirect | Open |
| Process evaluation and assessment of use | BMC Public Health | Open |
| Use of Remotely Reporting Electronic Sensors | ACS Env. Sci. & Tech. | Open |
| Integration of Household Water Filters with Community-Based Sanitation | MDPI Sustainability | Open |
| Geospatial-temporal, demographic, and programmatic adoption characteristics | Cogent Engineering | Open |
| Assessing use, exposure, and health impacts (Dissertation) | Semantic Scholar | Open |
| Lessons from Rwanda on tackling unsafe drinking water and air pollution | The Conversation | Open |
Research Methodology
Methodological Considerations in Evaluating Global Health Interventions
The Tubeho Neza Program
Central Case Study: Tubeho Neza ("Live Well") Trial — Large-scale distribution of water filters and cookstoves to 101,000+ households in Western Province, Rwanda
The Intervention
- Free distribution + promotion of LifeStraw Family 2.0 water filter and EcoZoom Dura rocket cookstove
- To 101,000+ households (poorest 25%) in Western Province, Rwanda
- Delivered by community health workers
The Evaluation Questions
- Did this program actually reduce diarrhea in children under 5? Acute respiratory infections (ARI)?
- Did it improve household water quality? Personal air quality (PM2.5)?
Published: Kirby, Nagel et al. (2019) PLoS Medicine | Design: Nagel, Kirby et al. (2016) CCTC
Biased Approaches to Evaluation
Biased Approach #1: Before vs. After
Diarrhea prevalence drops from 15% before distribution to 9% after. Tempting conclusion: a 40% reduction.
- Seasonality: The rainy season ended between measurements — diarrhea is seasonal
- Concurrent changes: Rotavirus vaccine campaigns, economic growth, other WASH programs
- Regression to mean: Starting during a disease spike means prevalence naturally returns to average
Before/after comparisons confound the intervention effect with all temporal changes.
Biased Approach #2: Users vs. Non-Users
Filter users have 40% less diarrhea — but users are more educated, practice handwashing, have better healthcare access.
This is selection bias: the comparison group is not exchangeable with the treatment group.
Rwanda Trial: Baseline diarrhea was 15.3% (intervention) vs 13.7% (control) — already different before intervention.
The Three Systematic Threats to Validity
Every epidemiological study faces two kinds of error. Random error shrinks with larger samples. Systematic error — bias — does not.
C — Confounding
A third variable distorts the exposure-outcome relationship because it is associated with both.
e.g., Wealthier households use filters AND have less diarrhea
S — Selection Bias
The study sample is not representative, or loss to follow-up differs systematically between groups.
e.g., Analyzing only clinic visitors misses healthy children
I — Information Bias
Systematic errors in how exposure, outcome, or covariates are measured.
e.g., Caregivers over-report filter use (social desirability)
Confounding, Selection Bias & Information Bias
Confounding: When a Third Variable Creates a False Association
Three criteria for a confounder: (1) associated with the exposure, (2) independently associated with the outcome, (3) not on the causal pathway.
Control methods: Design: randomization, restriction, matching. Analysis: stratification, regression, propensity scores.
Selection Bias: When Who Gets Studied Distorts Findings
- Loss to follow-up: Differential attrition biases the comparison
- Collider bias: Conditioning on a common effect creates spurious associations
- Volunteer/self-selection: Study volunteers differ systematically from non-volunteers
Information Bias: When Measurement Is Systematically Wrong
- Differential misclassification: Error differs between groups (recall bias, courtesy bias)
- Non-differential misclassification: Equal error across groups — biases toward the null
Recognizing Bias & Direction of Effects
| Stage | Critical Question | Bias Addressed |
|---|---|---|
| Design | Is treatment assignment independent of confounders? | Confounding |
| Design | Can participants be blinded to assignment? | Information |
| Enrollment | Is the sample representative of the target population? | Selection |
| Follow-Up | Is attrition differential between groups? | Selection |
| Measurement | Are outcomes assessed identically in all groups? | Information |
| Analysis | Have confounders been identified via DAG and adjusted? | Confounding |
Direction of Bias
- Non-differential misclassification: Toward null — dilutes the true effect
- Uncontrolled confounding: Either direction — depends on the confounder
- Differential misclassification: Either direction — unpredictable (recall bias, courtesy bias)
Study Design Hierarchy for Causal Inference
Each step down the hierarchy requires stronger assumptions to claim causation.
Randomized Controlled Trial (RCT)
Randomization creates exchangeable groups. Gold standard for causal inference.
Quasi-Experimental Designs
Exploit natural variation (DiD, RDD, ITS). No random assignment but clever design.
Cohort Study
Follow exposed/unexposed over time. Temporal ordering established.
Case-Control Study
Identify cases, select controls, look back. Efficient for rare outcomes.
Cross-sectional study: Snapshot — exposure and outcome measured simultaneously. Weakest for causation.
The Counterfactual & Potential Outcomes
Six months after distribution, 7-day diarrhea prevalence among children under 5 is 8.7%. Is that good or bad?
You cannot answer without knowing what the prevalence would have been without the intervention. This unobservable quantity is the counterfactual.
If counterfactual = 13%
29% reduction. Scale nationally.
If counterfactual = 8%
No effect. Don't waste resources.
The Potential Outcomes Framework (Neyman-Rubin)
For each child i: Yi(1) = outcome if treated; Yi(0) = outcome if not treated.
Causal effect = Yi(1) − Yi(0). The fundamental problem: we can only observe ONE potential outcome per child.
Average Treatment Effect (ATE) = E[Yi(1)] − E[Yi(0)]. Randomization makes the control group an unbiased estimate of the treated group's counterfactual.
Why Randomization Solves Confounding
Randomization ensures that, on average, treatment and control groups are identical in every way except the intervention — including unmeasured confounders.
What Randomization Gives You
Exchangeability (no confounding) • Unbiased ATE estimation • Valid statistical inference • Transparent, pre-specified analysis
What It Does NOT Give You
Protection from attrition bias • Protection from measurement bias • External validity / generalizability
Intention to Treat (ITT)
The ATE of being assigned to treatment, regardless of compliance. In Rwanda: the effect of living in a sector that received filters — not the effect of actually using the filter. ITT is the primary estimand for RCTs.
Rwanda Trial: Randomization at the sector level balanced observed covariates (age, sex, water source, sanitation type).
Individual vs. Cluster Randomization
Individual Randomization
Each person randomly assigned. Gold standard for independence. Problem: impractical for community-level interventions.
Cluster Randomization
Groups (villages, sectors) randomized. Prevents contamination. Cost: fewer independent units = less power. Requires GEE or mixed models.
Rwanda Trial: 96 sectors cluster-randomized 3:1. Sectors contain ~40 villages each. Stratified by district (7 districts). 3:1 ratio driven by programmatic goal of reaching 75% of the province in year 1.
Rwanda Trial: Two Studies in One
Sector-Level Study (Population Scale)
All ~82,000 children < 5 in 96 sectors. Data from health facility records and CHW reports.
Village-Level Sub-Study (Intensive Data)
174 village clusters, 1,582 households. 1:1 ratio via PPES sampling. Surveys, water sampling, PM2.5 monitoring, sensors.
Design lesson: When feasible, nest an intensive sub-study within a larger pragmatic trial.
Randomization vs. Random Sampling
Two distinct uses of randomness serving fundamentally different purposes.
Random Sampling
Purpose: Select who enters the study
Protects: External validity (generalizability)
Threat if absent: Results may not generalize
Randomization (Random Allocation)
Purpose: Assign who receives treatment
Protects: Internal validity (causal inference)
Threat if absent: Confounding
| With Randomization | Without Randomization | |
|---|---|---|
| With Sampling | Strongest: causal + generalizable (rare) | Generalizable, not causal (e.g., DHS) |
| Without Sampling | Causal, limited scope (most RCTs, e.g., Tubeho Neza) | Weakest: neither causal nor generalizable |
Key insight: Most trials have randomization without random sampling — establishing causation within a specific context.
Stratification & Quasi-Experimental Designs
Stratified Randomization
Ensures treatment/control balance on key variables. With only 96 clusters, pure randomization could produce imbalanced groups. Stratifying by district (7 districts) ensures proportional representation. Restricted randomization: generate many allocations, reject those with poor covariate balance.
Difference-in-Differences (DiD)
Compare change over time in treated vs. untreated groups. ΔTreated − ΔControl = causal effect. Key assumption: parallel trends — both groups would have followed the same trajectory without intervention.
Other Quasi-Experimental Designs
- Regression Discontinuity (RDD): Exploit an eligibility cutoff. Households just above/below the Ubudehe threshold are similar but one gets the intervention. Estimates local effect at the cutoff.
- Interrupted Time Series (ITS): Track outcomes over time; look for level/slope change at intervention point. Requires long pre-intervention series (8+ data points).
Reliability & Validity of Measurement
Any measurement instrument must demonstrate both before its data can be trusted.
Reliability (Consistency)
A reliable instrument yields the same result repeatedly. Necessary but not sufficient — consistent measurement can still be consistently wrong.
Types: Test-retest (ICC, Pearson r) • Inter-rater (Cohen’s κ) • Intra-rater (ICC) • Internal consistency (Cronbach’s α)
Validity (Accuracy)
A valid instrument measures the construct it claims to measure. Without validity, reliability is meaningless.
Types: Content (expert panels) • Criterion (sensitivity/specificity, AUC-ROC) • Construct (CFA, convergent/discriminant)
Key Metrics
| Metric | Range | Poor | Moderate | Good | Excellent |
|---|---|---|---|---|---|
| ICC | 0–1 | < 0.50 | 0.50–0.75 | 0.75–0.90 | > 0.90 |
| Cohen’s κ | −1 to 1 | < 0.20 | 0.21–0.60 | 0.61–0.80 | > 0.80 |
| Cronbach’s α | 0–1 | < 0.60 | 0.60–0.70 | 0.70–0.90 | > 0.90 |
How Reliability & Validity Interact
Low R, Low V
Scattered, off-center. Needs complete redesign.
High R, Low V
Tightly clustered, off-center. Most deceptive: looks precise but systematically wrong.
Low R, High V
Scattered around center. On average correct but too noisy. Increase sample size.
High R, High V
Tightly clustered on center. The gold standard.
Pitfalls: Assuming reliability implies validity • Validating in one population but applying in another • Reporting only Cronbach’s α without factor structure
Disease Frequency: Incidence & Prevalence
Incidence: New Cases Over Time
- Cumulative incidence (risk): New cases / Population at risk at start. e.g., 50/500 = 10% over 6 months
- Incidence rate: New cases / Person-time at risk. e.g., 50/250 person-years = 0.20/person-year. Accounts for variable follow-up.
Person-time: A child observed 6 months contributes 0.5 person-years.
Prevalence: Proportion with Disease
- Point prevalence: Cases at one time / Total population. A snapshot mixing new and ongoing cases.
- Period prevalence: Cases during a period / Population. "Has your child had diarrhea in the past 7 days?"
Prevalence = Incidence × Duration. Treatment that shortens episodes reduces prevalence without changing incidence.
Rwanda Trial: Primary outcome was 7-day period prevalence of caregiver-reported diarrhea and ARI. Repeated cross-sectional visits at ~4-month intervals.
Measures of Association: Relative & Absolute
Relative Measures
Rwanda Trial primary estimand: Prevalence Ratio from log-binomial GEE.
Values < 1 indicate a protective effect; > 1 indicates increased risk.
Absolute Measures
- Risk/Prevalence Difference: RD = Risk(exposed) − Risk(unexposed). Rwanda diarrhea: 8.7% − 12.9% = −4.2 percentage points
- Number Needed to Treat: NNT = 1/|RD|. Rwanda: 1/0.042 ≈ 24 households per case prevented in a 7-day window
Relative measures tell you how strong the effect is. Absolute measures tell you how much disease you actually prevent. Always report both.
Self-Report vs. Sensor & Evaluation Design
Filter use declined: 75.5% → 67.6% → 64.8%. Traditional stove use increased: 24.1% → 49.4% ("stove stacking").
Implications for Evaluation Design
- Triangulate: Self-report + observation + sensor + biomarker. Avoid reliance on a single method.
- Negative controls: Include outcomes that shouldn't be affected (Rwanda: toothache). If treatment reduces toothache, you have a bias problem.
- Blinding matters: Unblinded trials with subjective outcomes overestimate effects. Pooled masked HWT trials found no significant effect on diarrhea.
- Plan measurement validation: Budget for sensor sub-studies and spot-checks — these are essential, not extras.
The best study design in the world is undermined by poor measurement.
The Core Problem of Statistical Inference
We want to learn about a population, but we can only observe a sample.
Population
Parameters (fixed, unknown): True mean diarrhea prevalence across all children under 5 in Western Province, Rwanda
Sample
Statistics (variable, calculated): Observed prevalence of 8.7% from a survey of 2,000 households in 30 randomly selected villages
Key Concepts
- Sampling distribution: Distribution of a statistic across all possible samples of the same size
- Central Limit Theorem: Sampling distribution of the mean approaches normal as n increases
- Standard error: SD of the sampling distribution — measures estimate precision. Larger samples → smaller SEs
Key intuition: Probability distributions let us quantify uncertainty — which results would be common vs. surprising given a particular assumption.
The Logic of Hypothesis Testing
1. Assume No Effect
H0: Filter distribution has no effect on diarrhea prevalence.
2. Calculate Expected
Under H0, how much would prevalence vary between groups by chance?
3. Compare Observed
You observed a 4.3 pp difference. Test statistic t = 3.7.
4. Quantify Surprise
p = 0.002. Only 0.2% of random samples would show this large a difference.
Proof by contradiction: assume no effect exists, show the data would be very unlikely under that assumption, conclude the effect is probably real.
Two Ways to Be Wrong
| Reality: H0 TRUE | Reality: H0 FALSE | |
|---|---|---|
| Reject H0 | Type I Error (α) — false positive | Correct (Power = 1 − β) |
| Fail to reject | Correct (true negative) | Type II Error (β) — false negative |
P-Values & Confidence Intervals
What the P-Value IS
The probability of observing data at least as extreme as your result, assuming H0 is true. p = 0.03 means: if there were truly no effect, you'd see this result only 3% of the time.
What the P-Value is NOT
- NOT the probability that H0 is true
- NOT the probability of a false positive
- NOT a measure of effect size
α = 0.05 is a convention, not a law. Some fields use 0.01 or 0.10. Set before analyzing data.
Confidence Intervals: Beyond Yes/No
If you repeated your study 100 times and computed a 95% CI each time, approximately 95 of those intervals would contain the true parameter. CIs provide the range of plausible effect sizes.

Choosing the Right Statistical Test
Before choosing a test, answer three questions: (1) What is your outcome variable? (2) How many groups? (3) Are assumptions met?
| Outcome | Scenario | Parametric | Non-Parametric |
|---|---|---|---|
| Continuous | 1 sample vs. known value | One-sample t-test | Wilcoxon signed-rank |
| 2 independent groups | Independent t-test | Mann-Whitney U | |
| 3+ groups | One-way ANOVA | Kruskal-Wallis | |
| Categorical | 2×2 independent | Chi-square test (Fisher’s exact if cell < 5) | |
| Paired before/after | McNemar’s test | ||
| Multiple predictors | Logistic regression | ||
| Time-to-event | Estimate/compare curves | Kaplan-Meier / Log-rank test | |
| Model with covariates | Cox proportional hazards regression | ||
Go non-parametric when: n < 30 and non-normal • Skewed data (income, costs) • Ordinal scales (Likert, staging)
Correlation, Regression & Survival Analysis
Correlation
- Pearson’s r: Both continuous and normal. Linear relationship. e.g., income vs. water quality
- Spearman’s rho: Ordinal or non-normal. Monotonic relationship. e.g., education level vs. health knowledge rank
Regression
- Linear regression: Continuous outcome. Y = b0 + b1X + e. e.g., predict child BMI from diet and activity
- Logistic regression: Binary outcome. Log-odds as linear function. e.g., predict diarrhea Y/N from WASH variables
- Poisson / Negative binomial: Count outcome. Rate of events per unit time. e.g., clinic visits per month
Survival Analysis
- Kaplan-Meier: Estimate survival curves. Handles censoring
- Log-Rank test: Compare survival between 2+ groups
- Cox regression: Model hazard with covariates. Produces hazard ratios
Statistical Test Decision Tree
What is your outcome variable?
Continuous
1 sample → t-test / Wilcoxon
2 groups → t-test / Mann-Whitney
3+ groups → ANOVA / Kruskal-Wallis
Paired → Paired t / Wilcoxon
Categorical
Independent → Chi-square / Fisher’s
Paired → McNemar’s test
Model → Logistic regression
Time-to-Event
Estimate → Kaplan-Meier
Compare → Log-rank test
Model → Cox regression
Quick Rules of Thumb
- Continuous outcome + 2 groups → start with a t-test
- Binary outcome → chi-square for comparison, logistic regression for modeling
- Censored follow-up time → survival analysis (Kaplan-Meier, log-rank, Cox)
- Clustered data → mixed models or GEE, not standard tests
Rwanda Trial: Primary Analysis in Detail
The Analytic Pipeline
- Outcome: Binary (diarrhea yes/no in past 7 days)
- Model: Log-binomial regression → directly estimates prevalence ratio
- Clustering: GEE with exchangeable working correlation, robust (sandwich) SEs at village level
- Covariates: Child age (months) and sex only — minimal adjustment since randomization handles confounding
- Weights: Sampling weights to account for PPES design in village-level sub-study
Water quality improved significantly. Air quality did not — consistent with stove stacking behavior detected by sensors.
The Issue of Clustering
Children within the same village share water sources, sanitation, disease ecology, climate, economic conditions, CHW quality, and health facility access. Their health outcomes are correlated — not independent.
Intraclass Correlation & Design Effect
ICC = σ²(between) / [σ²(between) + σ²(within)] — proportion of variance due to between-cluster differences.
Design Effect (DEFF) = 1 + (m − 1) × ICC, where m = average cluster size. Effective sample size = n / DEFF.
| Level | Outcome | ICC | Design Effect |
|---|---|---|---|
| Within-village | Diarrhea | 0.02 | 1.18 (m=10) |
| Within-village | ARI | 0.04 | 1.36 (m=10) |
| Sector-level | Diarrhea | — | 9.5 |
| Within-child (repeated) | Diarrhea | 0.09 | — |
Ignoring Clustering & Sample Size
What Happens When You Ignore Clustering
If you analyze 5,000 children from 10 clusters as independent observations, your nominal 5% significance level may actually be 20–30%. This is one of the most common methodological errors in global health research.
Sample Size Under Clustering
The number of clusters matters more than individuals per cluster.
Clusters needed per arm: k = (Zα + Zβ)² × DEFF × 2σ² / δ². Adding more people per cluster gives diminishing returns once m × ICC > 1.
Better to have 20 clusters of 50 than 10 clusters of 100. Always calculate cluster-adjusted sample sizes.
Analytic Solutions for Clustering
Cluster-Robust SEs
"Sandwich" standard errors. Inflate SEs to account for within-cluster correlation. Requires ≥30 clusters. Simplest solution.
GEE
Generalized Estimating Equations. Specify working correlation structure. Robust SEs protect against misspecification. Population-averaged. Used for Rwanda primary analysis.
Mixed Models
Random effects / multilevel models. Estimate cluster-specific parameters. Handle multiple nesting levels. Cluster-specific interpretation.
For policy questions ("what’s the average effect?"), GEE is preferred. For prediction ("what will happen in this village?"), mixed models are preferred.
Rwanda Trial: GEE with exchangeable working correlation and robust SEs, clustering at village level.
Missing Data Mechanisms
Rubin’s classification defines why data are missing — and determines which analytic methods are valid.
MCAR — Least Concern
Missingness unrelated to any data. Loss of power but no bias. Complete-case analysis valid.
e.g., Lab machine randomly malfunctions
MAR — Moderate Concern
Missingness depends on observed data. Biased if ignored. Recoverable with MI or ML methods.
e.g., Younger participants miss follow-up
MNAR — Greatest Concern
Missingness depends on the unobserved value itself. Biased under all standard methods. Requires sensitivity analyses.
e.g., Depressed patients don’t complete depression scale
Analytic Solutions for Missing Data
| Method | Valid Under | How It Works |
|---|---|---|
| Complete-Case Analysis | MCAR | Delete cases with missing values. Simple but wasteful. Biased unless MCAR. |
| Single Imputation | MCAR | Replace with mean/median/LOCF. Underestimates variance. Rarely recommended. |
| Multiple Imputation (MI) | MAR | Create m complete datasets (20–50) with plausible imputed values. Pool results via Rubin’s rules. |
| Maximum Likelihood (ML) | MAR | Estimate parameters directly from observed data likelihood (EM, FIML). No imputed datasets needed. |
| Inverse Probability Weighting | MAR | Weight complete cases by inverse probability of being observed. Combines with MI for doubly robust estimation. |
| Sensitivity Analysis | MNAR | Vary assumptions about missing values. Pattern-mixture models, selection models, tipping-point analyses. |
External Validity: Not All Water Treatment Works
Randomized trials of chlorination in Bangladesh, Kenya, and Zimbabwe found NO significant effect on childhood diarrhea.
Pathogen Profile
Chlorine is ineffective against Cryptosporidium. LifeStraw removes it through physical ultrafiltration. Intervention type matters.
Measurement & Blinding
Chlorination trials with blinded outcomes (sham chlorine) found no effect. Unblinded self-reported outcomes may overestimate effects.
The generalizability question isn’t "does water treatment work?" It’s "which treatment, against which pathogens, in which context, with what delivery model?"
Internal vs. External Validity
The Rwanda trial has strong internal validity (cluster-RCT, pre-registered, large sample). But results are specific to: Western Province, Rwanda | Poorest quartile | Rural | LifeStraw + EcoZoom | CHW-delivered with carbon credit financing.
Effect Modification: Why Effects Vary
Treatment effects are not universal constants. They vary across subgroups and contexts.
Biological Modifiers
- Age: children < 2 more susceptible
- Nutritional status
- HIV/immune status
- Pathogen profile: filter vs. chlorine
Contextual Modifiers
- Baseline water quality
- Baseline disease burden
- Competing transmission routes
Implementation Modifiers
- CHW network quality
- Supply chain reliability
- Cultural acceptability
- Government support
Always report subgroup analyses and discuss which modifiers might limit transportability. Rwanda’s strong CHW system is unusual — replication elsewhere may yield smaller effects.
Efficacy → Effectiveness → Scale
Why Effects Shrink at Scale
- Adherence: Self-reported filter use declined 75.5% → 64.8%. Sensor data: actual use ~37%.
- Stove stacking: Traditional fire use increased 24.1% → 49.4%. No exclusive adoption → no air quality improvement.
- Implementation variation: Hundreds of CHWs with varying training, supply chain delays, seasonal access challenges.

Global Burden of Disease Lab
Water quality QMRA & air quality cost-effectiveness analysis using R, GBD Compare, and HAPIT.
Open Full Lab Assignment →