Global Health for Engineers

2

Course Structure

An introduction to the professional field of Global Health, particularly focused on the areas of global health that engineers often contribute to – i.e. community environmental health. In a short course, we won’t have time to touch on things like health systems organization but there are many resources for additional learning.

We will be reading published peer reviewed studies of global health interventions – learning how to search for, read and analyze these kinds of studies is fundamental to being conversant in the field of global health. Many of these papers have engineers as co-authors.

This class is also an opportunity for group and 1:1 discussion. I am happy to meet with each of you individually to discuss career objectives and networking

Class will be heavy on case studies, with student-led facilitation

All readings posted in Canvas. Textbook is self-paced with online Canvas quizzes

See Canvas Syllabus for link to class schedule

3

Evan Thomas

Professor

Environmental Engineering Program
Civil, Environmental and Architectural Engineering Dept.
Aerospace Engineering Sciences Department

Mortenson Endowed Chair in Global Engineering

Director, Mortenson Center in Global Engineering

University of Colorado Boulder

PhD, Aerospace Engineering Sciences, 2009
MS, Aerospace Engineering Sciences, 2006
BS, Aerospace Engineering Sciences, 2005
BS, Broadcast Journalism, 2005

Oregon Health and Science University

MPH, Master in Public Health, 2014

Fletcher School at Tufts University

Global MBA, 2022

NASA Johnson Space Center, Aerospace Engineer, 2004-2010

Portland State University, Assistant/Associate Professor, 2010-2018

Virridy Inc, Founder and CEO, 2012 – Present

Manna Energy Limited / DelAgua Health, Founder, COO, 2007-2016

~80 journal articles, 10 patents, professional work in 16 countries

4

Section

Intro to Global Health
Part 1 - Overview

5

What is Global Health?

"An area for study, research, and practice that places a priority on improving health and achieving equity in health for all people worldwide. Global health emphasizes transnational health issues, determinants, and solutions; involves many disciplines within and beyond the health sciences and promotes interdisciplinary collaboration; and is a synthesis of population-based prevention with individual-level clinical care."

— Consortium of Universities for Global Health

Overview Objectives

Define the determinants, including social and economic, that impact health.

Highlight the differences in disease and life expectancy between high- and low-income countries.

Identify some of the dynamics in developing countries that impact health trends.

Key Concepts

The determinants of health

The measurement of health status

The importance of culture to health

The global burden of disease

Key risk factors for health problems

Organisation of health systems

Disciplines: Public Health · Public Policy · Medicine · Social Sciences · Behavioural Sciences · Law · Economics · History · Engineering · Biomedical Sciences · Environmental Sciences · Anthropology

6

Disease & Determinants of Health

Examples of disease that disproportionately impact developing countries

Malaria

Diarrhea

Pneumonia

HIV / Aids

99% of the children under the age of 5 who die every year lived in developing countries.

Determinants of Health

Genetics

Age

Gender

Lifestyle choices

Community influences

Income status

Geographical location

Urbanization

Climate Change

Governance

Culture

Environmental factors

Work conditions

Education

Access to health services

Source: Dahlgren G. and Whitehead M. 1991

7

Multi-sectoral Dimension of the Determinants of Health

Malnutrition –

more susceptible to disease and less likely to recover

Cooking with wood and coal –

lung diseases

Poor sanitation –

more intestinal infections

Poor life circumstances –

commercial sex work and STIs, HIV/AIDS

Advertising tobacco and alcohol –

addiction and related diseases

Rapid growth in vehicular traffic often with untrained drivers on unsafe roads-

road traffic accidents

8

Governance & Armed Conflict

Governance

Governance has a direct impact on socioeconomic status, health inequalities, and development

Allocation of Resources

Control of policy

Trade agreements

Regional politics

Abuse of power

Education

Image: Lvakurwa, 2009

Armed Conflict

The Results:

Disparities over resources and power

Broken relationship with neighboring countries

Lack of development

Inequality along race/gender lines

Resources diverted from health care to support conflict

Displacement

Health Implications:

Malnutrition

Diarrhea

Respiratory infections

AIDS

Extreme poverty

Negative long-term effects on health

9

Section

Intro to Global Health
Part 2 – Global Burden of Disease

10

Explore the Data

Interactive tools for exploring the Global Burden of Disease:

GBD Compare SDG Visualizations Our World in Data

11

Child Mortality

4.9 million children under age five died in 2022, 13,000 every day.

99% of children who die under the age of 5 are in low and middle income countries.

Source: UNICEF/WHO, 2023

12

Communicable Diseases

Defined as

“any condition which is transmitted directly or indirectly to a person from an infected person or animal through the agency of an intermediate animal, host, or vector, or through the inanimate environment”.

Transmission is facilitated by the following (IOM)

more frequent human contact due to

Increase in the volume and means of transportation (affordable international air travel),
globalization (increased trade and contact)

Microbial adaptation and change
Breakdown of public health capacity at various levels
Change in human demographics and behavior
Economic development and land use patterns

13

CD- Modes of transmission

Direct

Blood-borne or sexual – HIV, Hepatitis B,C
Inhalation – Tuberculosis, influenza, anthrax
Food-borne – E.coli, Salmonella,
Contaminated water- Cholera, rotavirus, Hepatitis A

Indirect

Vector-borne- malaria, onchocerciasis, trypanosomiasis
Formites

Zoonotic diseases – animal handling and feeding practices (Mad cow disease, Avian Influenza)

14

Impact of Communicable Diseases

Disease Burden

CDs account for about 30% of the global BoD and 60% of the BoD in Africa.

CDs typically affect LIC and MICs disproportionately.

Account for 40% of the disease burden in low and middle income countries

Most communicable diseases are preventable or treatable.

Social Impact

Disruption of family and social networks

Child-headed households, social exclusion

Widespread stigma and discrimination

TB, HIV/AIDS, Leprosy
Discrimination in employment, schools, migration policies

Orphans and vulnerable children

Loss of primary care givers
Susceptibility to exploitation and trafficking

Interventions such as quarantine measures may aggravate the social disruption

Economic Impact

At the macro level

Reduction in revenue for the country (e.g. tourism)

Drop in international travel to affected countries by 50-70%
Malaria causes an average loss of 1.3% annual GDP in countries with intense transmission

At the household level

Poorer households are disproportionately affected
Substantial loss in productivity and income for the infirmed and caregiver
Catastrophic costs of treating illness

15

Malaria

In 2022, there were an estimated 249 million malaria cases worldwide

causes 30% of Low birth weight in newborns Globally.

In 2022, malaria killed an estimated 608,000 people. Malaria kills a child under 5 approximately every minute

40% of the world's population is at risk of malaria. Most cases and deaths occur in SSA.

Malaria is the 9th leading cause of death in LICs and MICs

11% of childhood deaths worldwide attributable to malaria
SSA children account for 82% of malaria deaths worldwide

Source: WHO World Malaria Report 2023

16

Malaria Control

Malaria control

Early diagnosis and prompt treatment to cure patients and reduce parasite reservoir
Vector control:

Indoor residual spraying
Long lasting insecticide treated bed nets

Intermittent preventive treatment of pregnant women

Challenges in malaria control

Widespread resistance to conventional anti-malaria drugs
Malaria and HIV
Health Systems Constraints

Access to services
Coverage of prevention interventions

17

Malaria Geographic Distribution

18

Global HIV Burden

39.9 million people living with HIV (2023)

19

HIV/AIDS

In 2023, 39.9 million people worldwide were living with HIV, of which 67% live in SSA

4.1 million people worldwide became newly infected
2.8 million people lost their lives to AIDS

New infections occur predominantly among the 15-24 age group.

First identified in the early 1980s. Has affected over 85 million people since the start of the epidemic.

Source: UNAIDS Global AIDS Update 2023

HIV Co-infections

Impact of TB on HIV

TB considerably shortens the survival of people with HIV/AIDS.
TB kills up to half of all AIDS patients worldwide.
TB bacteria accelerate the progress of AIDS infection in the patient

HIV and Malaria

Diseases of poverty
HIV infected adults are at risk of developing severe malaria
Acute malaria episodes temporarily increase HIV viral load
Adults with low CD4 count more susceptible to treatment failure

20

Air Quality and Public Health

21

Water and Public Health

“No other single intervention in the history of medicine has saved as many lives and reduced as much suffering as the provisioning of uncontaminated water,” - Paul Edward

One billion people in the world lack access to clean drinking water

A leading cause of death worldwide
An estimated 1.4 million people die each year from inadequate water, sanitation and hygiene

Source: WHO, 2023

22

Sanitation and Public Health

Minimum Standards

Safe disposal excreta and sullage (greywater)

Avoid disposal within 15 meters of any water source

Provision of drainage
Disposal of waste
Control of insect and rodents

23

Section

Intro to Global Health
Part 3 - Examples

24

Water Quality & Dose-Response

Adherence to Clean Drinking Water Consumption

Dose-Response

ehp.niehs.nih.gov/doi/10.1289/ehp.1206429

25

Section

Intro to Global Health
Part 3 – Epidemiology

26

Epidemiology — Defined

Adapted from: Last JM, ed. A dictionary of epidemiology. 2nd ed. Toronto, Canada: Oxford University Press; 1988.

Study of the distribution and determinants of health-related states among specified populations and the application of that study to the control of health problems

Purposes in Public Health Practice

Discover the agent, host, and environmental factors that affect health

Determine the relative importance of causes of illness, disability, and death

Identify those segments of the population that have the greatest risk from specific causes of ill health

Evaluate the effectiveness of health programs and services in improving population health

27

Epidemiology Key Terms

epidemic or outbreak: disease occurrence among a population that is in excess of what is expected in a given time and place.

cluster: group of cases in a specific time and place that might be more than expected.

endemic: disease or condition present among a population at all times.

pandemic: a disease or condition that spreads across regions.

rate: number of cases occurring during a specific period; always dependent on the size of the population during that period.

28

Measurement of Health Status

Cause of death

Obtained from death certification but limited because of incomplete coverage

Life expectancy at birth

The average number of years a new-borns baby could expect to live if current trends in mortality were to continue for the rest of the new-born's life

Maternal mortality rate

The number of women who die as a result of childbirth and pregnancy related complications per 100,000 live births in a given year

Infant mortality rate

The number of deaths in infants under 1 year per 1,000 live births for a given year

Neonatal mortality rate

The number of deaths among infants under 28 days in a given year per 1,000 live births in that year

Child mortality rate

The probability that a new-born will die before reaching the age of five years, expressed as a number per 1,000 live births

29

Solving Health Problems

Step 1

Data

collection

Action

Solving health problems

Assessment

Hypothesis

testing

Action

Step 2

Step 3

Step 4

Step 1 -

Surveillance; determine time, place, and person

Inference

Determine how and why

Intervention

Step 1 -

Step 2

Step 3

Step 4

30

Scenario: Unexplained Pneumonia

July 21–24

July 26–Aug 1

August 2

(Morning)

August 2

(Evening)

American Legion Convention,

Philadelphia, Pennsylvania

18 deaths reported among conventioneers

Health care provider at a veterans’ hospital in Philadelphia calls CDC to report casesof severe respiratory illness among attendees of the American Legion Convention

71 additional cases reported

Fraser DW, Tsai, T, Orenstein W, et al. Legionnaires’ disease: description of an epidemic of pneumonia. New Engl J Med 1977;297:1189–97.

31

Epidemiology Study Types

Epidemiology study

types

Experimental

Observational

Descriptive

Analytic

32

Confounding

Occurs when an extrinsic factor is associated with a disease outcome and, independent of that association, is also associated with the exposure

Exposure Outcome

Confounder

33

Measuring the association between exposure and outcome variables

The appropriate measure of association to use depends on the nature of the data

When exposure and outcome variables are dichotomous (two-level nominal data)

Odds ratio – use with case-control study (observational)
Risk ratio – use with cohort study (controlled)
Rate ratio – use with cohort study (controlled)

“Risk” refers to the probability of occurrence of an event or outcome.

“Odds” refers to the probability of occurrence of an event/probability of the event not occurring.

34

Epidemiological Formulas

Rate Formula

To calculate a rate, we first need to determine the frequency of disease, which includes:

the number of cases of the illness or condition

the size of the population at risk

the period during which we are calculating the rate

Equations Based on 2x2 Table

35

Worked Example: Odds Ratio

OR =

Example

OR =

Smokers

Non-smokers

36

Expressing RRs as Percentages

We can also express these RRs as percent change

RR > 1 % Increase Change = (RR – 1) × 100
RR < 1 % Decrease Change = (1 – RR) × 100

37

When does the odds ratio approximate the risk ratio?

For health-related states or events that are rare (i.e., affecting less than 10% of the population),a + b can be approximated by b, and c + d can be approximated by d

38

Surveillance & Public Health Approach

Goal of Intervention

Develop strategies for particular groups to engage in change toward health

Provide a support system to initiate change and sustain positive behaviors

A Public Health Approach

Surveillance → Risk Factor Identification → Intervention → Implementation → Evaluation

39

Approaches to Interventions

Personal responsibility and action

Utilitarian Approaches – "Greatest good for the greatest number"

Including non Health Systems Interventions.

Regulations and Laws

Partnerships and Collaboration

Enlightened Self Interest

Education

Develop favorable attitudes towards the behavior

Training (i.e. Community health workers)

Participatory engagement

Provide sustainable access

Utilizing underlying skills of the local people

Supportive Environment

Utilizing families, local organizations, community leaders, policy makers

40

Behavior Change

Why Do People NOT Change Behavior?

Do they understand the message?

Do they see themselves as vulnerable?

Do they trust the ones who present the message?

Do they think the benefits of change are worth the long-term benefits?

Is it too costly?

Does change contradict with their religious beliefs?

Health Belief Model

Perceived susceptibility – risk of acquiring the disease

Perceived severity – perception on the risk of acquiring the disease

Perceived benefits – is it worth the change?

Perceived barriers – obstacles to achieving health change

Cue to action – what will trigger this change?

Self-efficacy – how confident is the person to successfully perform a behavioral change?

41

Randomized Controlled Trial (RCT)

42

Case Study

Rwanda Tubeho Neza
Water Filter & Cookstove Programme

43

Programme Overview

The Tubeho Neza ("Live Well") programme was one of the world's first carbon-credit-financed health interventions, distributing water filters and improved cookstoves across Western Province, Rwanda.

First-ever UN CDM and Gold Standard programmes earning carbon credits for water treatment
Carbon credit revenue funded distribution, training, and ongoing monitoring
Implemented through Rwanda's existing Community-Based Environmental Health Promotion Programme (CBEHPP)
Community health workers (CHWs) served as the primary delivery and behavior change channel
Cluster-randomized controlled trial design to rigorously evaluate health impacts

Evan A. Thomas, PhD, PE, MPH — Director, Professor, University of Colorado Boulder

44

Field & Regional Context

45

Programme Implementation & Monitoring

46

Key Results

73% reduction in indoor air pollution among outdoor cooks
38% reduction in cryptosporidium exposure seroconversion
97.5% reduction in fecal contamination of drinking water (RCT finding)
48% reduction in cooking area air pollution
25% reduction in acute respiratory infections in children under 5
Over 90% adoption rate maintained through CHW-delivered behavior change

Sources: Thomas et al., Lancet Planetary Health (2018); Kirby et al., PLOS ONE (2014)

47

Technology & Digital Monitoring

Water Filters

LifeStraw Family 2.0 household water filters
Significant microbiological effectiveness reducing E. coli contamination
Free distribution with carbon waiver for credit generation

Remote Sensing Innovation

Electronic sensors remotely transmitting usage data
Sensor-reported use was substantially lower than self-reported use
Demonstrated critical value of objective digital monitoring
Published in ACS Environmental Science & Technology

Carbon Credit Model

Pay-for-performance model funded by voluntary carbon credits
Health, livelihood, and environmental benefits substantially outweighed costs
Fuel savings and averted healthcare costs = largest economic gains

48

Scale & Lessons Learned

Key Lessons

Integration with existing community health infrastructure (CBEHPP) improved scalability
Geographic accessibility and CHW engagement were key determinants of sustained adoption
Objective sensor monitoring revealed gap between self-reported and actual use
Carbon credit financing created sustainable revenue for operations and maintenance
Combining hardware distribution with behavior change messaging achieved >90% uptake

49

Published Research (13 Papers)

Title	Journal	Link
Health, livelihood, and environmental impacts of the Tubeho Neza programme	The Lancet Planetary Health	Open
Effects of adding household water filters to Rwanda's CBEHPP	Nature — npj Clean Water	Open
Assessing Impact of Water Filters and Cookstoves: A Randomised Controlled Trial	PLOS ONE	Open
Designing and Piloting a Program to Provide Water Filters and Cookstoves	PLOS ONE	Open
Cost-benefit analysis of livelihood, environmental and health benefits	ScienceDirect	Open
Use, microbiological effectiveness and health impact of a household water filter	ScienceDirect	Open
Study design of a cluster-randomized controlled trial	ScienceDirect	Open
Process evaluation and assessment of use	BMC Public Health	Open
Use of Remotely Reporting Electronic Sensors	ACS Env. Sci. & Tech.	Open
Integration of Household Water Filters with Community-Based Sanitation	MDPI Sustainability	Open
Geospatial-temporal, demographic, and programmatic adoption characteristics	Cogent Engineering	Open
Assessing use, exposure, and health impacts (Dissertation)	Semantic Scholar	Open
Lessons from Rwanda on tackling unsafe drinking water and air pollution	The Conversation	Open

50

Section

Research Methodology

Methodological Considerations in Evaluating Global Health Interventions

51

The Tubeho Neza Program

Central Case Study: Tubeho Neza ("Live Well") Trial — Large-scale distribution of water filters and cookstoves to 101,000+ households in Western Province, Rwanda

The Intervention

Free distribution + promotion of LifeStraw Family 2.0 water filter and EcoZoom Dura rocket cookstove
To 101,000+ households (poorest 25%) in Western Province, Rwanda
Delivered by community health workers

The Evaluation Questions

Did this program actually reduce diarrhea in children under 5? Acute respiratory infections (ARI)?
Did it improve household water quality? Personal air quality (PM_2.5)?

Published: Kirby, Nagel et al. (2019) PLoS Medicine | Design: Nagel, Kirby et al. (2016) CCTC

52

Biased Approaches to Evaluation

Biased Approach #1: Before vs. After

Diarrhea prevalence drops from 15% before distribution to 9% after. Tempting conclusion: a 40% reduction.

Seasonality: The rainy season ended between measurements — diarrhea is seasonal
Concurrent changes: Rotavirus vaccine campaigns, economic growth, other WASH programs
Regression to mean: Starting during a disease spike means prevalence naturally returns to average

Before/after comparisons confound the intervention effect with all temporal changes.

Biased Approach #2: Users vs. Non-Users

Filter users have 40% less diarrhea — but users are more educated, practice handwashing, have better healthcare access.

This is selection bias: the comparison group is not exchangeable with the treatment group.

Rwanda Trial: Baseline diarrhea was 15.3% (intervention) vs 13.7% (control) — already different before intervention.

53

The Three Systematic Threats to Validity

Every epidemiological study faces two kinds of error. Random error shrinks with larger samples. Systematic error — bias — does not.

C — Confounding

A third variable distorts the exposure-outcome relationship because it is associated with both.

e.g., Wealthier households use filters AND have less diarrhea

S — Selection Bias

The study sample is not representative, or loss to follow-up differs systematically between groups.

e.g., Analyzing only clinic visitors misses healthy children

I — Information Bias

Systematic errors in how exposure, outcome, or covariates are measured.

e.g., Caregivers over-report filter use (social desirability)

54

Confounding, Selection Bias & Information Bias

Confounding: When a Third Variable Creates a False Association

Three criteria for a confounder: (1) associated with the exposure, (2) independently associated with the outcome, (3) not on the causal pathway.

Control methods: Design: randomization, restriction, matching. Analysis: stratification, regression, propensity scores.

Selection Bias: When Who Gets Studied Distorts Findings

Loss to follow-up: Differential attrition biases the comparison
Collider bias: Conditioning on a common effect creates spurious associations
Volunteer/self-selection: Study volunteers differ systematically from non-volunteers

Information Bias: When Measurement Is Systematically Wrong

Differential misclassification: Error differs between groups (recall bias, courtesy bias)
Non-differential misclassification: Equal error across groups — biases toward the null

55

Recognizing Bias & Direction of Effects

Stage	Critical Question	Bias Addressed
Design	Is treatment assignment independent of confounders?	Confounding
Design	Can participants be blinded to assignment?	Information
Enrollment	Is the sample representative of the target population?	Selection
Follow-Up	Is attrition differential between groups?	Selection
Measurement	Are outcomes assessed identically in all groups?	Information
Analysis	Have confounders been identified via DAG and adjusted?	Confounding

Direction of Bias

Non-differential misclassification: Toward null — dilutes the true effect
Uncontrolled confounding: Either direction — depends on the confounder
Differential misclassification: Either direction — unpredictable (recall bias, courtesy bias)

56

Study Design Hierarchy for Causal Inference

Each step down the hierarchy requires stronger assumptions to claim causation.

Randomized Controlled Trial (RCT)

Randomization creates exchangeable groups. Gold standard for causal inference.

Quasi-Experimental Designs

Exploit natural variation (DiD, RDD, ITS). No random assignment but clever design.

Cohort Study

Follow exposed/unexposed over time. Temporal ordering established.

Case-Control Study

Identify cases, select controls, look back. Efficient for rare outcomes.

Cross-sectional study: Snapshot — exposure and outcome measured simultaneously. Weakest for causation.

57

The Counterfactual & Potential Outcomes

Six months after distribution, 7-day diarrhea prevalence among children under 5 is 8.7%. Is that good or bad?

You cannot answer without knowing what the prevalence would have been without the intervention. This unobservable quantity is the counterfactual.

If counterfactual = 13%

29% reduction. Scale nationally.

If counterfactual = 8%

No effect. Don't waste resources.

The Potential Outcomes Framework (Neyman-Rubin)

For each child i: Y_i(1) = outcome if treated; Y_i(0) = outcome if not treated.

Causal effect = Y_i(1) − Y_i(0). The fundamental problem: we can only observe ONE potential outcome per child.

Average Treatment Effect (ATE) = E[Y_i(1)] − E[Y_i(0)]. Randomization makes the control group an unbiased estimate of the treated group's counterfactual.

58

Why Randomization Solves Confounding

Randomization ensures that, on average, treatment and control groups are identical in every way except the intervention — including unmeasured confounders.

What Randomization Gives You

Exchangeability (no confounding) • Unbiased ATE estimation • Valid statistical inference • Transparent, pre-specified analysis

What It Does NOT Give You

Protection from attrition bias • Protection from measurement bias • External validity / generalizability

Intention to Treat (ITT)

The ATE of being assigned to treatment, regardless of compliance. In Rwanda: the effect of living in a sector that received filters — not the effect of actually using the filter. ITT is the primary estimand for RCTs.

Rwanda Trial: Randomization at the sector level balanced observed covariates (age, sex, water source, sanitation type).

59

Individual vs. Cluster Randomization

Individual Randomization

Each person randomly assigned. Gold standard for independence. Problem: impractical for community-level interventions.

Cluster Randomization

Groups (villages, sectors) randomized. Prevents contamination. Cost: fewer independent units = less power. Requires GEE or mixed models.

Rwanda Trial: 96 sectors cluster-randomized 3:1. Sectors contain ~40 villages each. Stratified by district (7 districts). 3:1 ratio driven by programmatic goal of reaching 75% of the province in year 1.

Rwanda Trial: Two Studies in One

Sector-Level Study (Population Scale)

All ~82,000 children < 5 in 96 sectors. Data from health facility records and CHW reports.

Village-Level Sub-Study (Intensive Data)

174 village clusters, 1,582 households. 1:1 ratio via PPES sampling. Surveys, water sampling, PM_2.5 monitoring, sensors.

Design lesson: When feasible, nest an intensive sub-study within a larger pragmatic trial.

60

Randomization vs. Random Sampling

Two distinct uses of randomness serving fundamentally different purposes.

Random Sampling

Purpose: Select who enters the study
Protects: External validity (generalizability)
Threat if absent: Results may not generalize

Randomization (Random Allocation)

Purpose: Assign who receives treatment
Protects: Internal validity (causal inference)
Threat if absent: Confounding

	With Randomization	Without Randomization
With Sampling	Strongest: causal + generalizable (rare)	Generalizable, not causal (e.g., DHS)
Without Sampling	Causal, limited scope (most RCTs, e.g., Tubeho Neza)	Weakest: neither causal nor generalizable

Key insight: Most trials have randomization without random sampling — establishing causation within a specific context.

61

Stratification & Quasi-Experimental Designs

Stratified Randomization

Ensures treatment/control balance on key variables. With only 96 clusters, pure randomization could produce imbalanced groups. Stratifying by district (7 districts) ensures proportional representation. Restricted randomization: generate many allocations, reject those with poor covariate balance.

Difference-in-Differences (DiD)

Compare change over time in treated vs. untreated groups. ΔTreated − ΔControl = causal effect. Key assumption: parallel trends — both groups would have followed the same trajectory without intervention.

Other Quasi-Experimental Designs

Regression Discontinuity (RDD): Exploit an eligibility cutoff. Households just above/below the Ubudehe threshold are similar but one gets the intervention. Estimates local effect at the cutoff.
Interrupted Time Series (ITS): Track outcomes over time; look for level/slope change at intervention point. Requires long pre-intervention series (8+ data points).

62

Reliability & Validity of Measurement

Any measurement instrument must demonstrate both before its data can be trusted.

Reliability (Consistency)

A reliable instrument yields the same result repeatedly. Necessary but not sufficient — consistent measurement can still be consistently wrong.

Types: Test-retest (ICC, Pearson r) • Inter-rater (Cohen’s κ) • Intra-rater (ICC) • Internal consistency (Cronbach’s α)

Validity (Accuracy)

A valid instrument measures the construct it claims to measure. Without validity, reliability is meaningless.

Types: Content (expert panels) • Criterion (sensitivity/specificity, AUC-ROC) • Construct (CFA, convergent/discriminant)

Key Metrics

Metric	Range	Poor	Moderate	Good	Excellent
ICC	0–1	< 0.50	0.50–0.75	0.75–0.90	> 0.90
Cohen’s κ	−1 to 1	< 0.20	0.21–0.60	0.61–0.80	> 0.80
Cronbach’s α	0–1	< 0.60	0.60–0.70	0.70–0.90	> 0.90

63

How Reliability & Validity Interact

Low R, Low V

Scattered, off-center. Needs complete redesign.

High R, Low V

Tightly clustered, off-center. Most deceptive: looks precise but systematically wrong.

Low R, High V

Scattered around center. On average correct but too noisy. Increase sample size.

High R, High V

Tightly clustered on center. The gold standard.

Pitfalls: Assuming reliability implies validity • Validating in one population but applying in another • Reporting only Cronbach’s α without factor structure

64

Disease Frequency: Incidence & Prevalence

Incidence: New Cases Over Time

Cumulative incidence (risk): New cases / Population at risk at start. e.g., 50/500 = 10% over 6 months
Incidence rate: New cases / Person-time at risk. e.g., 50/250 person-years = 0.20/person-year. Accounts for variable follow-up.

Person-time: A child observed 6 months contributes 0.5 person-years.

Prevalence: Proportion with Disease

Point prevalence: Cases at one time / Total population. A snapshot mixing new and ongoing cases.
Period prevalence: Cases during a period / Population. "Has your child had diarrhea in the past 7 days?"

Prevalence = Incidence × Duration. Treatment that shortens episodes reduces prevalence without changing incidence.

Rwanda Trial: Primary outcome was 7-day period prevalence of caregiver-reported diarrhea and ARI. Repeated cross-sectional visits at ~4-month intervals.

65

Measures of Association: Relative & Absolute

Relative Measures

Rwanda Trial primary estimand: Prevalence Ratio from log-binomial GEE.

Values < 1 indicate a protective effect; > 1 indicates increased risk.

Absolute Measures

Risk/Prevalence Difference: RD = Risk(exposed) − Risk(unexposed). Rwanda diarrhea: 8.7% − 12.9% = −4.2 percentage points
Number Needed to Treat: NNT = 1/|RD|. Rwanda: 1/0.042 ≈ 24 households per case prevented in a 7-day window

Relative measures tell you how strong the effect is. Absolute measures tell you how much disease you actually prevent. Always report both.

66

Self-Report vs. Sensor & Evaluation Design

Filter use declined: 75.5% → 67.6% → 64.8%. Traditional stove use increased: 24.1% → 49.4% ("stove stacking").

Implications for Evaluation Design

Triangulate: Self-report + observation + sensor + biomarker. Avoid reliance on a single method.
Negative controls: Include outcomes that shouldn't be affected (Rwanda: toothache). If treatment reduces toothache, you have a bias problem.
Blinding matters: Unblinded trials with subjective outcomes overestimate effects. Pooled masked HWT trials found no significant effect on diarrhea.
Plan measurement validation: Budget for sensor sub-studies and spot-checks — these are essential, not extras.

The best study design in the world is undermined by poor measurement.

67

The Core Problem of Statistical Inference

We want to learn about a population, but we can only observe a sample.

Population

Parameters (fixed, unknown): True mean diarrhea prevalence across all children under 5 in Western Province, Rwanda

Sample

Statistics (variable, calculated): Observed prevalence of 8.7% from a survey of 2,000 households in 30 randomly selected villages

Key Concepts

Sampling distribution: Distribution of a statistic across all possible samples of the same size
Central Limit Theorem: Sampling distribution of the mean approaches normal as n increases
Standard error: SD of the sampling distribution — measures estimate precision. Larger samples → smaller SEs

Key intuition: Probability distributions let us quantify uncertainty — which results would be common vs. surprising given a particular assumption.

68

The Logic of Hypothesis Testing

1. Assume No Effect

H₀: Filter distribution has no effect on diarrhea prevalence.

2. Calculate Expected

Under H₀, how much would prevalence vary between groups by chance?

3. Compare Observed

You observed a 4.3 pp difference. Test statistic t = 3.7.

4. Quantify Surprise

p = 0.002. Only 0.2% of random samples would show this large a difference.

Proof by contradiction: assume no effect exists, show the data would be very unlikely under that assumption, conclude the effect is probably real.

Two Ways to Be Wrong

	Reality: H₀ TRUE	Reality: H₀ FALSE
Reject H₀	Type I Error (α) — false positive	Correct (Power = 1 − β)
Fail to reject	Correct (true negative)	Type II Error (β) — false negative

69

P-Values & Confidence Intervals

What the P-Value IS

The probability of observing data at least as extreme as your result, assuming H₀ is true. p = 0.03 means: if there were truly no effect, you'd see this result only 3% of the time.

What the P-Value is NOT

NOT the probability that H₀ is true
NOT the probability of a false positive
NOT a measure of effect size

α = 0.05 is a convention, not a law. Some fields use 0.01 or 0.10. Set before analyzing data.

Confidence Intervals: Beyond Yes/No

If you repeated your study 100 times and computed a 95% CI each time, approximately 95 of those intervals would contain the true parameter. CIs provide the range of plausible effect sizes.

70

Choosing the Right Statistical Test

Before choosing a test, answer three questions: (1) What is your outcome variable? (2) How many groups? (3) Are assumptions met?

Outcome	Scenario	Parametric	Non-Parametric
Continuous	1 sample vs. known value	One-sample t-test	Wilcoxon signed-rank
	2 independent groups	Independent t-test	Mann-Whitney U
	3+ groups	One-way ANOVA	Kruskal-Wallis
Categorical	2×2 independent	Chi-square test (Fisher’s exact if cell < 5)
	Paired before/after	McNemar’s test
	Multiple predictors	Logistic regression
Time-to-event	Estimate/compare curves	Kaplan-Meier / Log-rank test
Time-to-event	Model with covariates	Cox proportional hazards regression

Go non-parametric when: n < 30 and non-normal • Skewed data (income, costs) • Ordinal scales (Likert, staging)

71

Correlation, Regression & Survival Analysis

Correlation

Pearson’s r: Both continuous and normal. Linear relationship. e.g., income vs. water quality
Spearman’s rho: Ordinal or non-normal. Monotonic relationship. e.g., education level vs. health knowledge rank

Regression

Linear regression: Continuous outcome. Y = b₀ + b₁X + e. e.g., predict child BMI from diet and activity
Logistic regression: Binary outcome. Log-odds as linear function. e.g., predict diarrhea Y/N from WASH variables
Poisson / Negative binomial: Count outcome. Rate of events per unit time. e.g., clinic visits per month

Survival Analysis

Kaplan-Meier: Estimate survival curves. Handles censoring
Log-Rank test: Compare survival between 2+ groups
Cox regression: Model hazard with covariates. Produces hazard ratios

72

Statistical Test Decision Tree

What is your outcome variable?

Continuous

1 sample → t-test / Wilcoxon
2 groups → t-test / Mann-Whitney
3+ groups → ANOVA / Kruskal-Wallis
Paired → Paired t / Wilcoxon

Categorical

Independent → Chi-square / Fisher’s
Paired → McNemar’s test
Model → Logistic regression

Time-to-Event

Estimate → Kaplan-Meier
Compare → Log-rank test
Model → Cox regression

Quick Rules of Thumb

Continuous outcome + 2 groups → start with a t-test
Binary outcome → chi-square for comparison, logistic regression for modeling
Censored follow-up time → survival analysis (Kaplan-Meier, log-rank, Cox)
Clustered data → mixed models or GEE, not standard tests

73

Rwanda Trial: Primary Analysis in Detail

The Analytic Pipeline

Outcome: Binary (diarrhea yes/no in past 7 days)
Model: Log-binomial regression → directly estimates prevalence ratio
Clustering: GEE with exchangeable working correlation, robust (sandwich) SEs at village level
Covariates: Child age (months) and sex only — minimal adjustment since randomization handles confounding
Weights: Sampling weights to account for PPES design in village-level sub-study

Water quality improved significantly. Air quality did not — consistent with stove stacking behavior detected by sensors.

74

The Issue of Clustering

Children within the same village share water sources, sanitation, disease ecology, climate, economic conditions, CHW quality, and health facility access. Their health outcomes are correlated — not independent.

Intraclass Correlation & Design Effect

ICC = σ²(between) / [σ²(between) + σ²(within)] — proportion of variance due to between-cluster differences.

Design Effect (DEFF) = 1 + (m − 1) × ICC, where m = average cluster size. Effective sample size = n / DEFF.

Level	Outcome	ICC	Design Effect
Within-village	Diarrhea	0.02	1.18 (m=10)
Within-village	ARI	0.04	1.36 (m=10)
Sector-level	Diarrhea	—	9.5
Within-child (repeated)	Diarrhea	0.09	—

75

Ignoring Clustering & Sample Size

What Happens When You Ignore Clustering

If you analyze 5,000 children from 10 clusters as independent observations, your nominal 5% significance level may actually be 20–30%. This is one of the most common methodological errors in global health research.

Sample Size Under Clustering

The number of clusters matters more than individuals per cluster.

Clusters needed per arm: k = (Z_α + Z_β)² × DEFF × 2σ² / δ². Adding more people per cluster gives diminishing returns once m × ICC > 1.

Better to have 20 clusters of 50 than 10 clusters of 100. Always calculate cluster-adjusted sample sizes.

76

Analytic Solutions for Clustering

Cluster-Robust SEs

"Sandwich" standard errors. Inflate SEs to account for within-cluster correlation. Requires ≥30 clusters. Simplest solution.

GEE

Generalized Estimating Equations. Specify working correlation structure. Robust SEs protect against misspecification. Population-averaged. Used for Rwanda primary analysis.

Mixed Models

Random effects / multilevel models. Estimate cluster-specific parameters. Handle multiple nesting levels. Cluster-specific interpretation.

For policy questions ("what’s the average effect?"), GEE is preferred. For prediction ("what will happen in this village?"), mixed models are preferred.

Rwanda Trial: GEE with exchangeable working correlation and robust SEs, clustering at village level.

77

Missing Data Mechanisms

Rubin’s classification defines why data are missing — and determines which analytic methods are valid.

MCAR — Least Concern

Missingness unrelated to any data. Loss of power but no bias. Complete-case analysis valid.

e.g., Lab machine randomly malfunctions

MAR — Moderate Concern

Missingness depends on observed data. Biased if ignored. Recoverable with MI or ML methods.

e.g., Younger participants miss follow-up

MNAR — Greatest Concern

Missingness depends on the unobserved value itself. Biased under all standard methods. Requires sensitivity analyses.

e.g., Depressed patients don’t complete depression scale

78

Analytic Solutions for Missing Data

Method	Valid Under	How It Works
Complete-Case Analysis	MCAR	Delete cases with missing values. Simple but wasteful. Biased unless MCAR.
Single Imputation	MCAR	Replace with mean/median/LOCF. Underestimates variance. Rarely recommended.
Multiple Imputation (MI)	MAR	Create m complete datasets (20–50) with plausible imputed values. Pool results via Rubin’s rules.
Maximum Likelihood (ML)	MAR	Estimate parameters directly from observed data likelihood (EM, FIML). No imputed datasets needed.
Inverse Probability Weighting	MAR	Weight complete cases by inverse probability of being observed. Combines with MI for doubly robust estimation.
Sensitivity Analysis	MNAR	Vary assumptions about missing values. Pattern-mixture models, selection models, tipping-point analyses.

79

External Validity: Not All Water Treatment Works

Randomized trials of chlorination in Bangladesh, Kenya, and Zimbabwe found NO significant effect on childhood diarrhea.

Pathogen Profile

Chlorine is ineffective against Cryptosporidium. LifeStraw removes it through physical ultrafiltration. Intervention type matters.

Measurement & Blinding

Chlorination trials with blinded outcomes (sham chlorine) found no effect. Unblinded self-reported outcomes may overestimate effects.

The generalizability question isn’t "does water treatment work?" It’s "which treatment, against which pathogens, in which context, with what delivery model?"

Internal vs. External Validity

The Rwanda trial has strong internal validity (cluster-RCT, pre-registered, large sample). But results are specific to: Western Province, Rwanda | Poorest quartile | Rural | LifeStraw + EcoZoom | CHW-delivered with carbon credit financing.

80

Effect Modification: Why Effects Vary

Treatment effects are not universal constants. They vary across subgroups and contexts.

Biological Modifiers

Age: children < 2 more susceptible
Nutritional status
HIV/immune status
Pathogen profile: filter vs. chlorine

Contextual Modifiers

Baseline water quality
Baseline disease burden
Competing transmission routes

Implementation Modifiers

CHW network quality
Supply chain reliability
Cultural acceptability
Government support

Always report subgroup analyses and discuss which modifiers might limit transportability. Rwanda’s strong CHW system is unusual — replication elsewhere may yield smaller effects.

81

Efficacy → Effectiveness → Scale

Why Effects Shrink at Scale

Adherence: Self-reported filter use declined 75.5% → 64.8%. Sensor data: actual use ~37%.
Stove stacking: Traditional fire use increased 24.1% → 49.4%. No exclusive adoption → no air quality improvement.
Implementation variation: Hundreds of CHWs with varying training, supply chain delays, seasonal access challenges.

82

Lab Assignment

Global Burden of Disease Lab

Water quality QMRA & air quality cost-effectiveness analysis using R, GBD Compare, and HAPIT.

Open Full Lab Assignment →

Course Structure

Evan Thomas

Intro to Global HealthPart 1 - Overview

What is Global Health?

Disease & Determinants of Health

Multi-sectoral Dimension of the Determinants of Health

Governance & Armed Conflict

Climate Change, Disasters & Globalization

Intro to Global HealthPart 2 – Global Burden of Disease

Explore the Data

Child Mortality

Communicable Diseases

CD- Modes of transmission

Impact of Communicable Diseases

Malaria

Malaria Control

Malaria Geographic Distribution

Global HIV Burden

HIV/AIDS

Air Quality and Public Health

Water and Public Health

Sanitation and Public Health

Intro to Global HealthPart 3 - Examples

Water Quality & Dose-Response

Intro to Global HealthPart 3 – Epidemiology

Epidemiology — Defined

Epidemiology Key Terms

Measurement of Health Status

Solving Health Problems

Scenario: Unexplained Pneumonia

Epidemiology Study Types

Confounding

Measuring the association between exposure and outcome variables

Epidemiological Formulas

Worked Example: Odds Ratio

Expressing RRs as Percentages

When does the odds ratio approximate the risk ratio?

Surveillance & Public Health Approach

Approaches to Interventions

Behavior Change

Randomized Controlled Trial (RCT)

Rwanda Tubeho NezaWater Filter & Cookstove Programme

Programme Overview

Field & Regional Context

Programme Implementation & Monitoring

Key Results

Technology & Digital Monitoring

Water Filters

Remote Sensing Innovation

Carbon Credit Model

Scale & Lessons Learned

Key Lessons

Published Research (13 Papers)

Research Methodology

The Tubeho Neza Program

The Intervention

The Evaluation Questions

Biased Approaches to Evaluation

Biased Approach #1: Before vs. After

Biased Approach #2: Users vs. Non-Users

The Three Systematic Threats to Validity

C — Confounding

S — Selection Bias

I — Information Bias

Confounding, Selection Bias & Information Bias

Confounding: When a Third Variable Creates a False Association

Selection Bias: When Who Gets Studied Distorts Findings

Information Bias: When Measurement Is Systematically Wrong

Recognizing Bias & Direction of Effects

Direction of Bias

Study Design Hierarchy for Causal Inference

Randomized Controlled Trial (RCT)

Quasi-Experimental Designs

Cohort Study

Case-Control Study

The Counterfactual & Potential Outcomes

If counterfactual = 13%

If counterfactual = 8%

The Potential Outcomes Framework (Neyman-Rubin)

Intro to Global Health
Part 1 - Overview

Intro to Global Health
Part 2 – Global Burden of Disease

Intro to Global Health
Part 3 - Examples

Intro to Global Health
Part 3 – Epidemiology

Rwanda Tubeho Neza
Water Filter & Cookstove Programme