Skip to contents

Computes the non-fatal pathogen distribution for a given infectious syndrome (L) using facility-level microbiology data. This quantity represents the fractional contribution of each pathogen (k) to non-fatal infection cases, and is used in YLD estimation.

Usage

calculate_P_Lk_prime(
  data,
  syndrome_col,
  syndrome_name,
  specimen_col = NULL,
  specimen_name = NULL,
  polymicrobial_col,
  patient_col,
  pathogen_col,
  outcome_col,
  discharged_value = "Discharged",
  glass_ref = NULL,
  facility_col = NULL,
  facility_name = NULL,
  pathogen_name = NULL
)

Arguments

data

Data frame of facility-level microbiology records.

syndrome_col

Character. Column containing infectious syndrome labels (L).

syndrome_name

Character. Syndrome to analyse.

specimen_col

Character. Column containing specimen type.

specimen_name

Character. Specimen to restrict to (e.g., "Blood").

polymicrobial_col

Character. Column flagging polymicrobial patients (1 = polymicrobial, 0 = monomicrobial).

patient_col

Character. Unique patient identifier column.

pathogen_col

Character. Pathogen (organism) column (k).

outcome_col

Character. Final patient outcome column.

discharged_value

Character. Value indicating non-fatal discharge. Default "Discharged".

glass_ref

Character vector of valid pathogen names, or a data frame with columns specimen and pathogen. Applied to polymicrobial patients only. NULL skips GLASS filtering.

facility_col

Character or NULL. Facility identifier column. When provided without facility_name, returns both facility-level and pooled P'LK.

facility_name

Character or NULL. Restricts to a single facility.

pathogen_name

Character vector or NULL. Filter to specific pathogen(s).

Value

A list:

P_Lk_prime

Pooled P'LK data frame: pathogen_col, N_NF_LK, N_NF_L, P_Lk_prime.

facility_level

Per-facility P'LK (only when facility_col is supplied and facility_name is NULL).

Details

The unit of analysis is the patient. Each patient contributes total weight 1, distributed equally across their valid pathogens. For a patient with \(m_r\) valid pathogens, each pathogen receives weight \(1/m_r\).

For polymicrobial patients (polymicrobial_col == 1), only pathogens listed in the GLASS reference (glass_ref) for the given specimen type are retained before weighting. Monomicrobial patients (polymicrobial_col == 0) are never filtered.

The pooled formula across facilities is: $$P'_{LK}^{\text{pooled}} = \frac{\sum_f N^{NF}_{f,L,K}}{\sum_f N^{NF}_{f,L}}$$