Normalizes organism names using organisms.csv reference file. Automatically handles abbreviations (E. coli), case variations, and typos.
Usage
normalize_organism(
data,
organism_col = "organism_name",
add_organism_group = TRUE,
add_resistance_flags = TRUE
)Arguments
- data
Data frame with organism column
- organism_col
Character. Organism column name. Default "organism_name".
- add_organism_group
Logical. Add organism_group column from CSV. Default TRUE.
- add_resistance_flags
Logical. Add resistance flag columns (0/1). Default TRUE. Creates: is_MRSA, is_MSSA, is_MRCONS, is_MSCONS
Examples
if (FALSE) { # \dontrun{
data <- data.frame(organism = c("E. coli", "S. aureus", "MRSA"))
result <- normalize_organism(data, organism_col = "organism")
} # }