Tools for processing rabies metadata for REDCap upload
rabvRedcapProcessing
is an R package designed to streamline the cleaning, validation, and formatting of rabies lab metadata before uploading to REDCap. It automates common curation tasks including:
This ensures data consistency across multiple labs and minimizes manual formatting errors.
The typical workflow is illustrated below:
Function | Description | Input | Output |
---|---|---|---|
read_data() |
Load and summarize input metadata | CSV file | Data frame + duplicates report |
read_and_parse_dict() |
Load REDCap dictionary and extract coded lists | dictPath | List of mini dictionaries |
compare_cols_to_dict() |
Check for missing or extra columns vs REDCap dictionary | Data + dict | Harmonized data frame |
scan_mismatched_levels() |
Scan for values that don’t match allowed dictionary codes | Data + dicts | Warnings |
tidy_up_values() |
Fix common value mismatches (e.g., typos, synonyms) | Data | Cleaned data |
create_final_diagnosis() |
Derive final diagnostic result from test columns | Data | updated diagnostic_result (df) |
recode_data() |
Recode labels to dictionary codes | Data + dicts | Recoded data frame |
final_processing() |
Final formatting & generation of REDCap-ready forms | Data + dict | Diagnostic + Sequencing forms |
Install directly from GitHub using devtools
:
# install.packages("devtools") # if not already installed
devtools::install_github("RAGE-toolkit/rabvRedcapProcessing")
library(rabvRedcapProcessing)
Please report issues or suggest improvements via GitHub:
URL: https://github.com/RAGE-toolkit/rabvRedcapProcessing
BugReports: https://github.com/RAGE-toolkit/rabvRedcapProcessing/issues
Martha M. Luka, Kirstyn Brunker