Writing Reproducible Scripts for Data Extraction and Preprocessing