Engineer production-grade big data platforms for the life sciences. This module combines HPC scheduling, distributed compute frameworks, and cloud-native storage and processing so that you can handle terabyte-scale omics, imaging, and electronic health record (EHR) data. You will build secure, cost-aware pipelines with Spark, Dask, and Ray on top of the Parquet file format and the Delta Lake and Apache Iceberg table formats, with reproducible infrastructure as code (IaC) and monitoring.