Orchestrating neuroimaging data processing using the 'Snakemake' workflow manager
Abstract
The processing of neuroimaging data typically involves a complicated set of operations, which often require different software packages and have intensive computational and storage demands. Although there are many options available for the neuroimaging researcher to establish their preferred set of processing operations, there is relatively little guidance on how to orchestrate such operations within a coherent data processing workflow. Here, we describe the workflow manager 'Snakemake' and provide a working example of its potential usefulness for neuroimaging data processing. Snakemake allows researchers to use a Python-based markup language to describe how input files are transformed into output files. Snakemake then uses these rules to schedule and execute the computational jobs that are required to generate the output files, without unnecessary recreation of existing output files. This allows the complete processing workflow to be executed with a single command and creates workflows that are interpretable, efficient, and reproducible.
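The scheduling idea described above, in which rules map input files to output files and only missing or outdated outputs are regenerated, can be illustrated with a minimal plain-Python sketch. This is not Snakemake's implementation or its rule syntax; the `Rule` class, the `build` and `is_outdated` functions, the file names, and the `copy_step` stand-in for a real processing tool are all hypothetical and chosen only for illustration.

```python
import os
import tempfile
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Rule:
    """Hypothetical rule: how a list of input files becomes one output file."""
    name: str
    inputs: List[str]
    output: str
    action: Callable[[List[str], str], None]


def is_outdated(rule: Rule) -> bool:
    """An output must be (re)built if it is missing or older than any input."""
    if not os.path.exists(rule.output):
        return True
    out_mtime = os.path.getmtime(rule.output)
    return any(os.path.getmtime(path) > out_mtime for path in rule.inputs)


def build(target: str, rules: Dict[str, Rule], log: List[str]) -> None:
    """Build `target`, first building its inputs, and run only the needed jobs."""
    rule = rules.get(target)
    if rule is None:
        # No rule produces this file, so it has to exist already as raw data.
        if not os.path.exists(target):
            raise FileNotFoundError(target)
        return
    for dependency in rule.inputs:
        build(dependency, rules, log)
    if is_outdated(rule):
        rule.action(rule.inputs, rule.output)
        log.append(rule.name)


def copy_step(inputs: List[str], output: str) -> None:
    """Stand-in for a real processing step: concatenate inputs into the output."""
    with open(output, "w") as out:
        for path in inputs:
            with open(path) as src:
                out.write(src.read())


def demo() -> List[List[str]]:
    """Request the final target twice and record which jobs ran each time."""
    with tempfile.TemporaryDirectory() as tmp:
        raw = os.path.join(tmp, "sub-01_raw.txt")
        stripped = os.path.join(tmp, "sub-01_stripped.txt")
        smoothed = os.path.join(tmp, "sub-01_smoothed.txt")
        with open(raw, "w") as handle:
            handle.write("voxels")
        rules = {
            stripped: Rule("skull_strip", [raw], stripped, copy_step),
            smoothed: Rule("smooth", [stripped], smoothed, copy_step),
        }
        first_run: List[str] = []
        second_run: List[str] = []
        build(smoothed, rules, first_run)
        build(smoothed, rules, second_run)
        return [first_run, second_run]


if __name__ == "__main__":
    print(demo())
```

In this sketch, the first request for the final target runs both hypothetical steps in dependency order, while the second request runs nothing because every output already exists and is no older than its inputs. A workflow manager such as Snakemake provides this behaviour, along with much more (for example, wildcards and cluster execution), without the researcher having to write the scheduling logic.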