There's more…

The preceding recipe was written to run on a minimal Airflow configuration: the default SQLite database, which implies sequential execution (only one task at a time). The explicit dependency of the FS sensor on the download is there only to avoid a race condition under this basic, sequential executor. With other executors, which require a different backend database, the sub-samplings could run in parallel.
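To make the difference between sequential and parallel execution concrete, here is a minimal sketch that uses only the Python standard library. It is not Airflow code and does not reproduce the recipe's DAG: `download`, `subsample`, and `run_pipeline` are hypothetical stand-ins, and the thread pool only imitates what a parallel executor would do with tasks that are independent of each other.

```python
from concurrent.futures import ThreadPoolExecutor


def download():
    # Hypothetical stand-in for the recipe's download task: fake records.
    return list(range(100))


def subsample(records, step):
    # Hypothetical stand-in for one sub-sampling task:
    # keep every `step`-th record.
    return records[::step]


def run_pipeline(max_workers):
    # The download has to finish before any sub-sampling starts.
    records = download()
    steps = [2, 5, 10]
    # The sub-samplings do not depend on each other, so they can be
    # submitted to a pool. max_workers=1 mimics sequential execution,
    # a larger value lets them overlap.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(subsample, records, s) for s in steps]
        return [len(f.result()) for f in futures]


sequential = run_pipeline(max_workers=1)
parallel = run_pipeline(max_workers=3)
```

Both calls return the same sub-sample sizes. Only the scheduling changes, which is the point made above: switching the executor affects how many tasks run at once, not what they compute.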
