There's more...

There is a lot more to be discovered in Dask, for example, support for pandas' DataFrames, and more ad hoc structures, in addition to the NumPy arrays that we used here. You are strongly encouraged to read a bit about scheduling. Here, we used the multi-processing scheduler, but a lot of Dask's flexibility comes from the fact that schedulers can be swapped while maintaining the analysis code.

Get Bioinformatics with Python Cookbook - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.