In this chapter, we will cover the following recipes:
- Using high-performance data formats – HDF5
- Doing parallel computing with Dask
- Using high-performance data formats – Parquet
- Computing sequencing statistics using Spark
- Optimizing code with Cython and Numba