How to do it...

Let's take a look at the following steps:

  1. We will start by inspecting the description of all of the the sequences on the reference genome FASTA file:
from Bio import SeqIOgenome_name = 'PlasmoDB-9.3_Pfalciparum3D7_Genome.fasta'recs = SeqIO.parse(genome_name, 'fasta')for rec in recs:    print(rec.description)

This code should look familiar from the previous chapter, Chapter 2Next-Generation Sequencing. Let's take a look at part of the output:

Different genome references will have different description lines, but they will generally have important information. In this example, you can see that we have chromosomes, mitochondria, ...

Get Bioinformatics with Python Cookbook - Second Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.