Learn how to use modern Python bioinformatics libraries and applications to do cutting-edge research in computational biology
If you are either a computational biologist or a Python programmer, you will probably relate to the expression "explosive growth, exciting times". Python is arguably the main programming language for big data, and the deluge of data in biology, mostly from genomics and proteomics, makes bioinformatics one of the most exciting fields in data science.
Using the hands-on recipes in this book, you'll be able to do practical research and analysis in computational biology with Python. We cover modern, next-generation sequencing libraries and explore real-world examples on how to handle real data. The main focus of the book is the practical application of bioinformatics, but we also cover modern programming techniques and frameworks to deal with the ever increasing deluge of bioinformatics data.
What You Will Learn
Gain a deep understanding of Python's fundamental bioinformatics libraries and be exposed to the most important data science tools in Python
Process genome-wide data with Biopython
Analyze and perform quality control on next-generation sequencing datasets using libraries such as PyVCF or PySAM
Use DendroPy and Biopython for phylogenetic analysis
Perform population genetics analysis on large datasets
Simulate complex demographies and genomic features with simuPOP
Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.