Preface

This book assumes nothing, unlike many earlier big data (Spark and Hadoop) books, which are often shrouded in complexity and assume years of prior experience. I don’t assume that you are a seasoned software engineer with years of Java experience, a big data practitioner with extensive experience in Hadoop and related open source projects, or an experienced data scientist.

By the same token, you will not find this book patronizing or an insult to your intelligence. The only prerequisite for this book is that you are “comfortable” with Python. Spark includes several application programming interfaces (APIs). The Python API was selected as the ...

