It is a capital mistake to theorize before one has data.

Sir Arthur Conan Doyle, British writer

Chapter 5Working with Tabular Numeric Data

Often raw data comes from all kinds of text documents. Quite often the text actually represents numbers. Excel and CSV spreadsheets and especially database tables may contain millions or billions of numerical records. Core Python is an excellent text-processing tool, but it sometimes fails to deliver adequate numeric performance. That’s where numpy comes to the rescue.

NumPy—Numeric Python (imported as numpy)—is an interface to a family of efficient and parallelizable functions that implement high-performance numerical operations. The module numpy provides a new Python data structure—array—and a toolbox ...

Get Data Science Essentials in Python now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.