Introduction to the Python pandas package

The Python pandas package is a widely used tool for data preprocessing, a step that is essential for data analysis. There is a humorous way of describing the importance of data cleaning: "A data scientist spends 80% of their time cleaning the data and the other 20% complaining about cleaning the data". To test whether the package is installed, we can type import pandas as pd after launching Python. If no error message appears, the package is already installed. If we see an error, we can run conda install pandas to install it. To find all available functions, we can pass the imported module to Python's built-in dir() function, which returns a list of the names the package exposes.
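Both steps, the installation check and the listing of available names, can be sketched as follows. This is a minimal illustration rather than a definitive listing: it uses importlib.util.find_spec in place of typing the import interactively, and the variable names pandas_installed, names, and public_names are assumptions chosen for this example.

```python
import importlib.util

# find_spec returns None when a top-level package cannot be located,
# so this check does not raise if pandas is missing.
pandas_installed = importlib.util.find_spec("pandas") is not None

if pandas_installed:
    import pandas as pd

    # dir() returns the names defined in the pandas namespace.
    names = dir(pd)
    # Names starting with an underscore are conventionally private.
    public_names = [n for n in names if not n.startswith("_")]
    print(len(public_names), "public names, for example:", public_names[:10])
else:
    names = []
    print("pandas is not installed; try: conda install pandas")
```

Filtering out names that start with an underscore keeps the output focused on the functions and classes intended for everyday use, such as DataFrame and read_csv.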

To find out about ...
