This is an archived version of the course and is no longer updated. Please find the latest version of the course on the main webpage.

Introduction

In this module, we will look at pandas, a Python library for data manipulation and analysis.

Pandas apparently derives from the term panel data, although it is also a word play on “Python data analysis”. In any case, it unfortunately has nothing do with the cute creature below 🐼

Panda

Image by Sharon Ang from Pixabay

Pandas offers powerful and flexible data structures that make it easy to manipulate and analyse data. It is used to clean, transform, and analyse data

Pandas is built on top of NumPy.

Again, this will only be a high-level introduction, just enough to get you started. We will not cover all the features of pandas.

To import pandas:

import pandas as pd

I will assume this import and use pd throughout the module.