This article is aimed at showing how to replicate some common Excel tasks using Python and the Pandas library. The point and click interface of Excel means the learning curve is somewhat less

steep than it is for using Pandas – however once a certain level of proficiency is met when using Pandas, the possibilities presented are far greater than you would find in Excel (at least that’s what I believe).

I have downloaded some test data from Microsoft in “.xlsx” format which can be downloaded below if you wish to follow along.

Let us first read our data into a Pandas DataFrame and visualise the first 5 rows of data, just to see what we are playing with.