Pandas

Sprint

Go to NumFOCUS academy page.

Pandas img

pandas is a Python package that provides fast, flexible, and expressive data structures designed to make working with structured (tabular, multidimensional, potentially heterogeneous) and time series data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, real world data analysis in Python. Additionally, it has the broader goal of becoming the most powerful and flexible open source data analysis / manipulation tool available in any language. It is already well on its way toward this goal.

Sprint leader

Marco Gorelli

Marco is a Data Scientist at the Samsung R&D Institute UK. Outside of work, he is a maintainer of pandas (data wrangling platform for Python widely adopted in the scientific computing community) and co-author of nbQA (tool and pre-commit hook to run any standard Python code quality tool on a Jupyter Notebook). He holds an MSc in Mathematics and Foundations of Computer Science from the University of Oxford.