Keynotes
Talks
-
Accelerating Differential Equations in R and Python using Julia's SciML Ecosystem, by Chris Rackauckas
-
An introduction to DataFrames.jl for pandas users, by Bogumił Kamiński
-
Autonomous Vehicles See More With Thermal Imaging: Multi-modal thin cross section Object Detection, by Laisha Wadhwa
-
Basic Pitfalls in Waveform Analysis, by Yukio Okuda
-
Bayesian Decision Science: A framework for making data informed decisions under uncertainty, by Ravin Kumar
-
Better Code for Data Science, by Alexander CS Hendorf
-
Building Large-Scale Multilingual Fuzzy Matching Framework, by Abdulrahman Althobaiti
-
Building a Successful Data Science Team, by Justin J. Nguyen
-
Building fairer models for finance, by Andrew Weeks
-
Building one (multi-task) model to rule them all!, by Nicole Carlson and Michael Sugimura
-
COVID-19 Visualizations, the Good, the Bad and the Malicious, by Rongpeng Li
-
Cardinal: A metrics based Active Learning framework, by Alexandre Abraham
-
Climate Change: analyzing remote sensing data with Python, by Luis Lopez
-
Complex Network Analysis with NetworkX, by K. Jarrod Millman
-
Computational Social Science with Python, and how Open Source transforms Academia and Research, by Bhargav Srinivasa Desikan
-
Data processing pipelines for Small Big Data, by Esteban J. G. Gabancho and Anthony Franklin, PhD
-
DevOps for science: using continuous integration for rigorous and reproducible analysis, by Elle O'Brien
-
Dirty Data science: machine-learning on non-curated data, by Gaël Varoquaux
-
Ensemble-X: Your personal strataGEM to build Ensembled Deep Learning Models for Medical Imaging, by Dipam Paul and Alankrita Tewari
-
Entity matching at scale, by Lorraine D'almeida
-
Feature drift monitoring as a service for machine learning models at scale, by Keira Zhou and Noriaki Tatsumi
-
FlyBrainLab: An Interactive Open Computing Platform for Exploring the Drosophila Brain, by Mehmet Kerem Turkcan, Aurel A. Lazar and Yiyin Zhou
-
Games, Algorithms, and Social Good, by Manojit Nandi
-
Geometric and statistical methods in systems biology: the case of metabolic networks, by Haris Zafeiropoulos and Apostolos Chalkis
-
Growing Machine Learning Platforms in the Enterprise, by Hussain Sultan and Ben Lindquist
-
Hosting Dask: Challenges and Opportunities, by Matthew Rocklin
-
How to guarantee your machine learning model will fail on first contact with the real world., by Jesper Dramsch
-
How to review a model, by Andy R. Terrel
-
Inventing Curriculum using Python and spaCy, by Gajendra Deshpande
-
Is a neural network better than Ash at detecting Team Rocket? If so, how?, by Juan De Dios Santos
-
Leveraging python and open-source for data-science on the buy-side., by James Munro
-
ML-Based Time Series Regression: 10 concepts we learned from Demand Forecasting, by Felix Wick
-
Meditations on First Deployment: A Practical Guide to Responsible Data Science & Engineering, by Alejandro Saucedo
-
Modelling the extreme using quantile regression, by Massimiliano Ungheretti
-
Modern Time Series Analysis with STUMPY, by Sean Law
-
Monitoring machine learning models in production, by Arnaud Van Looveren
-
Open Source Fairness, by Aileen Nielsen
-
Opening the Black Box, by Ben Fowler and Chelsey Kate Meise
-
Ordinary viDeogame Equations: Winning games with PyMC3, sundials and numba, by Adrian Seyboldt
-
Parallel processing in Python: The current landscape, by Aaron Richter
-
Quickly deploying explainable AI dashboards, by Oege Dijk
-
Responsible ML in Production, by Catherine Nelson and Hannes Hapke
-
Rethinking Software Testing for Data Science, by Eduardo Blancas
-
Safe, Fair and Ethical AI - A Practical Framework, by Tariq Rashid
-
Scalable cross-filtering dashboards with Panel, HoloViews and hvPlot, by Philipp Rudiger and James A. Bednar
-
Separation of ~concerns~ scales in software, by Thomas A Caswell
-
Skinny Pandas Riding on a Rocket, by Ian Ozsvald (PyDataLondon)
-
Snap ML: Accelerated, Accurate, Efficient Machine Learning, by Haris Pozidis and Thomas Parnell
-
Speed Up Your Data Processing: Parallel and Asynchronous Programming in Data Science, by Chin Hwee Ong
-
Streamlit: The Fastest Way to build Data Apps, by Steven Kolawole
-
Supercharge Scientific Computing in Python with Numba, by Ankit Mahato
-
Taking Care of Parameters So You Don’t Have to with ParamTools, by Hank Doupe
-
Taking a Close Look in the Mirror: Data Literacy for Data Experts, by Laura J Ludwig
-
The Big Benefits of Small Data, by Christopher Lozinski
-
Thrifty Machine Learning, by Rebecca Bilbro
-
TimeSeries Forecasting with ML Algorithms and there comparisons, by Sonam Pankaj
-
Transformation from Research Oriented Code into Machine Learning APIs with Python, by Tetsuya Jesse Hirata
-
Uncertainty Quantification for Online Learning via Hierarchical Incremental Gradient Descent, by Vihan Singh
-
Uncertainty Quantification in Neural Networks with Keras, by Matias Valdenegro-Toro
-
Using Algorithm X to re-analyse the last UK general election, by Alex Glaser
-
Using EOLearn to build a machine learning pipeline to detect plastics in the ocean., by Stuart Lynn
-
Visions: An Open-Source Library for Semantic Data, by Ian Eaves and Simon Brugman
-
Visual data: abundant, relevant, labelled, cheap. Pick two?, by Irina Vidal Migallon
-
What Lies in Word Embeddings, by Vincent D. Warmerdam
-
What cyber security can teach us about COVID-19 testing, by Hagit Grushka - Cohen
-
What's new in pandas?, by Joris Van den Bossche and Tom Augspurger
-
When features go missing, Bayes’ comes to the rescue, by Narendra Mukherjee
-
Why I didn’t use deep learning for my image recognition problem, by Liucija Latanauskaite
-
ipywidgets for Education! Using Jupyter tools to make Math Visualization applets for the classroom, by Chiin-Rui Tan
-
pandas.(to/from)_sql is simple but not fast, by Uwe Korn
-
pyodide: scientific Python compiled to WebAssembly, by Roman Yurchak
Tutorials
-
A Gentle Introduction to Multi-Objective Optimisation, by Eyal Kazin
-
Beautiful (ML) Data: Patterns & Best Practice for effective Data solutions with PyTorch, by Valerio Maggio
-
Computer shows why: Visualizing deep learning for fun and profit, by Eyal Gruss
-
Exploratory Data Analysis with Pandas and Matplotlib, by Allen Downey
-
From 0 to Virtual Assistant (now with Human Handoff!), by Karen Palacio, Florencia Alonso and Brandon Janes
-
Humble Data - Beginners Workshop for Minorities, by Cheuk Ting Ho, Sandrine Pataut and Dani Papamaximou
-
Mine your own data - Analyze your Facebook Timeline, by Isabel Yepes
-
Panel: Dashboards for PyData, by James A. Bednar
-
Probability Calibration: Latest Techniques, by Brian Lucena
-
Scaling Up Your Data Work With Dask, by James Bourbeau and Hugo Bowne-Anderson
-
Solving large-scale inverse problems in Python with PyLops, by Matteo Ravasi, Ivan Vasconcelos and David Vargas
-
Why and What If – Causal Analysis for Everyone, by Bruno Gonçalves
Short Talks
-
A Unified API Wrapper to Simplify Web Data Collection, by Pei Wang and Weiyuan Wu
-
A crash-update to lifelines, by Cameron Davidson-Pilon
-
Accelerating Text Processing With RAPIDS, by Vibhu Jawa
-
Asynchronous fsspec file operations, by Martin Durant
-
Creating a data-driven culture: a social perspective, by Jordi Contestí
-
Crowdsource a Distributed Organizations Data Model, by Christopher Lozinski
-
Data Visualization & Storytelling, by Jose Berengueres
-
Enquiry-Based Learning for Science and Engineering utilizing Bokeh, by Raghuram Thiagarajan, Anna Moragne, Brian Lucas and Srinivas Rangarajan
-
Gaussian Process Fitting: let the data guide you!, by Tomás Müller
-
Indian Sign Language Recognition(ISLAR), by Akshay Bahadur
-
Learning from your (model’s) mistakes, by Simona Maggio
-
Lessons from a Nuclear Core Loading Quantum Algorithm Study, by Colleen M. Farrelly and Joseph Fustero
-
Matrix Profile API: A novel cross language time-series mining library, by Tyler Marrs and Andrew Van Benschoten
-
NLP in Spanish, alternatives and challenges, by Isabel Yepes
-
Python, Let's Go Home. Quickly., by Miroslav Šedivý
-
Pythons in Python: Wildlife Trade Data Analysis Using Python, by Anne Devan-Song and Lee Tirrell
-
Rapidly emulating professional visualizations from New York Times in Python using Altair, by Shantam Raj
-
Sampling from (truncated) high-dimensional logconcave densities with VolEsti (GeomScale Project), by Marios Papachristou
-
Ten Ways to Fizz Buzz, by Joel Grus
-
Turn your notebook into a LaTeX-article with TexBook, by Valerio Maggio
-
UBI Center: A think tank built on GitHub, Python, and Jupyter, by Max Ghenis
-
Using Dominance Analysis for accurate and intuitive feature importance, by Shashank Shekhar
-
nbreproduce: Jupyter notebooks in reproducible environments, by Mridul Seth
Posters
Sprints
Extracurriculars
Sponsor Events