Talk Release Schedule

Talk recordings will be released at the start of each day following the schedule below.

November 11th

Quickly deploying explainable AI dashboards, by Oege Dijk
Is a neural network better than Ash at detecting Team Rocket? If so, how?, by Juan De Dios Santos
TimeSeries Forecasting with ML Algorithms and there comparisons, by Sonam Pankaj
Visions: An Open-Source Library for Semantic Data, by Ian Eaves and Simon Brugman
Autonomous Vehicles See More With Thermal Imaging: Multi-modal thin cross section Object Detection, by Laisha Wadhwa
Accelerating Differential Equations in R and Python using Julia’s SciML Ecosystem, by Chris Rackauckas
An introduction to DataFrames.jl for pandas users, by Bogumił Kamiński
Why I didn’t use deep learning for my image recognition problem, by Liucija Latanauskaite
Feature drift monitoring as a service for machine learning models at scale, by Keira Zhou and Noriaki Tatsumi
DevOps for science: using continuous integration for rigorous and reproducible analysis, by Elle O’Brien
Skinny Pandas Riding on a Rocket, by Ian Ozsvald (PyDataLondon)
Using Algorithm X to re-analyse the last UK general election, by Alex Glaser
Taking Care of Parameters So You Don’t Have to with ParamTools, by Hank Doupe
FlyBrainLab: An Interactive Open Computing Platform for Exploring the Drosophila Brain, by Mehmet Kerem Turkcan, Aurel A. Lazar and Yiyin Zhou
Ensemble-X: Your personal strataGEM to build Ensembled Deep Learning Models for Medical Imaging, by Dipam Paul and Alankrita Tewari
Enquiry-Based Learning for Science and Engineering utilizing Bokeh, by Raghuram Thiagarajan, Anna Moragne, Brian Lucas and Srinivas Rangarajan
Rapidly emulating professional visualizations from New York Times in Python using Altair, by Shantam Raj
Using Dominance Analysis for accurate and intuitive feature importance, by Shashank Shekhar
Accelerating Text Processing With RAPIDS, by Vibhu Jawa

November 12th

ipywidgets for Education! Using Jupyter tools to make Math Visualization applets for the classroom, by Chiin-Rui Tan
COVID-19 Visualizations, the Good, the Bad and the Malicious, by Rongpeng Li
Opening the Black Box, by Ben Fowler and Chelsey Kate Meise
Uncertainty Quantification for Online Learning via Hierarchical Incremental Gradient Descent, by Vihan Singh
What cyber security can teach us about COVID-19 testing, by Hagit Grushka - Cohen
ML-Based Time Series Regression: 10 concepts we learned from Demand Forecasting, by Felix Wick
Basic Pitfalls in Waveform Analysis, by Yukio Okuda
Entity matching at scale, by Lorraine D’almeida
Building a Successful Data Science Team, by Justin J. Nguyen
The Big Benefits of Small Data, by Christopher Lozinski
Monitoring machine learning models in production, by Arnaud Van Looveren
Better Code for Data Science, by Alexander CS Hendorf
Thrifty Machine Learning, by Rebecca Bilbro
Leveraging python and open-source for data-science on the buy-side., by James Munro
Sampling from (truncated) high-dimensional logconcave densities with VolEsti (GeomScale Project), by Marios Papachristou
UBI Center: A think tank built on GitHub, Python, and Jupyter, by Max Ghenis
Python, Let’s Go Home. Quickly., by Miroslav Šedivý
Matrix Profile API: A novel cross language time-series mining library, by Tyler Marrs and Andrew Van Benschoten

November 13th

Climate Change: analyzing remote sensing data with Python, by Luis Lopez
Using EOLearn to build a machine learning pipeline to detect plastics in the ocean., by Stuart Lynn
Cardinal: A metrics based Active Learning framework, by Alexandre Abraham
Streamlit: The Fastest Way to build Data Apps, by Steven Kolawole
Data processing pipelines for Small Big Data, by Esteban J. G. Gabancho and Anthony Franklin, PhD
Transformation from Research Oriented Code into Machine Learning APIs with Python, by Tetsuya Jesse Hirata
How to review a model, by Andy R. Terrel
Speed Up Your Data Processing: Parallel and Asynchronous Programming in Data Science, by Chin Hwee Ong
Snap ML: Accelerated, Accurate, Efficient Machine Learning, by Haris Pozidis and Thomas Parnell
Parallel processing in Python: The current landscape, by Aaron Richter
Visual data: abundant, relevant, labelled, cheap. Pick two?, by Irina Vidal Migallon
pandas.(to/from)_sql is simple but not fast, by Uwe Korn
pyodide: scientific Python compiled to WebAssembly, by Roman Yurchak
Dirty Data science: machine-learning on non-curated data, by Gaël Varoquaux
What’s new in pandas?, by Joris Van den Bossche and Tom Augspurger
Growing Machine Learning Platforms in the Enterprise, by Hussain Sultan and Ben Lindquist
NLP in Spanish, alternatives and challenges, by Isabel Yepes
Asynchronous fsspec file operations, by Martin Durant
A Unified API Wrapper to Simplify Web Data Collection, by Pei Wang and Weiyuan Wu
Learning from your (model’s) mistakes, by Simona Maggio

November 14th

Taking a Close Look in the Mirror: Data Literacy for Data Experts, by Laura J Ludwig
Complex Network Analysis with NetworkX, by K. Jarrod Millman
Separation of ~concerns~ scales in software, by Thomas A Caswell
Computational Social Science with Python, and how Open Source transforms Academia and Research, by Bhargav Srinivasa Desikan
Scalable cross-filtering dashboards with Panel, HoloViews and hvPlot, by Philipp Rudiger and James A. Bednar
Building Large-Scale Multilingual Fuzzy Matching Framework, by Abdulrahman Althobaiti
Safe, Fair and Ethical AI - A Practical Framework, by Tariq Rashid
Meditations on First Deployment: A Practical Guide to Responsible Data Science & Engineering, by Alejandro Saucedo
Responsible ML in Production, by Catherine Nelson and Hannes Hapke
Geometric and statistical methods in systems biology: the case of metabolic networks, by Haris Zafeiropoulos and Apostolos Chalkis
When features go missing, Bayes’ comes to the rescue, by Narendra Mukherjee
Uncertainty Quantification in Neural Networks with Keras, by Matias Valdenegro-Toro
Bayesian Decision Science: A framework for making data informed decisions under uncertainty, by Ravin Kumar
Modelling the extreme using quantile regression, by Massimiliano Ungheretti
Modern Time Series Analysis with STUMPY, by Sean Law
A crash-update to lifelines, by Cameron Davidson-Pilon
Ten Ways to Fizz Buzz, by Joel Grus
Data Visualization & Storytelling, by Jose Berengueres
nbreproduce: Jupyter notebooks in reproducible environments, by Mridul Seth
Gaussian Process Fitting: let the data guide you!, by Tomás Müller

November 15th

Scalable cross-filtering dashboards with Panel, HoloViews and hvPlot, by Philipp Rudiger and James A. Bednar
Supercharge Scientific Computing in Python with Numba, by Ankit Mahato
Inventing Curriculum using Python and spaCy, by Gajendra Deshpande
How to guarantee your machine learning model will fail on first contact with the real world., by Jesper Dramsch
Rethinking Software Testing for Data Science, by Eduardo Blancas
Building one (multi-task) model to rule them all!, by Nicole Carlson and Michael Sugimura
Hosting Dask: Challenges and Opportunities, by Matthew Rocklin
What Lies in Word Embeddings, by Vincent D. Warmerdam
Building fairer models for finance, by Andrew Weeks
Games, Algorithms, and Social Good, by Manojit Nandi
Open Source Fairness, by Aileen Nielsen
Indian Sign Language Recognition(ISLAR), by Akshay Bahadur
Pythons in Python: Wildlife Trade Data Analysis Using Python, by Anne Devan-Song and Lee Tirrell
Crowdsource a Distributed Organizations Data Model, by Christopher Lozinski
Lessons from a Nuclear Core Loading Quantum Algorithm Study, by Colleen M. Farrelly and Joseph Fustero
Creating a data-driven culture: a social perspective, by Jordi Contestí