Most Recent Posts
2024
September 2024
Modern Python Packaging with uv - 8 September 2024
August 2024
Learnings from migrating Pandas to Polars - 2 August 2024
June 2024
The Beauty of PEP 515: Underscores in Numeric Literals - 20 June 2024
A decade in Data Science: the rise and fall of big data - 13 June 2024
2023
July 2023
Dependency injection in Python with gin-config - 28 July 2023
June 2023
A Plotly Theme Party 🎉 - 1 June 2023
March 2023
An introduction to Support Vector Machines - 13 March 2023
2022
March 2022
The log10 of 0 is over 9000… right? - 30 March 2022
2017
December 2017
Using AWS Lambda and Slack to have fun while saving on EMR costs - 4 December 2017
2016
September 2016
Helping our new Data Scientists start in Python: A guide to learning by doing - 30 September 2016
April 2016
Upload your local Spark script to an AWS EMR cluster using a simple Python script - 25 April 2016
February 2016
A recommendation system for blogs: Content-based similarity (2) - 11 February 2016
2015
November 2015
A recommendation system for blogs: Setting up the prerequisites (1) - 19 November 2015
September 2015
Slashception with regexp_extract in Hive - 30 September 2015
The GAM approach to spend your money more efficiently! - 15 September 2015
April 2015
Optimizing media spends using S-response curves - 30 April 2015
2014
July 2014
Sentiment analysis using Support Vector Machines - 14 July 2014