Back from the 2026 Databricks Data + AI Summit
A recap of the Databricks Data+AI Summit 2026 in San Francisco, covering the keynote announcements from Genie ONE and the AI Gateway to Reyden and...
Read more βETL pipelines, data processing, infrastructure, analytics, visualization, and insights discovery
A recap of the Databricks Data+AI Summit 2026 in San Francisco, covering the keynote announcements from Genie ONE and the AI Gateway to Reyden and...
Read more β
A practical walkthrough of Python packaging: project layout, pyproject.toml, publishing to PyPI, testing with pytest, documentation with Sphinx, and CI/CD with GitHub Actions illustrated through...
Read more β
After years of building in-house ML platforms, we migrated to Databricks in December 2024. This handbook shares practical tips and tricks for working with the...
Read more β
I recently decided to experiment with Docker containers to build standalone applications to optimize the operation flow of my different data/scraper pipelines. I have limited...
Read more β
Hello, in this article, I will give you a quick tour of a project that I recently resurrected from the dead to collect the French...
Read more β
I recently started to prototype an image classifier at work, and this work led me to the fastai package that I had in my backload...
Read more β
Hello readers, I wanted a long time to write an article on an AWS service that I am using in my daily job called EMR....
Read more β
In this article, I am going to present a pipeline that I built a few weeks ago to collect data (text and pictures) from the...
Read more β
Hello, in this article, I am going to detail a dataset that I built a few weeks ago on the game Hearthstone.
Read more β
For this article, I am going to start the analysis of the data extracted with the pipeline explained on this article. The goal of this...
Read more β
I started this project in echo of the Kaggle competition related to PUBG, where the goal was to predict the player rank in the match,...
Read more β
Learn how to build a web scraping system to collect and analyze Crossfit Open data, including athlete profiles, gym information, and performance metrics from the...
Read more β
Learn how to build an interactive dashboard using Dash (Plotly) to visualize personal fitness and health data from Nokia devices, Strava, and Crossfit sessions.
Read more β
Hello reader, in this article I will explain my approach to deploy a chatbot in Python on the Messenger platform.
Read more β