Back from the 2026 Databricks Data + AI Summit
A recap of the Databricks Data+AI Summit 2026 in San Francisco, covering the keynote announcements from Genie ONE and the AI Gateway to Reyden and...
Read more →
Machine learning, AI systems, MLOps, model deployment, and AI best practices
A practical walkthrough of Python packaging: project layout, pyproject.toml, publishing to PyPI, testing with pytest, documentation with Sphinx, and CI/CD with GitHub Actions illustrated through...
Read more →
A practical walkthrough of text embeddings from fundamentals to production, covering model selection, vector storage, similarity metrics, and validation through a real cold-start content...
Read more →
I spent a weekend rebuilding my Jekyll blog using Claude Code and vibe coding. The result? A functional website in days instead of months. But...
Read more →
Building a French TV show dataset using LLMs for HTML parsing, comparing zero-shot and few-shot prompting, and exploring fine-tuning for data extraction.
Read more →
Building a machine learning playground for the Suika Game using physics simulation, creating baseline agents, and setting up an experiment framework.
Read more →
Experimenting with OpenAI's Whisper to transcribe French podcasts, comparing different deployment strategies, and benchmarking costs and performance.
Read more →
Tackle the cold start problem in recommender systems using transformers. Build a Marvel Snap deck recommendation system handling new cards with embedding techniques.
Read more →
For a few months, I had wanted to test DVC, a versioning toolkit for ML projects built by Iterative. I tried it a bit at...
Read more →
Recently I heard about a package developed by Facebook Research (Meta Research!?) called KATS, released by Facebook's Infrastructure Data Science team at the end of last...
Read more →
For a long time I had wanted to participate thoughtfully in a Kaggle competition (I think I made some attempts a few years ago but...
Read more →
For a few weeks I have wanted to write about the ML/DS libraries on my backlog of things to try. One article per library...
Read more →
This article gives an overview of the AWS SageMaker service. The idea is to see, from my DS perspective, how...
Read more →
This article is part of my annual dive into R; the idea is to use two R libraries for time-series forecasting and causal...
Read more →
In this article, I present some findings from my exploration of TensorFlow. The idea is to use TensorFlow to build...
Read more →
In this article, I am going to illustrate some of the work on music information that I have been doing over the past few weeks, applied to...
Read more →
For this article, I am going to describe my hands-on experience with a new library recently open-sourced by Netflix to operate and version machine...
Read more →
The version used for this article is mlflow 1.4.0
Read more →
Hello, the season of the Open starts again this year (once again!?), so I am writing this article to:
Read more →
Since I published the article on the London smart meters and the possible analysis of the data, I have regularly received messages from people who...
Read more →
Hello, the goal of this article is to offer a clear description of the dataset that I uploaded to Kaggle in November 2017, followed by...
Read more →
Hello readers, for this article I am going to explain my approach to creating a forecasting system for French (metropolitan) energy consumption. This kind...
Read more →