Release news
Since our last newsletter post, we’ve made quite a few releases!
We shipped Python 3.12 support for our core package and a few plugins, improved the developer experience, and dropped some dependencies to make Kedro more lightweight.
We released Kedro 0.19.5 (a hotfix on top of 0.19.4) that supports running commands in any subdirectory (and offers improved error messages outside a Kedro project), faster pipeline creation, more visible cookiecutter errors, and substantial improvements to the Kedro + Airflow documentation.
We launched kedro-datasets 3.0 to ship the much anticipated ibis.TableDataset
contributed by Deepyaman Datta from Voltron Data, as well as a NetCDF
dataset for climate and weather data contributed by Riley X. Brady, Ph.D. from QuantumBlack, AI by McKinsey
Kedro-Viz 9.0 dropped, adding filters and namespace pipeline states to stateful URLs, the option to preview JSON datasets, and an upgrade to Node 18.
Phew! Throw in some additional releases of kedro-telemetry 0.4.0, kedro-mlflow 0.12.2 by Yolan Honoré-Rougé and kedro-boot 0.2.2 by Takieddine KADIRI, and you’ll understand why we were a bit quiet on this blog last month!
Kedro TSC news
We are pleased to welcome Simon Brugman to Kedro’s Technical Steering Committee. Simon has been consistently contributing to Kedro since at least 2020, and in particular he has been a great contributor to kedro-airflow
, has pushed for upstreaming solutions to typical problems for Kedro users such as PyCodeHash, and has advocated for Kedro at work and elsewhere.
Kedro in presentation 📢
There are some Kedro presentations coming up in the near future at events around the world.
At PyCon US 2024, there’s a presentation on May 18th titled “Improve Your ML Projects: Embrace Reproducibility and Production Readiness with Kedro” by Juliana Ferreira Alves.
PyData London will feature “Analytics engineering without dbt? Building the composable Python data stack with Kedro and Ibis” on June 14th.
EuroPython 2024 showcases “From zero to MLOps: An open source stack to fight spaghetti ML” although we don’t have a fixed date for this as yet, it’ll be during the event running 8th-14th July.
Kedro in print 📚
We recommend checking out a fantastic new book from O’Reilly: Software Engineering for Data Scientists. The book explains the best practices needed to write reproducible, robust, scalable code, which is, of course, a guiding philosophy for Kedro projects. We were pleased to spot multiple mentions of Kedro, particularly in the chapter on design and refactoring.
If you don't have access to Safari Online, you can get 30 days free access to the book, and other O’Reilly content, with this code.
Kedro on film 📽️
Janick Spirig has made a video for Kedro. In less than 15 minutes, he explains how you can use Kedro’s Data Catalog in any Python project for loading and storing datasets, without the need of implementing traditional Kedro components such as nodes or pipelines. Check it out!
If you spot Kedro in an article or video, don’t forget to share it on the #resources channel on Slack. And if you make a video about Kedro, or write an article, don’t be shy! Add it to the channel and to the “Awesome Kedro” repository so others can find it!
Recently on the Kedro blog
Recently published on the Kedro blog:
We’re always looking for collaborators to write about their experiences using Kedro. Get in touch with us on our Slack workspace to tell us your story!
That’s it for this edition!
Don’t forget that we also make regular Kedro updates on LinkedIn, on our Mastodon and on our increasingly popular Slack community. Keep an eye on the QuantumBlack LinkedIn feed too!
Bookmark this blog or add our RSS feed to your favorite reader to stay in the loop and join us next month for a summery update from the Kedro team.