Favorite refs
My favorite so far: https://towardsdatascience.com/how-to-structure-a-data-science-project-for-readability-and-transparency-360c6716800
On repository structure, we have the typical:
But there are others such as:
Close to this, but going up in complexity (frameworks based on DAGs):
- Kedro spaceflights tutorial — Kedro 0.17.7 documentation
- Hamilton: Scaling to Match your Data! | Stitch Fix Technology – Multithreaded
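As a rough illustration of what "based on DAGs" means here, the sketch below wires a pipeline from plain Python functions: each function name is a node, and each parameter name is the upstream node it depends on. This is not the Kedro or Hamilton API. The step functions, the `run` helper, and `NODES` are all hypothetical names made up for this example, and there is no cycle detection.

```python
import inspect

# Hypothetical pipeline steps: the function name is the node name,
# and each parameter name refers to an upstream node.
def raw_prices():
    return [10.0, 12.0, 11.0]

def mean_price(raw_prices):
    return sum(raw_prices) / len(raw_prices)

def centered_prices(raw_prices, mean_price):
    return [p - mean_price for p in raw_prices]

def run(target, nodes, cache=None):
    """Compute `target`, resolving its dependencies recursively.

    Results are cached so each node runs at most once.
    Assumes the graph has no cycles (not checked here).
    """
    cache = {} if cache is None else cache
    if target not in cache:
        func = nodes[target]
        deps = inspect.signature(func).parameters
        kwargs = {name: run(name, nodes, cache) for name in deps}
        cache[target] = func(**kwargs)
    return cache[target]

# Registry of nodes, keyed by function name.
NODES = {f.__name__: f for f in (raw_prices, mean_price, centered_prices)}

result = run("centered_prices", NODES)
print(result)
```

The point of this style is that the execution order is never written by hand: it falls out of the dependency names, which is what lets the real frameworks visualize, test, and rerun individual steps.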
A full framework: Why Metaflow | Metaflow Docs
Also, a good summary on how to share features: How Machine Learning Teams Share and Reuse Features - Tecton