- Tensor and Fully Sharded Data Parallelism - How Trillion Parameter Models Are Trained https://martynassubonis.substack.com/p/tensor-and-fully-sharded-data-parallelism 4 comments mlquestions
- Optimizing Docker Images for Python Production Services https://martynassubonis.substack.com/p/optimizing-docker-images-for-python 7 comments devops