discu
Newsletters
Mentions
Extension
Pricing
Login
Sign Up
Hacker News
Transformers from Scratch (2021)
https://e2eml.school/transformers.html
46 comments
25/4/2023
Transformers from Scratch
https://e2eml.school/transformers.html
17 comments
23/11/2021
Reddit
Art of stacking neural network modules in favor of gradient flow
https://e2eml.school/transformers.html
3 comments
8/5/2023
learnmachinelearning