Hacker News
- Notes on training BERT from scratch on an 8GB consumer GPU https://sidsite.com/posts/bert-from-scratch/ 43 comments
- [P] Notes on training BERT from scratch on an 8GB consumer GPU https://sidsite.com/posts/bert-from-scratch/ 22 comments machinelearning
- Notes on training BERT from scratch on an 8GB consumer GPU https://sidsite.com/posts/bert-from-scratch/ 2 comments learnmachinelearning
Linked pages
- Hugging Face – The AI community building the future. https://huggingface.co/ 57 comments
- [1810.04805] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding https://arxiv.org/abs/1810.04805 25 comments
- [2212.14034] Cramming: Training a Language Model on a Single GPU in One Day https://arxiv.org/abs/2212.14034 25 comments
- Weights & Biases – Developer tools for ML https://wandb.ai/site 11 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:sidsite.com
Search title: Notes on training BERT from scratch on an 8GB consumer GPU | sidsite
See how to search.