Hacker News
- Cerebras-GPT: Open Compute-Optimal Language Models Trained on Cerebras Cluster https://arxiv.org/abs/2304.03208 12 comments
- [R] Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster https://arxiv.org/abs/2304.03208 39 comments machinelearning
Linking pages
- GitHub - eugeneyan/open-llms: 🤖 A list of open LLMs available for commercial use. https://github.com/eugeneyan/open-llms 2 comments
- Cerebras Systems Releases Seven New GPT Models Trained on CS-2 Wafer-Scale Systems - Cerebras https://www.cerebras.net/press-release/cerebras-systems-releases-seven-new-gpt-models-trained-on-cs-2-wafer-scale-systems 0 comments
- LLM Collection | Prompt Engineering Guide https://www.promptingguide.ai/models/collection 0 comments
- BTLM-3B-8K: 7B Performance in a 3 Billion Parameter Model - Cerebras https://www.cerebras.net/machine-learning/btlm-3b-8k-7b-performance-in-a-3-billion-parameter-model/ 0 comments
- The Practitioner's Guide to the Maximal Update Parameterization | EleutherAI Blog https://blog.eleuther.ai/mutransfer/ 0 comments