Hacker News
- Sophia: Scalable Stochastic 2nd-Order Optimizer for Language Model Pre-Training https://arxiv.org/abs/2305.14342 2 comments
- [R] Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training https://arxiv.org/abs/2305.14342 6 comments machinelearning
Linking pages
- Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation) https://magazine.sebastianraschka.com/p/practical-tips-for-finetuning-llms 37 comments
- AI Research Highlights In 3 Sentences Or Less (May-June 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences-2a1 3 comments
- It doesn't matter if the robots are sentient or not https://www.mkaic.blog/p/it-doesnt-matter-if-the-robots 0 comments
- GitHub - AIoT-MLSys-Lab/Efficient-LLMs-Survey: Efficient Large Language Models: A Survey https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey 0 comments
- Compute Thresholds are Ineffective - by Dean W. Ball https://hyperdimensional.substack.com/p/compute-thresholds-are-ineffective 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2305.14342] Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
See how to search.