Hacker News
- Distributed Inference and Fine-Tuning of Large Language Models over the Internet https://browse.arxiv.org/html/2312.08361v1 22 comments
Linked pages
- [2205.01068] OPT: Open Pre-trained Transformer Language Models https://arxiv.org/abs/2205.01068 318 comments
- Petals – Decentralized platform for running large language models https://petals.dev/ 125 comments
- Attention is All you Need https://papers.nips.cc/paper/7181-attention-is-all-you-need 30 comments
- GLM-130B: An Open Bilingual Pre-Trained Model | GLM-130B https://keg.cs.tsinghua.edu.cn/glm-130b/posts/glm-130b/ 2 comments
- bigscience/bloom-7b1 · Hugging Face https://huggingface.co/bigscience/bloom-7b1 2 comments
- [2112.06905] GLaM: Efficient Scaling of Language Models with Mixture-of-Experts https://arxiv.org/abs/2112.06905 1 comment
- [2109.04650] What Changes Can Large-scale Language Models Bring? Intensive Study on HyperCLOVA: Billions-scale Korean Generative Pretrained Transformers https://arxiv.org/abs/2109.04650 1 comment
- [1811.02084] Mesh-TensorFlow: Deep Learning for Supercomputers https://arxiv.org/abs/1811.02084 0 comments
- [1404.5997] One weird trick for parallelizing convolutional neural networks http://arxiv.org/abs/1404.5997 0 comments
- [2104.12369] PanGu-$α$: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation https://arxiv.org/abs/2104.12369 0 comments
- https://images.nvidia.com/aem-dam/en-zz/Solutions/geforce/ampere/pdf/NVIDIA-ampere-GA102-GPU-Architecture-Whitepaper-V1.pdf 0 comments
- [2204.06745] GPT-NeoX-20B: An Open-Source Autoregressive Language Model https://arxiv.org/abs/2204.06745 0 comments
- [2112.11446] Scaling Language Models: Methods, Analysis & Insights from Training Gopher https://arxiv.org/abs/2112.11446 0 comments
- https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf 0 comments
Related searches:
Search whole site: site:browse.arxiv.org
Search title: Distributed Inference and Fine-tuning of Large Language Models Over The Internet
See how to search.