Hacker News
- SlowLlama: Finetune llama2-70B and codellama on MacBook Air without quantization https://github.com/okuvshynov/slowllama 54 comments
Linked pages
- GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
- GitHub - karpathy/llama2.c: Inference Llama 2 in pure C, single file, fp32, haha https://github.com/karpathy/llama2.c 167 comments
- [2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
- GitHub - okuvshynov/cubestat: Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and nVidia GPUs on Linux https://github.com/okuvshynov/cubestat 0 comments