Hacker News
- Collection of LLM resources that can be used to build products you can “own” https://github.com/imaurer/awesome-decentralized-llm 6 comments
Lobsters
- awesome-decentralized-llm: Collection of LLM resources that can be used to build products you can "own" or to perform reproducible research https://github.com/imaurer/awesome-decentralized-llm 4 comments ai
Linking pages
Linked pages
- Large language models are having their Stable Diffusion moment right now https://simonwillison.net/2023/Mar/11/llama/ 369 comments
- GitHub - nomic-ai/gpt4all: gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue https://github.com/nomic-ai/gpt4all 325 comments
- Stanford CRFM https://crfm.stanford.edu/2023/03/13/alpaca.html 298 comments
- [2303.17580] HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace https://arxiv.org/abs/2303.17580 295 comments
- GitHub - antimatter15/alpaca.cpp: Locally run an Instruction-Tuned Chat-Style LLM https://github.com/antimatter15/alpaca.cpp 287 comments
- GitHub - ggerganov/llama.cpp: Port of Facebook's LLaMA model in C/C++ https://github.com/ggerganov/llama.cpp 286 comments
- [2304.03442] Generative Agents: Interactive Simulacra of Human Behavior https://arxiv.org/abs/2304.03442 276 comments
- Cerebras-GPT: A Family of Open, Compute-efficient, Large Language Models - Cerebras https://www.cerebras.net/blog/cerebras-gpt-a-family-of-open-compute-efficient-large-language-models/ 234 comments
- The Coming of Local LLMs - Nick Arner https://nickarner.com/notes/the-coming-of-local-llms-march-23-2023/ 206 comments
- Introducing LLaMA: A foundational, 65-billion-parameter language model https://ai.facebook.com/blog/large-language-model-llama-meta-ai/ 204 comments
- GitHub - BlinkDL/RWKV-LM: RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding. https://github.com/BlinkDL/RWKV-LM 179 comments
- GitHub - Torantulino/Auto-GPT: An experimental open-source attempt to make GPT-4 fully autonomous. https://github.com/Torantulino/Auto-GPT 175 comments
- Hello Dolly: Democratizing the magic of ChatGPT with open models https://www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html 173 comments
- GitHub - BlinkDL/ChatRWKV: ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source. https://github.com/BlinkDL/ChatRWKV 139 comments
- Running LLaMA 7B on a 64GB M2 MacBook Pro with llama.cpp | Simon Willison’s TILs https://til.simonwillison.net/llms/llama-7b-m2 82 comments
- Cerebras-GPT vs LLaMA AI Model Comparison | LunaTrace https://www.lunasec.io/docs/blog/cerebras-gpt-vs-llama-ai-model-comparison/ 70 comments
- The RWKV language model: An RNN with the advantages of a transformer | The Good Minima https://johanwind.github.io/2023/03/23/rwkv_overview.html 45 comments
- GitHub - microsoft/JARVIS: JARVIS, a system to connect LLMs with ML community https://github.com/microsoft/JARVIS 43 comments
- GitHub - ravenscroftj/turbopilot https://github.com/ravenscroftj/turbopilot 36 comments
- ReAct: Synergizing Reasoning and Acting in Language Models https://react-lm.github.io 29 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.