What We Know About LLMs (Primer) - discu.eu

Hacker News

What we know about LLMs https://willthompson.name/what-we-know-about-llms-primer 164 comments 25/7/2023

Linking pages

Multimodal LLM with a robot arm, SDXL 1.0, HealthScribe by Amazon, OverflowAI, Generative-AI based virtual room styler by Wayfair and more https://aibrews.substack.com/p/multimodal-llm-with-a-robot-arm-sdxl 0 comments

Linked pages

Introducing Superalignment https://openai.com/blog/introducing-superalignment 558 comments
Anthropic | Introducing 100K Context Windows https://www.anthropic.com/index/100k-context-windows 474 comments
[2005.14165] Language Models are Few-Shot Learners https://arxiv.org/abs/2005.14165 201 comments
LLM Powered Autonomous Agents | Lil'Log https://lilianweng.github.io/posts/2023-06-23-agent/ 177 comments
[1706.03762] Attention Is All You Need https://arxiv.org/abs/1706.03762 145 comments
Prompt Engineering | Lil'Log https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/ 59 comments
Open LLM Leaderboard - a Hugging Face Space by HuggingFaceH4 https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard 51 comments
Democratic inputs to AI https://openai.com/blog/democratic-inputs-to-ai 39 comments
The Illustrated Transformer – Jay Alammar – Visualizing machine learning one concept at a time. https://jalammar.github.io/illustrated-transformer/ 36 comments
How to Train Really Large Models on Many GPUs? | Lil'Log https://lilianweng.github.io/posts/2021-09-25-train-large/ 33 comments
[1810.04805] BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding https://arxiv.org/abs/1810.04805 25 comments
https://chat.openai.com/auth/login 19 comments
[2108.12409] Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation https://arxiv.org/abs/2108.12409 17 comments
Illustrating Reinforcement Learning from Human Feedback (RLHF) https://huggingface.co/blog/rlhf 14 comments
https://arxiv.org/pdf/2112.04426.pdf 12 comments
[2106.09685] LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685 8 comments
[2107.03374] Evaluating Large Language Models Trained on Code https://arxiv.org/abs/2107.03374 8 comments
[2112.09332] WebGPT: Browser-assisted question-answering with human feedback https://arxiv.org/abs/2112.09332 3 comments
[2205.14135] FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness https://arxiv.org/abs/2205.14135 3 comments
Andrej Karpathy on X: "The hottest new programming language is English" / X https://twitter.com/karpathy/status/1617979122625712128 3 comments

Related searches:

Search whole site: site:willthompson.name

Search title: What We Know About LLMs (Primer)

See how to search.

Submit link to: