Hacker News
- Chain-of-Thought Hub: Measuring LLMs' Reasoning Performance https://github.com/FranxYao/chain-of-thought-hub 26 comments
Linking pages
- GitHub - imoneoi/openchat: OpenChat: Advancing Open-source Language Models with Imperfect Data https://github.com/imoneoi/openchat 25 comments
- GitHub - SAILResearch/awesome-foundation-model-leaderboards: A curated list of awesome leaderboards for foundation models https://github.com/SAILResearch/awesome-foundation-model-leaderboards 3 comments
- GitHub - Hannibal046/Awesome-LLM: Awesome-LLM: a curated list of Large Language Model https://github.com/Hannibal046/Awesome-LLM 0 comments
- Week 22 - 2023 :: Per Prompt - Weekly LLV, GPT, AI Digests https://perprompt.com/posts/w22-2023/ 0 comments
- GitHub - lmmlzn/Awesome-LLMs-Datasets: Summarize existing representative LLMs text datasets. https://github.com/lmmlzn/Awesome-LLMs-Datasets 0 comments
Linked pages
- GPT-4 https://openai.com/research/gpt-4 5744 comments
- [2303.12712] Sparks of Artificial General Intelligence: Early experiments with GPT-4 https://arxiv.org/abs/2303.12712 911 comments
- PaLM 2 Technical Report https://ai.google/static/documents/palm2techreport.pdf 282 comments
- StarCoder: A State-of-the-Art LLM for Code https://huggingface.co/blog/starcoder 22 comments
- LLaMA: Open and Efficient Foundation Language Models - Meta Research https://research.facebook.com/publications/llama-open-and-efficient-foundation-language-models/ 7 comments
- [2210.11416] Scaling Instruction-Finetuned Language Models https://arxiv.org/abs/2210.11416 5 comments
- [2305.07922] CodeT5+: Open Code Large Language Models for Code Understanding and Generation https://arxiv.org/abs/2305.07922 2 comments
- [2201.11903] Chain of Thought Prompting Elicits Reasoning in Large Language Models https://arxiv.org/abs/2201.11903 1 comment
- [2210.09261] Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them https://arxiv.org/abs/2210.09261 1 comment
- Holistic Evaluation of Language Models (HELM) https://crfm.stanford.edu/helm/latest/ 1 comment
- GPT-4 Technical Report https://cdn.openai.com/papers/gpt-4.pdf 1 comment
- [2206.14858] Solving Quantitative Reasoning Problems with Language Models https://arxiv.org/abs/2206.14858 0 comments
- [2211.09085] Galactica: A Large Language Model for Science https://arxiv.org/abs/2211.09085 0 comments
- [2203.07814] Competition-Level Code Generation with AlphaCode https://arxiv.org/abs/2203.07814 0 comments
- [2202.12837] Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? https://arxiv.org/abs/2202.12837 0 comments
- Towards Complex Reasoning: the Polaris of Large Language Models https://yaofu.notion.site/Towards-Complex-Reasoning-the-Polaris-of-Large-Language-Models-c2b4a51355b44764975f88e6a42d4e75 0 comments
- Leaderboard | C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models https://cevalbenchmark.com/static/leaderboard.html 0 comments
- [2211.14275] Solving math word problems with process- and outcome-based feedback https://arxiv.org/abs/2211.14275#deepmind 0 comments