- [P] 22 Research Paper Highlights (April-May 2023) -- Summarized In 3 Sentences Or Less https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences 4 comments machinelearning
Linking pages
- Ahead of AI #9: LLM Tuning & Dataset Perspectives https://magazine.sebastianraschka.com/p/ahead-of-ai-9-llm-tuning-and-dataset 4 comments
- AI Research Highlights In 3 Sentences Or Less (May-June 2023) https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences-2a1 3 comments
- The NeurIPS 2023 LLM Efficiency Challenge Starter Guide - Lightning AI https://lightning.ai/pages/community/tutorial/neurips2023-llm-efficiency-guide/ 0 comments
Linked pages
- [2304.11062] Scaling Transformer to 1M tokens and beyond with RMT https://arxiv.org/abs/2304.11062 153 comments
- [2304.15004] Are Emergent Abilities of Large Language Models a Mirage? https://arxiv.org/abs/2304.15004 130 comments
- [2305.01625] Unlimiformer: Long-Range Transformers with Unlimited Length Input https://arxiv.org/abs/2305.01625 109 comments
- [2305.02301] Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes https://arxiv.org/abs/2305.02301 56 comments
- [2304.14454] PMC-LLaMA: Further Finetuning LLaMA on Medical Papers https://arxiv.org/abs/2304.14454 50 comments
- [2304.05332] Emergent autonomous scientific research capabilities of large language models https://arxiv.org/abs/2304.05332 14 comments
- [2110.09456] NormFormer: Improved Transformer Pretraining with Extra Normalization https://arxiv.org/abs/2110.09456 3 comments
- [2305.03047] Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision https://arxiv.org/abs/2305.03047 3 comments
- [2304.09848] Evaluating Verifiability in Generative Search Engines https://arxiv.org/abs/2304.09848 2 comments
- [2304.03283] Diffusion Models as Masked Autoencoders https://arxiv.org/abs/2304.03283 1 comment
- [2304.08467] Learning to Compress Prompts with Gist Tokens https://arxiv.org/abs/2304.08467 1 comment
- [2304.08551] Generative Disco: Text-to-Video Generation for Music Visualization https://arxiv.org/abs/2304.08551 1 comment
- Ahead of AI #7: Large Language Models 3.0 https://magazine.sebastianraschka.com/p/ahead-of-ai-7-large-language-models 0 comments
- [2304.06718] Segment Everything Everywhere All at Once https://arxiv.org/abs/2304.06718 0 comments
- [2304.11082] Fundamental Limitations of Alignment in Large Language Models https://arxiv.org/abs/2304.11082 0 comments