AI Research Highlights In 3 Sentences Or Less (April-May 2023) - discu.eu

Reddit

[P] 22 Research Paper Highlights (April-May 2023) -- Summarized In 3 Sentences Or Less https://magazine.sebastianraschka.com/p/ai-research-highlights-in-3-sentences 4 comments 14/5/2023 machinelearning

Linking pages

Linked pages

[2304.11062] Scaling Transformer to 1M tokens and beyond with RMT https://arxiv.org/abs/2304.11062 153 comments
[2304.15004] Are Emergent Abilities of Large Language Models a Mirage? https://arxiv.org/abs/2304.15004 130 comments
[2305.01625] Unlimiformer: Long-Range Transformers with Unlimited Length Input https://arxiv.org/abs/2305.01625 109 comments
[2305.02301] Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes https://arxiv.org/abs/2305.02301 56 comments
[2304.14454] PMC-LLaMA: Further Finetuning LLaMA on Medical Papers https://arxiv.org/abs/2304.14454 50 comments
[2304.05332] Emergent autonomous scientific research capabilities of large language models https://arxiv.org/abs/2304.05332 14 comments
[2110.09456] NormFormer: Improved Transformer Pretraining with Extra Normalization https://arxiv.org/abs/2110.09456 3 comments
[2305.03047] Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision https://arxiv.org/abs/2305.03047 3 comments
[2304.09848] Evaluating Verifiability in Generative Search Engines https://arxiv.org/abs/2304.09848 2 comments
[2304.03283] Diffusion Models as Masked Autoencoders https://arxiv.org/abs/2304.03283 1 comment
[2304.08467] Learning to Compress Prompts with Gist Tokens https://arxiv.org/abs/2304.08467 1 comment
[2304.08551] Generative Disco: Text-to-Video Generation for Music Visualization https://arxiv.org/abs/2304.08551 1 comment
Ahead of AI #7: Large Language Models 3.0 https://magazine.sebastianraschka.com/p/ahead-of-ai-7-large-language-models 0 comments
[2304.06718] Segment Everything Everywhere All at Once https://arxiv.org/abs/2304.06718 0 comments
[2304.11082] Fundamental Limitations of Alignment in Large Language Models https://arxiv.org/abs/2304.11082 0 comments

Would you like to stay up to date with Computer science? Checkout Computer science Weekly.

Related searches:

Search whole site: site:magazine.sebastianraschka.com

Search title: AI Research Highlights In 3 Sentences Or Less (April-May 2023)

See how to search.

Submit link to: