[2403.05530] Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Linking pages

I. From GPT-4 to AGI: Counting the OOMs - SITUATIONAL AWARENESS https://situational-awareness.ai/from-gpt-4-to-agi/ 91 comments
How Google DeepMind's AlphaGeometry Reached Math Olympiad Level Reasoning By Combining Creative LLMs With Deductive Symbolic Engines https://codecompass00.substack.com/p/google-deepmind-alpha-geometry-neuro-symbolic-llm-system 21 comments
"Attention, Please!": A Visual Guide To The Attention Mechanism [Transformer Series] https://open.substack.com/pub/codecompass00/p/visual-guide-attention-mechanism-transformers?r=rcorn 7 comments
Evaluating long context large language models https://www.artfish.ai/p/long-context-llms 4 comments
Sholto Douglas & Trenton Bricken - How to Build & Understand GPT-7's Mind https://www.dwarkeshpatel.com/p/sholto-douglas-trenton-bricken 3 comments
Francois Chollet, Mike Knoop - LLMs won’t lead to AGI - $1,000,000 Prize to find true solution https://www.dwarkeshpatel.com/p/francois-chollet 2 comments
The best NLP papers of 2024 - The best NLP papers https://thebestnlppapers.com/ 2 comments
Tips for LLM Pretraining and Evaluating Reward Models https://sebastianraschka.com/blog/2024/research-papers-in-march-2024.html 1 comment
"Attention, Please!": A Visual Guide To The Attention Mechanism [Transformer Series] https://open.substack.com/pub/codecompass00/p/visual-guide-attention-mechanism-transformers 1 comment
What is LoRA?: A Visual Guide to Low-Rank Approximation for Fine-Tuning LLMs Efficiently https://codecompass00.substack.com/p/what-is-lora-a-visual-guide-llm-fine-tuning 1 comment
How OpenAI Uses LLMs to Explain Neurons Inside LLMs At Scale https://codecompass00.substack.com/p/how-openai-uses-llms-to-explain-llm-neurons-at-scale 1 comment
Tips for LLM Pretraining and Evaluating Reward Models https://magazine.sebastianraschka.com/p/tips-for-llm-pretraining-and-evaluating-rms 0 comments
Gemini Nano - Google DeepMind https://deepmind.google/technologies/gemini/nano/ 0 comments
Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History https://www.dwarkeshpatel.com/p/leopold-aschenbrenner 0 comments
Scaling Up Malware Analysis with Gemini 1.5 Flash | Google Cloud Blog https://cloud.google.com/blog/topics/threat-intelligence/scaling-up-malware-analysis-with-gemini 0 comments
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! – The Berkeley Artificial Intelligence Research Blog https://bair.berkeley.edu/blog/2024/07/20/visual-haystacks/ 0 comments
What is QLoRA?: A Visual Guide to Efficient Finetuning of Quantized LLMs https://open.substack.com/pub/codecompass00/p/qlora-visual-guide-finetune-quantized-llms-peft?r=rcorn 0 comments
What is QLoRA?: A Visual Guide to Efficient Finetuning of Quantized LLMs https://codecompass00.substack.com/p/qlora-visual-guide-finetune-quantized-llms-peft 0 comments
The Future of Compute: NVIDIA's Crown is Slipping https://mohitdagarwal.substack.com/p/from-dominance-to-dilemma-nvidia 0 comments