Linking pages
- Why Google failed to make GPT-3 + why Multimodality for Knowledge Work is the path to AGI - with David Luan of Adept https://www.latent.space/p/adept 40 comments
- Segment Anything 2: Demo-first Model Development https://www.latent.space/p/sam2 2 comments
- Why StackOverflow usage is down 50% — with David Hsu of Retool https://www.latent.space/p/retool 1 comment
- Running Open Source LLMs In Python - A Practical Guide https://christophergs.com/blog/running-open-source-llms-in-python 0 comments
- Cloud Intelligence at the speed of 5000 tok/s - with Ce Zhang and Vipul Ved Prakash of Together AI https://www.latent.space/p/together 0 comments
- Worthwhile Research for building SOTA LLMs (Jan 2024 Recap) https://www.latent.space/p/jan-2024 0 comments
- One standard to deploy them all - with Ben Firshman of Replicate https://www.latent.space/p/replicate 0 comments
- Open Source AI is AI we can Trust — with Soumith Chintala of Meta AI https://www.latent.space/p/soumith 0 comments
- Why Google failed and couldn’t Adept - with David Luan of Adept https://www.latent.space/i/142817627/why-google-couldnt-make-gpt 0 comments
Linked pages
- GitHub - microsoft/playwright: Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API. https://github.com/microsoft/playwright 239 comments
- RedPajama-Data-v2: An open dataset with 30 trillion tokens for training large language models https://together.ai/blog/redpajama-data-v2 60 comments
- Segment Anything Model and the Hard Problems of Computer Vision — with Joseph Nelson of Roboflow https://www.latent.space/p/segment-anything-roboflow#details 52 comments
- Nougat https://facebookresearch.github.io/nougat/ 38 comments
- https://cdn.openai.com/papers/dall-e-3.pdf 37 comments
- Announcing OpenFlamingo: An open-source framework for training vision-language models with in-context learning | LAION https://laion.ai/blog/open-flamingo/ 25 comments
- [2103.03230] Barlow Twins: Self-Supervised Learning via Redundancy Reduction https://arxiv.org/abs/2103.03230 22 comments
- https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/tackling-multiple-tasks-with-a-single-visual-language-model/flamingo.pdf 20 comments
- Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model https://huggingface.co/blog/idefics 2 comments
- The Accidental AI Canvas - with Steve Ruiz of tldraw https://www.latent.space/p/tldraw 2 comments
- AI image training dataset found to include child sexual abuse imagery - The Verge https://www.theverge.com/2023/12/20/24009418/generative-ai-image-laion-csam-google-stability-stanford 1 comment
- [2203.15556] Training Compute-Optimal Large Language Models https://arxiv.org/abs/2203.15556 0 comments
- GitHub - mlfoundations/open_clip: An open source implementation of CLIP. https://github.com/mlfoundations/open_clip 0 comments
- [2010.11929] An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale https://arxiv.org/abs/2010.11929 0 comments
- Latent Space | swyx | Substack https://www.latent.space/ 0 comments
- [2211.05100] BLOOM: A 176B-Parameter Open-Access Multilingual Language Model https://arxiv.org/abs/2211.05100 0 comments
- GPT-4V(ision) system card https://openai.com/research/gpt-4v-system-card 0 comments
- Multimodality and Large Multimodal Models (LMMs) https://huyenchip.com/2023/10/10/multimodal.html 0 comments
- The Busy Person's Intro to Finetuning & Open Source AI - Wing Lian, Axolotl https://www.latent.space/p/axolotl 0 comments
- The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph https://www.latent.space/p/sourcegraph 0 comments
Related searches:
Search whole site: site:latent.space
Search title: How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4 Research
See how to search.