Linking pages
- Bridging Images and Text - a Survey of VLMs https://nanonets.com/blog/bridging-images-and-text-a-survey-of-vlms/ 4 comments
- GitHub - SkalskiP/awesome-foundation-and-multimodal-models: 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] https://github.com/SkalskiP/awesome-foundation-and-multimodal-models 1 comment
- Obvious next steps in AI research - by David Beniaguev https://davidbeniaguev.substack.com/p/obvious-next-steps-in-ai-research 0 comments
Related searches:
Search whole site: site:arxiv.org
Search title: [2306.14824] Kosmos-2: Grounding Multimodal Large Language Models to the World
See how to search.