Linking pages
- LLaVA-1.6: Improved reasoning, OCR, and world knowledge | LLaVA https://llava-vl.github.io/blog/2024-01-30-llava-1-6/ 45 comments
- The Accidental AI Canvas - with Steve Ruiz of tldraw https://www.latent.space/p/tldraw 2 comments
- GitHub - SkalskiP/awesome-foundation-and-multimodal-models: 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] https://github.com/SkalskiP/awesome-foundation-and-multimodal-models 1 comment
- GitHub - showlab/Awesome-GUI-Agent: 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. https://github.com/showlab/Awesome-GUI-Agent 0 comments
Linked pages
Related searches:
Search whole site: site:github.com
Search title: GitHub - THUDM/CogVLM: a state-of-the-art-level open visual language model | 多模态预训练模型
See how to search.