GitHub - THUDM/CogVLM: a state-of-the-art-level open visual language model | 多模态预训练模型 - discu.eu

Linking pages

LLaVA-1.6: Improved reasoning, OCR, and world knowledge | LLaVA https://llava-vl.github.io/blog/2024-01-30-llava-1-6/ 45 comments
GitHub - francedot/acu: A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools. https://github.com/francedot/acu 5 comments
The Accidental AI Canvas - with Steve Ruiz of tldraw https://www.latent.space/p/tldraw 2 comments
GitHub - SkalskiP/awesome-foundation-and-multimodal-models: 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] https://github.com/SkalskiP/awesome-foundation-and-multimodal-models 1 comment
GitHub - showlab/Awesome-GUI-Agent: 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. https://github.com/showlab/Awesome-GUI-Agent 0 comments

Linked pages

Related searches:

Search whole site: site:github.com

Search title: GitHub - THUDM/CogVLM: a state-of-the-art-level open visual language model | 多模态预训练模型

See how to search.

Submit link to: