Linking pages
- Mini-Gemini: A Simple and Effective Artificial Intelligence Framework Enhancing multi-modality Vision Language Models (VLMs) - MarkTechPost https://www.marktechpost.com/2024/03/30/mini-gemini-a-simple-and-effective-artificial-intelligence-framework-enhancing-multi-modality-vision-language-models-vlms/
Linked pages
- [2403.18814] Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models https://arxiv.org/abs/2403.18814
- GitHub - huggingface/diffusers: 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch https://github.com/huggingface/diffusers
- GitHub - lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. https://github.com/lm-sys/FastChat
- GitHub - haotian-liu/LLaVA: Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. https://github.com/haotian-liu/LLaVA
- MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models https://mathvista.github.io/
- MMMU https://mmmu-benchmark.github.io/
- NousResearch/Nous-Hermes-2-Yi-34B · Hugging Face https://huggingface.co/NousResearch/Nous-Hermes-2-Yi-34B