Linking pages
- LLaVA-1.6: Improved reasoning, OCR, and world knowledge | LLaVA https://llava-vl.github.io/blog/2024-01-30-llava-1-6/ 45 comments
- GitHub - haotian-liu/LLaVA: Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities. https://github.com/haotian-liu/LLaVA 0 comments
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- GitHub - sshh12/multi_token: Embed arbitrary modalities (images, audio, documents, etc) into large language models. https://github.com/sshh12/multi_token 0 comments