GitHub - salesforce/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence - discu.eu

Linking pages

GitHub - DLYuanGod/TinyGPT-V: TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones https://github.com/DLYuanGod/TinyGPT-V 37 comments
GitHub - IDEA-Research/Grounded-Segment-Anything: Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything https://github.com/IDEA-Research/Grounded-Segment-Anything 15 comments
GitHub - cmhungsteve/Awesome-Transformer-Attention: An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites https://github.com/cmhungsteve/Awesome-Transformer-Attention 13 comments
GitHub - taishi-i/awesome-ChatGPT-repositories: A curated list of resources dedicated to open source GitHub repositories related to ChatGPT https://github.com/taishi-i/awesome-ChatGPT-repositories 5 comments
GitHub - Alpha-VLLM/LLaMA2-Accessory: An Open-source Toolkit for LLM Development https://github.com/Alpha-VLLM/LLaMA2-Accessory 3 comments
GitHub - open-mmlab/Multimodal-GPT: Multimodal-GPT https://github.com/open-mmlab/Multimodal-GPT 1 comment
Aman's AI Journal • Primers • Overview of Large Language Models https://aman.ai/primers/ai/LLM/ 1 comment
GitHub - LMM101/Awesome-Multimodal-Next-Token-Prediction: Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey https://github.com/LMM101/Awesome-Multimodal-Next-Token-Prediction 1 comment
Salesforce AI Open-Sources 'LAVIS,' A Deep Learning Library For Language-Vision Research/Applications - MarkTechPost https://www.marktechpost.com/2022/09/24/salesforce-ai-open-sources-lavis-a-deep-learning-library-for-language-vision-research-applications/ 0 comments
10 interesting Deep learning libraries to checkout | by karim | MLearning.ai | Nov, 2022 | Medium https://medium.com/mlearning-ai/10-interesting-deep-learning-libraries-to-checkout-ccc5a1db1e15 0 comments
GitHub - ttengwang/Caption-Anything: Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://github.com/ttengwang/Caption-Anything 0 comments
Multimodality and Large Multimodal Models (LMMs) https://huyenchip.com/2023/10/10/multimodal.html 0 comments
GitHub - dvlab-research/LLaMA-VID: Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models https://github.com/dvlab-research/LLaMA-VID 0 comments
GitHub - opendilab/LMDrive: LMDrive: Closed-Loop End-to-End Driving with Large Language Models https://github.com/opendilab/LMDrive 0 comments

Linked pages

https://arxiv.org/pdf/2103.00020.pdf 11 comments
