Linking pages
- GitHub - mlabonne/llm-course: Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. https://github.com/mlabonne/llm-course 10 comments
- The New Kings of Open Source AI (Oct 2023 Recap) https://www.latent.space/p/oct-2023 3 comments
- GitHub - nlpfromscratch/nlp-llms-resources: Master list of curated resources on NLP and LLMs https://github.com/nlpfromscratch/nlp-llms-resources 0 comments
- How to train your own Large Multimodal Model — with Hugo Laurençon & Leo Tronchon of HuggingFace M4 Research https://www.latent.space/p/idefics 0 comments
Linked pages
- ChatGPT can now see, hear, and speak https://openai.com/blog/chatgpt-can-now-see-hear-and-speak 892 comments
- [2302.14045] Language Is Not All You Need: Aligning Perception with Language Models https://arxiv.org/abs/2302.14045 115 comments
- [2303.16199] LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention https://arxiv.org/abs/2303.16199 52 comments
- Introducing Pathways: A next-generation AI architecture https://blog.google/technology/ai/introducing-pathways-next-generation-ai-architecture/ 33 comments
- https://cdn.openai.com/papers/GPTV_System_Card.pdf 15 comments
- [2201.07520] CM3: A Causal Masked Multimodal Model of the Internet https://arxiv.org/abs/2201.07520 5 comments
- [2305.06500] InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning https://arxiv.org/abs/2305.06500 5 comments
- [2305.09617] Towards Expert-Level Medical Question Answering with Large Language Models https://arxiv.org/abs/2305.09617 5 comments
- [2205.14100] GIT: A Generative Image-to-text Transformer for Vision and Language https://arxiv.org/abs/2205.14100 1 comment
- RLHF: Reinforcement Learning from Human Feedback https://huyenchip.com/2023/05/02/rlhf.html 1 comment
- [2304.08485] Visual Instruction Tuning https://arxiv.org/abs/2304.08485 1 comment
- NExT-GPT https://next-gpt.github.io/ 1 comment
- GitHub - salesforce/LAVIS: LAVIS - A One-stop Library for Language-Vision Intelligence https://github.com/salesforce/LAVIS 0 comments
- [1707.02968] Revisiting Unreasonable Effectiveness of Data in Deep Learning Era https://arxiv.org/abs/1707.02968 0 comments
- [2301.12597] BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models https://arxiv.org/abs/2301.12597 0 comments
- [2305.15023] Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models https://arxiv.org/abs/2305.15023 0 comments
- MusicGen - a Hugging Face Space by facebook https://huggingface.co/spaces/facebook/MusicGen 0 comments
Related searches:
Search whole site: site:huyenchip.com
Search title: Multimodality and Large Multimodal Models (LMMs)
See how to search.