- Big problem when trying to use ViT as VQGAN backbone https://github.com/thuanz123/enhancing-transformers 3 comments deeplearning
Linked pages
- Stability AI https://stability.ai 69 comments
- GitHub - kakaobrain/minDALL-E: PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs https://github.com/kakaobrain/minDALL-E 22 comments
- GitHub - lucidrains/vit-pytorch: Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch https://github.com/lucidrains/vit-pytorch#vision-transformer-for-small-datasets 3 comments
- [2110.04627] Vector-quantized Image Modeling with Improved VQGAN https://arxiv.org/abs/2110.04627 0 comments
- GitHub - openai/CLIP: Contrastive Language-Image Pretraining https://github.com/openai/CLIP 0 comments
Related searches:
Search whole site: site:github.com
Search title: GitHub - thuanz123/enhancing-transformers: An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
See how to search.