- [R] Trying to understand the ViTDet paper https://arxiv.org/abs/2203.16527 2 comments machinelearning
Linking pages
- GitHub - cmhungsteve/Awesome-Transformer-Attention: An ultimately comprehensive paper list of Vision Transformer/Attention, including papers, codes, and related websites https://github.com/cmhungsteve/Awesome-Transformer-Attention 13 comments
- Meta AI Open-Sources ViTDet Under a New Approach for Building Computer Vision Systems That Can Recognize a Wide Range of Common and Uncommon Objects - MarkTechPost https://www.marktechpost.com/2022/08/02/meta-ai-open-sources-vitdet-under-a-new-approach-for-building-computer-vision-systems-that-can-recognize-a-wide-range-of-common-and-uncommon-objects/ 0 comments
- Scaling Vision Model Training Platforms with PyTorch | PyTorch https://pytorch.org/blog/scaling-vision-model-training-platforms-with-pytorch/ 0 comments
- F-VLM: Open-vocabulary object detection upon frozen vision and language models – Google AI Blog https://ai.googleblog.com/2023/05/f-vlm-open-vocabulary-object-detection.html 0 comments
Would you like to stay up to date with Computer science? Checkout Computer science
Weekly.
Related searches:
Search whole site: site:arxiv.org
Search title: [2203.16527] Exploring Plain Vision Transformer Backbones for Object Detection
See how to search.