Linking pages
- End-to-end Generative Pre-training for Multimodal Video Captioning – Google AI Blog https://ai.googleblog.com/2022/06/end-to-end-generative-pre-training-for.html 0 comments
- VideoPrism: A foundational visual encoder for video understanding – Google Research Blog https://blog.research.google/2024/02/videoprism-foundational-visual-encoder.html 0 comments
Related searches:
Search whole site: site:cs.stanford.edu
Search title: Dense-Captioning Events in Videos
See how to search.