Pix2Seq: A New Language Interface for Object Detection – Google AI Blog - discu.eu

Linking pages

Vid2Seq: a pretrained visual language model for describing multi-event videos – Google AI Blog https://ai.googleblog.com/2023/03/vid2seq-pretrained-visual-language.html 16 comments
Google Research, 2022 & beyond: Language, vision and generative models – Google AI Blog https://ai.googleblog.com/2023/01/google-research-2022-beyond-language.html 5 comments
Google at ICLR 2022 – Google AI Blog https://ai.googleblog.com/2022/04/google-at-iclr-2022.html 0 comments
Foundation Models and the Future of Multi-Modal AI https://lastweekin.ai/p/multi-modal-ai 0 comments

Linked pages

Related searches:

Search whole site: site:ai.googleblog.com

Search title: Pix2Seq: A New Language Interface for Object Detection – Google AI Blog

See how to search.

Submit link to: