Linking pages
- GitHub - ddupont808/GPT-4V-Act: AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI https://github.com/ddupont808/GPT-4V-Act 22 comments
- GitHub - microsoft/SoM: Set-of-Mark Prompting for LMMs https://github.com/microsoft/SoM#-set-of-mark-prompting-or-gpt-4v 1 comment
- GitHub - roboflow/multimodal-maestro: Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥 https://github.com/roboflow/multimodal-maestro 0 comments
- GitHub - tmgthb/Autonomous-Agents: Autonomous Agents (LLMs) research papers. Updated Daily. https://github.com/tmgthb/Autonomous-Agents 0 comments
Related searches:
Search title: [2310.11441] Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V