- [D] [P] Web browsing UI-based AI agent: GPT-4V-Act https://github.com/ddupont808/GPT-4V-Act 22 comments machinelearning
Linking pages
- GitHub - microsoft/SoM: Set-of-Mark Prompting for LMMs https://github.com/microsoft/SoM#-set-of-mark-prompting-or-gpt-4v 1 comment
- GitHub - Jiayi-Pan/GPT-V-on-Web https://github.com/Jiayi-Pan/GPT-V-on-Web 0 comments
- GitHub - showlab/Awesome-GUI-Agent: 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. https://github.com/showlab/Awesome-GUI-Agent 0 comments
Page title: GitHub - ddupont808/GPT-4V-Act: AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI