Hacker News
- Computer Use Agents https://github.com/francedot/acu 0 comments
- Awesome Agents for Computer Use https://github.com/francedot/acu 1 comment
Linked pages
- Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku \ Anthropic https://www.anthropic.com/news/3-5-models-and-computer-use 742 comments
- GitHub - apple/ml-ferret https://github.com/apple/ml-ferret 428 comments
- GitHub - dockur/windows: Windows inside a Docker container. https://github.com/dockur/windows 184 comments
- AI is about to completely change how you use computers | Bill Gates https://www.gatesnotes.com/AI-agents 97 comments
- GitHub - lavague-ai/LaVague: Automate automation with Large Action Model framework https://github.com/lavague-ai/LaVague 95 comments
- GitHub - microsoft/UFO: A UI-Focused Agent for Windows OS Interaction. https://github.com/microsoft/UFO 62 comments
- GitHub - microsoft/autogen: Enable Next-Gen Large Language Model Applications. https://github.com/microsoft/autogen 55 comments
- [2407.21075] Apple Intelligence Foundation Language Models https://arxiv.org/abs/2407.21075 42 comments
- GitHub - AmberSahdev/Open-Interface: Control Any Computer Using LLMs https://github.com/AmberSahdev/Open-Interface 29 comments
- GitHub - Significant-Gravitas/Auto-GPT: An experimental open-source attempt to make GPT-4 fully autonomous. https://github.com/Significant-Gravitas/Auto-GPT 22 comments
- GitHub - OpenInterpreter/open-interpreter: A natural language interface for computers https://github.com/OpenInterpreter/open-interpreter 20 comments
- MULTI·ON https://multion.ai 7 comments
- [2404.05719] Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs https://arxiv.org/abs/2404.05719 7 comments
- GitHub - e2b-dev/desktop: E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. https://github.com/e2b-dev/desktop 7 comments
- GitHub - suitedaces/computer-agent: Desktop app powered by Claude’s computer use capability to control your computer https://github.com/suitedaces/computer-agent 3 comments
- GitHub - OthersideAI/self-operating-computer: A framework to enable multimodal models to operate a computer. https://github.com/OthersideAI/self-operating-computer 2 comments
- [2409.12089] The Impact of Element Ordering on LM Agent Performance https://arxiv.org/abs/2409.12089 2 comments
- GitHub - nat/natbot: Drive a browser with GPT-3 https://github.com/nat/natbot 1 comment
- [2307.10088] Android in the Wild: A Large-Scale Dataset for Android Device Control https://arxiv.org/abs/2307.10088 1 comment
- GitHub - THUDM/CogVLM: a state-of-the-art-level open visual language model | multimodal pre-trained model https://github.com/THUDM/CogVLM 1 comment