Hacker News
Linked pages
- Introducing computer use, a new Claude 3.5 Sonnet, and Claude 3.5 Haiku \ Anthropic https://www.anthropic.com/news/3-5-models-and-computer-use 742 comments
- GitHub - apple/ml-ferret https://github.com/apple/ml-ferret 428 comments
- https://openai.com/index/introducing-operator/ 424 comments
- GitHub - Skyvern-AI/skyvern: Automate browser-based workflows with LLMs and Computer Vision https://github.com/Skyvern-AI/Skyvern 213 comments
- GitHub - dockur/windows: Windows inside a Docker container. https://github.com/dockur/windows 184 comments
- AI is about to completely change how you use computers | Bill Gates https://www.gatesnotes.com/AI-agents 97 comments
- GitHub - lavague-ai/LaVague: Automate automation with Large Action Model framework https://github.com/lavague-ai/LaVague 95 comments
- GitHub - microsoft/UFO: A UI-Focused Agent for Windows OS Interaction. https://github.com/microsoft/UFO 62 comments
- GitHub - microsoft/autogen: Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ https://github.com/microsoft/autogen 55 comments
- [2407.21075] Apple Intelligence Foundation Language Models https://arxiv.org/abs/2407.21075 42 comments
- GitHub - AmberSahdev/Open-Interface: Control Any Computer Using LLMs. https://github.com/AmberSahdev/Open-Interface 30 comments
- GitHub - Significant-Gravitas/Auto-GPT: An experimental open-source attempt to make GPT-4 fully autonomous. https://github.com/Significant-Gravitas/Auto-GPT 22 comments
- GitHub - OpenInterpreter/open-interpreter: A natural language interface for computers https://github.com/OpenInterpreter/open-interpreter 20 comments
- MULTI·ON https://multion.ai 7 comments
- [2404.05719] Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs https://arxiv.org/abs/2404.05719 7 comments
- GitHub - e2b-dev/desktop: E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. https://github.com/e2b-dev/desktop 7 comments
- GitHub - inferablehq/inferable: Build distributed AI agents from your existing internal codebases with durable execution. https://github.com/inferablehq/inferable 4 comments
- GitHub - microsoft/OmniParser: A simple screen parsing tool towards pure vision based GUI agent https://github.com/microsoft/OmniParser 4 comments
- GitHub - suitedaces/computer-agent: Desktop app powered by Claude’s computer use capability to control your computer https://github.com/suitedaces/computer-agent 3 comments
- GitHub - browser-use/browser-use: Make websites accessible for AI agents https://github.com/browser-use/browser-use 3 comments