# computer-agent **Repository Path**: rooky-top/computer-agent ## Basic Information - **Project Name**: computer-agent - **Description**: Desktop app to control your computer with AI using your terminal, browser, mouse & keyboard 一款桌面应用,可借助终端、浏览器、鼠标和键盘,通过人工智能控制你的电脑 - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2026-05-11 - **Last Updated**: 2026-05-11 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Taskhomie (Computer Use AI Agent) Screenshot 2025-12-29 at 2 06 38 AM Local AI agent that controls your computer. Give it natural language instructions and watch it take screenshots, move your mouse, click, type, and run terminal commands. Built with Tauri, React, and Rust. ## Demo https://github.com/user-attachments/assets/8edd92a7-7d3e-472a-9e48-3b561f0257d6 Here, I used it to autonomously read and reply to tweets, lol. This is purely for demonstration/research, you should not attempt to do the same, lol. ## Modes **Computer Use Mode** - Takes over your screen. Sees what you see via screenshots and controls your cursor and keyboard directly. Use when the task spans multiple apps or needs full desktop access. You step away while it works. **Background Mode** - Runs async while you do other things. Uses Chrome DevTools Protocol for web automation and terminal for everything else. Doesn't touch your mouse or keyboard. Faster and more reliable for web + terminal tasks. ## Setup **Requirements:** - Rust & Cargo - Node.js & npm - Anthropic API key ```bash # install deps npm install # add your api key echo "ANTHROPIC_API_KEY=your-key-here" > .env # run dev npm run tauri dev # or build for production npm run tauri build ``` On macOS, you'll need to grant accessibility permissions when prompted (System Settings → Privacy & Security → Accessibility). ## Shortcuts - `⌃⇧C` - push-to-talk → computer use mode - `⌃⇧B` - push-to-talk → background mode - `⌘⇧H` - help mode (screenshot + quick prompt) - `⌘⇧S` - stop agent ## Stack - **Frontend**: React, TypeScript, Tailwind, Zustand, Framer Motion - **Backend**: Rust, Tauri 2, Tokio - **Models**: Haiku, Sonnet, Opus (selectable in UI) ## Contributing PRs welcome. Hit me up on Twitter @ishanxnagpal. ## License [Apache License 2.0](LICENSE)