Your Desktop,From Anywhere
A remote compute agent that gives you full AI desktop control from your phone — real-time screen streaming, text or voice input, and autonomous task execution.
How It Works
You speak, your phone understands, your desktop executes — all in one seamless flow.
Open Chrome and search for flights to Tokyo
Take a screenshot and summarize what you see
Speak naturally or type — your phone understands what you want to do on your desktop.
An AI model running on your phone interprets what you said and figures out if it's a simple question or a task that needs your desktop.
Simple questions are answered instantly on the phone — only desktop tasks get sent over.
Automatically picks the fastest available path. Your data never passes through third-party servers.
Before any action runs on your desktop, it's automatically evaluated for safety.
Risky commands run in an isolated sandbox — no access to your files, network, or system.
An AI agent on your desktop breaks down your request into steps and executes them autonomously.
$ Searching "flights to Tokyo"
$ Reading screen... 14 elements found▌
Speak naturally, AI understands
Speak or type naturally on your phone. The on-device AI figures out what you need and whether it requires your desktop.
Encrypted, direct, no cloud
Your commands travel directly from phone to desktop through an encrypted tunnel. Nothing is stored or routed through third-party servers.
AI does the work for you
An autonomous agent takes over your desktop — opening apps, clicking buttons, running commands — while keeping you updated every step of the way.
Connectivity
Three connection paths, one seamless experience — your phone always finds the fastest route to your desktop.
Same Wi-Fi Network
Direct connection, lowest latency
Zero-Trust VPN Mesh
WireGuard-based encrypted tunnel
Public Internet Tunnel
No port forwarding, no firewall config
Once a signaling path is found, a direct WebRTC connection is established between your phone and desktop.
Two pairing modes to fit your workflow — quick one-time sessions or persistent always-on setups.
Expires after 4 hours or when you disconnect
No data persisted on device
Stays active for 30 days with auto-reconnect
Reconnects automatically
Automatic Path Discovery
Contop tries the fastest path first and falls back automatically. If your connection drops, it reconnects with smart backoff — no manual intervention needed.
End-to-End Encrypted
All data flows directly between your phone and desktop — encrypted with DTLS and verified with certificate fingerprints. Nothing passes through third-party servers.
Flexible Pairing
Quick sessions for one-time use, persistent connections for your daily setup. Each device gets one active token — re-pairing automatically revokes the old one.
Features
A powerful mobile interface designed for every workflow
Adaptive Layouts
See your desktop screen and conversation side by side. Drag the separator to resize — anywhere from 30% to 70%.
Maximize your desktop view. The chat floats on top as a transparent overlay — tap through it to keep watching.
Focus on the conversation. A small video preview stays pinned at the top so you never lose sight of your desktop.
Rotate your phone and get a widescreen view. Desktop screen on the left, conversation on the right — plus a fullscreen video option for dedicated monitoring.
Dedicate your entire screen to watching your desktop. Minimal floating controls stay out of the way. Perfect for long-running tasks where you just need to keep an eye on things.
5 Modes for Every Use Case
Split View for balanced monitoring, Video Focus for watching the agent work, Thread Focus for reading results, Side-by-Side for landscape multitasking, and Fullscreen Video for dedicated desktop viewing.
Smart Rotation
Rotate your phone and the layout adapts instantly. Set your preferred mode for portrait and landscape — Contop remembers your choices across sessions.
Drag to Resize
Resize the screen and chat panels by dragging the separator. Works horizontally in portrait and vertically in landscape. Constrained so neither panel gets too small.
Intelligent Model Configuration
Configure each AI role independently from your phone — no server restart needed
Runs on your machine · Works offline
Kimi · Qwen · Phi · Molmo · Holotron
Google's built-in · Autonomous multi-step
Text-based · No screenshots needed
Nine vision backends from local to cloud — choose by privacy, speed, or model preference
3.1 Pro · 3 Flash · 2.5 Pro · 2.5 Flash
GPT-5.4 · GPT-4.1 · o3 · o4 Mini
Claude Opus 4.6 · Sonnet 4.6 · Haiku 4.5
Grok · Devstral · Qwen · Nemotron · 300+
Bring your own API keys — use any provider for conversation or execution
Per-Role Model Selection
Three independent AI roles — conversation, execution, and screen interaction — each configurable with 25+ models from Gemini, OpenAI, Anthropic, or OpenRouter. Change from your phone anytime.
Nine Screen Strategies
Nine ways for the agent to see your screen — from local OmniParser to six cloud vision models, Google's native vision, or keyboard-first with no screenshots.
Switch Without Restarting
Change models and backends on the fly from mobile settings. The desktop agent picks up your new configuration on the next command — zero downtime.
Everyday Experience
See every step the agent takes in real time — messages, tool calls, and results stream into a live thread
Pick up where you left off — sessions persist across app restarts with full conversation history
Tell the agent how you want it to behave — set language, project paths, or preferred tools
Control your desktop state from anywhere
Lock your screen or keep it awake during long tasks — all from your phone
Speak or type — your intent becomes the command. Record voice, review, and send, or type directly for quick instructions
Take direct control — move the cursor with a joystick, click, scroll, and send key combos from your phone
See Every Step in Real Time
Watch the agent work through your request step by step. User messages, AI responses, tool calls, and results stream into a live thread — with progress indicators and expandable details.
Pick Up Where You Left Off
Every session is saved automatically with full conversation history. Browse by date, filter by tool or result, rename sessions, and continue any past session with one tap.
Your Desktop, Your Rules
Lock your screen, keep it awake, set custom instructions, use voice input, or take direct control with a joystick overlay for cursor, clicks, and keyboard shortcuts. Switch between AI and manual mode seamlessly.
Agent & Automation
33 built-in tools that let the agent run commands, control your screen, manage files, and automate entire workflows.
✓ Installed in 1.2s
Run shell commands on your desktop just like you would in a terminal — install packages, run scripts, manage files.
The agent sees your screen, identifies every button and element, then clicks, types, and scrolls — just like a person would.
Controls Chrome directly — no screenshots needed. Navigates pages, fills forms, clicks buttons, and reads content efficiently.
Reads page text directly instead of taking screenshots — 10x more efficient for the AI.
Works with any file on your machine — text, code, PDFs, images, and Excel spreadsheets.
Manage windows, read the clipboard, monitor processes, and download files — works the same on every platform.
Launch and close apps, handle Save As and Open dialogs, and create reusable skills to automate repetitive workflows.
Custom Skills
Teach the agent new abilities by creating reusable skills — chain multiple steps into one command.
Three ways to control your desktop
Run terminal commands, automate GUI interactions by seeing your screen, or control Chrome directly — the agent picks the best approach for each task.
Works with any file, any platform
Read and edit code, PDFs, images, and spreadsheets. Manage windows and monitor your system. Same experience on Windows, macOS, and Linux.
Teach it new tricks
Create custom skills to automate your unique workflows. Chain actions together, save them once, and reuse them forever — no coding required.
Model Providers
Use API keys or your existing subscriptions — choose from 4 providers and 20+ models, and configure any combination for any task.
Choose from 4 providers and 20+ models. Use any combination for different tasks — switch anytime from your phone.
The app uses three independent AI roles — assign any provider to any role, and change them at runtime from mobile settings.
Any provider can fill any role — use Gemini for conversation and Claude for execution, or any other combination.
Set up API keys or enable subscription mode on the desktop app. Configuration travels to your phone securely through QR pairing — no manual copying.
Pick the best model for the job
Different tasks benefit from different models. Use a fast model for quick actions and a powerful one for complex reasoning — all from the same app.
Configure from your phone
Switch models and providers at any time from mobile settings. Each AI role can be independently assigned to any supported model.
Your credentials, your control
API keys and subscription preferences never leave your devices. They're configured on your desktop, transferred securely via QR, and stored encrypted on your phone.
Skills
Extensible agent capabilities via the SKILL.md standard — built-in skills included, custom skills easy to create.
Extensible Agent
Add new capabilities by dropping a SKILL.md file into the skills directory. The agent discovers and loads it automatically — no code changes needed.
Deterministic Workflows
Define YAML step sequences for repetitive tasks — keyboard shortcuts, menu navigation, form filling. Runs the same way every time, no AI guesswork.
Create Your Own
Build custom skills as prompt instructions, YAML workflows, Python tools, or any combination. Manage them from the desktop GUI — discover, enable, edit.
Security
Every layer verified against the real codebase — from physical machine protection to encrypted peer-to-peer connections.
Away Mode
Away Mode protects your machine when you're not at the keyboard. PIN overlay, keyboard lock, idle auto-engage, encrypted secrets.
Command Classification
Every command is classified before it runs. Dangerous actions are sandboxed or blocked. You approve what matters.
End-to-End Encrypted
End-to-end encrypted. Peer-to-peer. No cloud relay. Biometric pairing. Your data never leaves the tunnel.
Paired Device Management
See every connected device, where it's connecting from, and revoke access instantly. OS alerts for every connection event.
Use Cases
Real people, real problems, solved in seconds.
PagerDuty fires while Alex is on the train. He opens Contop, speaks one command, and the agent checks the logs, finds the stalled container, and restarts it. Outage resolved in under a minute — no laptop needed.
Sarah asks Contop to run her render script and casually adds “delete temp files in root.” The security gate blocks the dangerous part, asks her phone to confirm, and kicks off just the render. System stays safe.
Marcus left a Blender render running on his workstation. At dinner, his phone shows a GPU memory error dialog blocking the process. He tells Contop to lower the tile size and hit retry — the agent navigates Blender's UI visually, clicks through the settings, and the render resumes.
Instant Response
Resolve critical issues in seconds, not minutes. Voice-to-action from any location.
Safety by Design
Dangerous actions are caught, sandboxed, and confirmed before execution.
Zero Walkthrough
Remote support without asking users to follow complex steps.
Download
Get Contop running on your machine in minutes.
Or install via package manager—no security warnings
brew install slopedrop/contop/contopWindows:scoop bucket add contop https://github.com/slopedrop/scoop-contopthenscoop install contopiOS
Coming Soon
Coming SoonDeveloper Documentation
Setup guides, API reference, skill authoring, and configuration.
Desktop Agent
Control your computer with AI from anywhere. Windows, macOS, and Linux.
Mobile Commander
Voice and text control from your phone. Android (iOS coming soon).
Documentation
Setup guides, API reference, skill authoring, and configuration.