rak00n — your personal Jarvis, on your own hardware

What it is

One agent, on your machine — that talks, sees, remembers, builds, and acts.

🟢

The orb

The whole console is a living green orb that is the agent. Tap to talk, drag it anywhere; the page is its canvas.

🧠

Local brain

A local Qwen model on vLLM — multimodal, fast, OpenAI-compatible. Your data never leaves the box. Point at a cloud model if you have no GPU.

🎙️

Voice

Continuous speech with barge-in and streaming TTS — GPU speech-to-text and neural voice, all on-device.

👁️

Native vision

Share your camera and the agent sees what you show it — straight through the multimodal brain.

🧩

Widgets & canvas

Charts, tables, media, 3D models, code, calendars — and bespoke web apps the agent writes and runs as floating cards.

🔗

Connected apps

Search, music, news, and your cloud accounts — Google & Microsoft — light up mail, calendar, and files.

💾

Memory

Durable file memory plus semantic recall and a relationship graph. It remembers across turns and sessions.

📱

Channels

Reach it from Telegram or WhatsApp — your own account, the full agent answering.

♻️

Self-evolving

It can edit its own source, sandbox-test it, and promote the new version to itself — with automatic rollback.

Install

Self-host it in minutes. Runs on Linux, Windows and macOS.

# on a machine with Docker (and an NVIDIA GPU for the local brain)
git clone https://github.com/MrQbit/rak00n.git && cd rak00n
bash scripts/install.sh

Full local

Everything on one NVIDIA box (e.g. a DGX Spark). Most private, no recurring cost.

Cloud model

No GPU? Run the app locally and point the brain at a remote model — works on a Mac or any laptop.

Windows (WSL2)

A Windows PC with an NVIDIA card runs the whole stack via WSL2 + Docker Desktop.

Full guides: DEPLOYMENT.md · Architecture · README