This article is an overview of the progress we have made at Groundwire toward integrating LLMs into software development on Urbit and toward building general-purpose LLM personal assistants on Urbit. The aim is to elucidate the motivations and guiding principles behind this work, shed light on its underlying structure, and make these repos accessible for direct use and further independent development.
The work centers on two repos: clurd, a Python script that gives Claude Code direct access to a running Urbit ship's Dojo, and urbit-master, an experimental desk for building an LLM-powered "everything app" on Urbit. clurd is stable and has been tremendously useful for development. urbit-master is in active flux - it contains most of the interesting ideas but changes frequently. This article documents urbit-master as it exists at commit 1ce76a3.
In this article we will cover:
- A summary of 'Clurd'
- An overview of Urbit-master
- How to get set up
- Guiding Principles
- A detailed exploration of Urbit-master
- Internal Structure
- sailbox
- tarball
Clurd - a summary
This article will focus mostly on urbit-master, but first I'll give a brief summary of clurd.
clurd is a Python script enabling agentic, terminal-based LLMs like Claude Code to interact directly with a running Urbit ship. Using this tool, Claude Code has access to generators and threads and can run arbitrary Hoon code directly in the Dojo. The project emerged out of explorations into the highest-leverage way to use Claude Code to build on Urbit.
Early experiments involved simply giving Claude Code local access to the core repos - urbit/urbit, urbit/vere, docs.urbit.org - to teach it Hoon. It quickly became apparent that Claude was already conversant in Hoon; it was even capable of giving a pretty thorough and accurate explanation of %spider. The problem wasn't knowledge but feedback: its code frequently failed to compile. It did, however, adjust fairly well when error output was manually copied back into the conversation.
Beyond the obviously exciting idea of giving an LLM general access to an Urbit ship, the prospect of tight feedback loops for writing and compiling code motivated the construction of a simple tool for writing to and reading from the Dojo.
It occurred to me that webterm already exposes the Dojo over HTTP. After some tinkering, I pointed Claude at the webterm repo and told it to do what webterm does: submit characters one at a time to the main Dojo session, listen to the same endpoints webterm listens to, reconstruct the terminal output the same way webterm reconstructs it, and return the result after a timer expires.
That's basically all clurd is - a Python script that pretends to be webterm, giving Claude Code a direct line to the Dojo; a simple hack that opened up new horizons and allowed a shift in focus from allowing LLMs to interact with Urbit at all to allowing LLMs to interact with Urbit effectively.
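To make "pretending to be webterm" a bit more concrete, here is a minimal sketch (in Python, the language clurd itself is written in) of the Eyre channel protocol it builds on: log in with the ship's +code, PUT poke actions to a channel, and read results back as server-sent events. The terminal agent, mark, and JSON payload shown are placeholders - the real details live in the clurd and webterm repos.

```python
import json
import uuid
import requests

SHIP_URL = "http://localhost:8080"           # your ship's HTTP endpoint
ACCESS_CODE = "lidlut-tabwed-pillex-ridrup"  # output of +code in the Dojo

session = requests.Session()

# 1. Authenticate; Eyre sets an urbauth cookie on the session.
session.post(f"{SHIP_URL}/~/login", data={"password": ACCESS_CODE})

# 2. A channel is created by PUTting a list of actions to a fresh channel ID.
channel = f"{SHIP_URL}/~/channel/{uuid.uuid4().hex}"

def poke(app, mark, payload, ship="zod", msg_id=1):
    """Send one poke action over the channel."""
    actions = [{"id": msg_id, "action": "poke", "ship": ship,
                "app": app, "mark": mark, "json": payload}]
    session.put(channel, data=json.dumps(actions),
                headers={"Content-Type": "application/json"})

# clurd submits keystrokes one at a time the way webterm does; the agent
# name, mark, and JSON shape below are placeholders, not the real ones.
poke("herm", "belt", {"session": "", "belt": {"txt": ["("]}})

# 3. Responses come back as a server-sent event stream on the same URL.
with session.get(channel, stream=True,
                 headers={"Accept": "text/event-stream"}) as resp:
    for line in resp.iter_lines():
        if line.startswith(b"data:"):
            print(line.decode())
```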
Urbit-master - an overview
Before exploring urbit-master in greater detail I will provide some instructions for setup - but first, an overview of what urbit-master actually is and why it exists.
The purpose of urbit-master is to be a place for me to develop small tools that facilitate daily life: tools that help me organize my thoughts and aims, coordinate them across time, and eventually coordinate them with other people. It posits that LLMs are, today, an indispensable tool to this end.
At commit 1ce76a3 it provides: a web-based Claude chat interface with conversation branching, an MCP server exposing Urbit-native tools to external LLM clients, a task management system (open-loops), scheduled automation (alarms), Telegram notifications, S3 integration for backup, and a file browser for viewing and managing state as ordinary files.
The underlying architecture is built on two key libraries: sailbox, an agent wrapper based on thread-like "fibers" enabling async/await-style programming in Hoon, and tarball, a filesystem-as-state abstraction that makes state legible and portable.
Setup
Prerequisites
- A running Urbit ship
- API keys for desired integrations (Claude, Telegram, Brave, S3)
Installation
# Clone the repo
git clone https://github.com/gwbtc/urbit-master
cd urbit-master
Configuration
# Copy example config
cp config.example.json config.json
Edit config.json with your credentials:
{
"ship_url": "http://localhost:8080",
"access_code": "your-access-code",
"telegram": {
"bot_token": "...",
"chat_id": "..."
},
"s3": {
"access_key": "...",
"secret_key": "...",
"region": "...",
"bucket": "...",
"endpoint": "..."
},
"claude": { "api_key": "..." },
"brave": { "api_key": "..." }
}
Note: claude and brave keys are not in config.example.json but are expected by the scripts.
Push credentials to ship
./urbit-master update all
Sync desk to ship
In Dojo, create and mount the desk:
|new-desk %master
|install our %master
|mount %master
Then run sync.sh in a separate tmux pane or window (it watches for changes):
./sync.sh
Or simply replace the contents of [your ship]/master with the contents of [urbit-master repo]/desk
Finally, commit in Dojo:
|commit %master
Verify
./urbit-master status               # Check connectivity
./urbit-master test mcp tools/list  # List available MCP tools
Once running, visit /master in your browser (e.g., http://localhost:8080/master) to see the main interface.
Two Ways to Use Claude
1. Chat Interface (Claude API)
The web-based chat at /master/claude uses the Claude API directly. This requires a Claude API key configured in config.json and pushed to the ship.
To get an API key: create an account at platform.claude.com, then generate an API key at platform.claude.com/settings/keys.
2. Claude Code via MCP
Claude Code can connect to the ship's MCP endpoint to access tools (open loops, alarms, telegram, etc.) while working locally.
In your terminal, add the MCP server:
claude mcp add --transport http urbit-master http://localhost:8080/master/mcp \
  --header "Cookie: urbauth-~your-ship=your-cookie-value"
(Replace localhost:8080 with your ship's URL and set the urbauth cookie for your ship.)
These are independent - you can use one or both.
Setting Up Telegram Notifications
To use the Telegram integration, you need a bot token and chat ID.
Create a bot:
- Open Telegram and search for @BotFather
- Send /newbot and follow the prompts (name your bot, give it a username ending in "bot")
- BotFather will give you an API token - save this for config.json
Get your chat ID:
- Start a chat with your new bot (send it any message)
- Visit https://api.telegram.org/bot<YourBotToken>/getUpdates
- Look for the chat.id field in the JSON response
Test it:
https://api.telegram.org/bot<YourBotToken>/sendMessage?chat_id=<ChatID>&text=test
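The same test scripted with Python's requests, for reference (token and chat ID are the values collected above):

```python
import requests

BOT_TOKEN = "123456:ABC-your-token"  # from @BotFather
CHAT_ID = "987654321"                # from the getUpdates response

# sendMessage is a standard Telegram Bot API method.
resp = requests.get(
    f"https://api.telegram.org/bot{BOT_TOKEN}/sendMessage",
    params={"chat_id": CHAT_ID, "text": "test"},
)
print(resp.json())  # {"ok": true, ...} on success
```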
Setting Up clurd (Optional)
clurd lets Claude Code interact directly with a running Urbit ship - running generators, threads, and arbitrary Hoon in the Dojo.
Clone the repo:
git clone https://github.com/niblyx-malnus/clurd
cd clurd
Copy and edit the config:
cp config.example.json config.json
Set your ship_url and access_code.
Point Claude Code at the README:
Read the README.md in this project to learn how to use the available tools.
Once configured, Claude Code can run Hoon directly on your ship during development sessions.
Guiding Principles
The emergence of urbit-master has been and continues to be guided by a handful of underlying principles which I will elaborate here.
For development:
- Tight feedback loops are most desirable - new code can be tested early and often, adjustments made quickly.
- Most important errors should be caught at compile time.
- Tight feedback loops require low-stress iteration - if every change feels risky, you slow down.
- This is enabled by: graceful state migration across upgrades, and easy backup to Earth.
- Robustness and speed are mutually reinforcing: the safety net enables the speed.
Tools and interfaces:
- Tools are used through interfaces.
- The interface must be as simple and legible as possible, balancing ease of use with transparency to the underlying structure.
- Tool and interface constrain and shape one another.
- Therefore: put a tool under development in direct contact with the medium of its final interface as early as possible, so tool and interface can be developed iteratively as a single system.
Structure and process:
- New structure emerges to support new processes, which are invited by new desires, demands and opportunities.
- These only emerge in the context of actively inhabited structure - structure which persists by virtue of facilitating an ongoing active process.
- New processes must elaborate, amplify or replace old processes as gracefully as possible.
- In particular: memory and data which current processes rely on must not be unduly disturbed and must be adapted seamlessly to new structure.
- More plainly: we need a system which can be actively relied on for important daily tasks while simultaneously being actively developed to improve and extend function through rapid iteration - unburdened by extreme caution to avoid state destruction or meticulous attention to complicated state migration.
What to build - Start simple and concrete:
- Build simple tools that can be directly useful to you in your daily life for the things you actually do.
- Choose things that could benefit from being networked but don't require it.
- Examples: an alarm clock, an egg timer, a notepad, a todo list, the calendar on your fridge, a recipe collection with useful metadata, a place to tally weekly mileage, a simple tool to track your lifts.
- It is easier to generalize and abstract a concrete, well-understood tool than to build a general tool from scratch that serves many purposes.
- It is easier to network a concrete and well-understood tool that many people use than to build a networked tool from scratch before anyone uses it.
Synchronization and coordinated action across time:
- For you and for your computer, synchronization, rhythms, coordinated action across time and deferred plans triggering action at a future moment are critical.
- The first urbit-master MCP tool built was the Telegram notifications tool: a dead-simple, no-frills but workable notification system. Your ship can tell you when something has happened, remind you to do something, or nudge you at a scheduled time.
- This has also begun to be generalized as a general-purpose tool-scheduling system akin to cron jobs.
- Synchronization and coordination across the network is the desired end goal.
Asynchronous "imperative" programs (Urbit thread-like things):
- We want to give LLMs access to tools.
- Tool use implies action, action implies command, command implies specifying what to do under various conditions informed in part by an ordered history of previous actions - in other words, imperative programming.
- Many tools interact with real networks or things locally conceived of as networks before completion - they have an asynchronous character.
- When you want to do things, think in terms of Urbit "threads".
Networks of autarkies:
- There is something in the spirit of Bitcoin, absorbed through recent exposure, that suggests the healthiest networks amount, paradoxically, to networks of little autarkies.
- A healthy network must be robust to failures of communication at every scale, including at the level of the "application".
- Most applications should be locally useful first.
- Networking an existing application amounts to specifying a lightweight and minimal protocol - as in all likelihood it will resist being changed.
- In principle, even different, independently developed tools that perform similar functions might be networked by the same protocol.
- Applications that must be networks first are typically nothing more than such protocols.
Legible state and practical backup:
- State should be legible and portable as conventional files and directories - easy to move in and out of Urbit as they are understood on a conventional Unix filesystem.
- It should be easy and trivial to back up your state to "Earth" - the world of normal conventional computing as it exists today - whether to your local machine or to a server.
- This informs the tarball filesystem-as-state design, the S3 integration (which provides the practical bridge for backing up state to conventional cloud storage), and the web-based file browser at /master/ball which lets you view, upload, download, and manage state as ordinary files.
Recoverable processes (not yet implemented):
- All active processes should be recoverable directly from state, even state imported from Earth in a normal directory structure.
- If you restore a backup, your alarms, scheduled tasks, and ongoing processes should resume - not just your data.
Sandboxing via namespace (not yet implemented):
- Terminal-based agentic LLMs think of themselves as living somewhere in a namespace and running commands somewhere in a namespace.
- Anything on Urbit can be conceived of as a "poke" emitted by one location in a namespace and sent to another.
- A poke might simply climb to the nearest ancestor and then descend to its destination.
- Security becomes: for each node in the namespace, specify what destinations can be emitted up through that node.
urbit-master
App Features (outside MCP)
Claude Chat Interface
- Multiple persistent conversations
- Conversation branching (explore alternative discussion paths from any message)
- Real-time streaming responses via SSE
- Context management with character/token caps
- Interrupt button for in-flight requests
- Tool approval UI (approve/deny/always-allow)
- Rate limiting retry/backoff
Credential Management
- Stores API keys/secrets for Claude, Telegram, S3, Brave
- Configurable via HTTP endpoints
State Browser (/master/ball)
- Web-based file browser for viewing state as directories and files
- Upload files, create folders, create symlinks
- Download any directory as a tarball
- Delete files and folders
MCP Server
- Exposes an MCP (Model Context Protocol) endpoint at /master/mcp
- Allows Claude to call Urbit-native tools during conversation
MCP Tools
External Integrations
- Telegram notifications
- Web search via Brave API (not working at this commit)
Scheduled Automation (Alarms)
- Schedule any MCP tool to execute at specific times
- Recurring executions with configurable intervals
- ISO-8601 duration support
Task Management (Open Loops)
- Create, close, reopen, delete tasks
- Labels/tags system
- Due dates (best-by)
- Search/filter by text regex, labels, state
Desk Management
- Commit desks and get version info
Miscellaneous
- Generate random values (integers, floats, booleans, UUIDs) (buggy)
System Information
With each API request, the system prompt includes live context:
- Ship name (e.g., ~zod)
- Current time in user's timezone (e.g., "Wednesday 2025-01-15 3:42pm EST")
- Chat ID
- Context window stats: messages in context / total, chars / max chars, truncation status
- Message ranges with ISO-8601 timestamps
- Open loops system guidance (how to use the task tracking tools)
Internal Structure
Core Infrastructure
lib/sailbox.hoon - Fiber-based agent wrapper
- Wraps standard gall agent
- Provides async/await pattern via fibers
- Handles SSE connections
- Why it matters: enables imperative async programming
lib/sailboxio.hoon - IO primitives for fibers
- State read/write operations
- HTTP request/response helpers
- Card emission
- Why it matters: the verbs available inside a fiber
lib/tarball.hoon - Filesystem-as-state
- Hierarchical file storage using axal
- Typed read/write with marks
- Path operations (mkdir, list, get, put)
- Why it matters: enables flexible state without complex migrations
The Agent
app/master.hoon - The main agent
- Surprisingly small because logic lives in libraries
- Defines: initial state, migration, process handler, SSE hooks
- Routes HTTP to appropriate handlers
- Why it matters: the entry point, shows how pieces connect
Routing & HTTP
lib/routes/master.hoon - Top-level router
- Dispatches GET/POST to sub-routers
- Authentication checks
- Why it matters: how requests find their handlers
lib/routes/claude.hoon - Chat interface routes
- Chat CRUD, message handling, streaming
- Why it matters: the main user-facing feature
What is MCP?
The Model Context Protocol (MCP) is an open standard introduced by Anthropic in November 2024 for connecting LLMs to external tools and data sources. It has since been widely adopted (OpenAI, Google, Microsoft, etc.).
The core idea: Before MCP, LLMs could call tools, but every integration was bespoke. MCP standardizes the interface so one tool implementation works with any MCP-compatible client (Claude Code, ChatGPT, custom agents, etc.).
How it works:
- JSON-RPC 2.0 over HTTP (or stdio)
- initialize - handshake, capability negotiation
- tools/list - client asks "what tools do you have?"
- tools/call - client invokes a tool with arguments, gets result
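As a rough sketch of what these calls look like on the wire, here is a minimal Python client speaking JSON-RPC to an MCP endpoint over HTTP. The URL and cookie mirror the urbit-master setup above; the tool name and arguments are purely illustrative, and a fully conforming client would send initialize before anything else.

```python
import requests

MCP_URL = "http://localhost:8080/master/mcp"
COOKIES = {"urbauth-~sampel-palnet": "your-cookie-value"}  # ship auth cookie

def rpc(method, params=None, msg_id=1):
    """POST one JSON-RPC 2.0 request and return the parsed response."""
    body = {"jsonrpc": "2.0", "id": msg_id, "method": method}
    if params is not None:
        body["params"] = params
    return requests.post(MCP_URL, json=body, cookies=COOKIES).json()

# What tools does the server expose?
print(rpc("tools/list"))

# Invoke one by name with arguments (names here are illustrative).
print(rpc("tools/call", {"name": "send_telegram_notification",
                         "arguments": {"message": "hello from MCP"}}))
```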
In urbit-master:
The MCP implementation is split into two files:
lib/mcp.hoon - Protocol adapter
- Converts tool definitions to MCP JSON format
- Handles JSON-RPC routing (initialize, tools/list, tools/call)
- Delegates actual execution to lib/tools.hoon
lib/routes/mcp.hoon - HTTP endpoint
- Parses incoming JSON
- Calls handle-request:mcp
- Returns JSON response
The key insight: lib/mcp.hoon knows nothing about what tools exist. It just translates between MCP protocol and the generic tool interface in lib/tools.hoon. Add a tool there, and MCP picks it up automatically.
Tools
lib/tools.hoon - Tool definitions
- Protocol-agnostic tool registry
- Each tool: name, description, params, handler
- Why it matters: single source of truth for what LLMs can do
Data Types
sur/claude.hoon - Chat structures
- Message, chat (versioned), tool request/result
- Branching strategy documented in comments
- Why it matters: the shape of persistent data
Feature Libraries
lib/claude.hoon - Claude API client
lib/alarms.hoon - Scheduled execution
lib/open-loops.hoon - Task management
Adding New Tools (for developers)
To add a new tool, you only need to edit lib/tools.hoon. No routing or MCP changes required.
Step 1: Add to all-tools list
:* 'my_tool_name'
'Description of what the tool does'
%- ~(gas by *(map @t parameter-def))
:~ :- 'param1'
^- parameter-def
[%string 'Description of param1']
:- 'param2'
^- parameter-def
[%number 'Description of param2']
==
~['param1'] :: required parameters
tool-my-tool-name
==
Step 2: Write the handler arm
The handler signature is:
++  tool-my-tool-name
  ^-  tool-handler
  |=  arguments=(map @t json)
  =/  m  (fiber:io ,tool-result)
  ^-  form:m
  ::  ... implementation ...
  (pure:m [%text 'result'])  :: or [%error 'message']
Minimal example: tool-get-ship
++  tool-get-ship
  ^-  tool-handler
  |=  arguments=(map @t json)
  =/  m  (fiber:io ,tool-result)
  ^-  form:m
  ;<  =bowl:gall  bind:m  get-bowl:io
  (pure:m [%text (scot %p our.bowl)])
sailbox
sailbox is an agent wrapper library that enables async/await-style programming in Hoon. The name reflects its heritage: it began as a sandbox for developers to quickly learn Sail (Hoon's HTML templating). It was then adapted as a means of teaching Claude Sail, and soon incorporated ideas from "Grubbery" - an Urbit-thread-inspired asynchronous-process-based application model - to allow for rapid iteration on Sail-based reactive web applications, eventually becoming a general-purpose async agent wrapper. The name is likely to change in the future to more accurately reflect its current function.
The fundamental premise of sailbox - and Grubbery before it - is that "actions" want to be associated with some agent-like locus of control and yet also often want to be defined as segregated and interleaved asynchronous processes. Threads are Urbit-native asynchronous processes, but they are not associated with a locus of control and do not manage any persistent state directly.
The Problem sailbox Solves
A standard Gall agent handles events one at a time. When you poke an agent, it can emit cards (effects) and update its state, but it cannot wait for a response before continuing. If you need to make an HTTP request and then do something with the result, you must split your logic across multiple event handlers - ++on-poke, ++on-arvo, ++on-agent, ++on-watch, ++on-leave - manually tracking what you were doing when each response arrives.
This becomes unwieldy quickly. A simple workflow like "fetch JSON from an API, parse it, update state, send a notification" might span three or four event handlers with intermediate state to track progress.
The Fiber Pattern
sailbox introduces "fibers" - lightweight processes that can wait for input and continue where they left off. A fiber is essentially a strand (from Urbit threads) adapted to manipulate agent state directly. It is a computation that may need to wait for external input (an HTTP response, a timer, a poke-ack) before continuing.
The key is a continuation monad that lets you write what looks like sequential code:
fetch data from API
parse response
update state
send notification
Each step that requires waiting returns a continuation. When the awaited input arrives, the fiber continues exactly where it left off, with all local bindings intact.
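The pattern will be familiar if you have written async code elsewhere. As an analogy only - this is Python's asyncio, not sailbox's API - the workflow above reads as ordinary sequential code even though every await is a suspension point:

```python
import asyncio

# Stand-ins for awaitable I/O; in sailbox these are fiber operations that
# suspend the fiber until Eyre, Behn, or another agent responds.
async def http_get_json(url):
    await asyncio.sleep(0.1)  # pretend network latency
    return [{"name": "alpha"}, {"name": "beta"}]

async def send_notification(text):
    await asyncio.sleep(0.1)
    print("notified:", text)

state = {}

async def workflow():
    data = await http_get_json("https://api.example.com/items")  # fetch from API
    parsed = [item["name"] for item in data]                      # parse response
    state["items"] = parsed                                       # update state
    await send_notification(f"fetched {len(parsed)} items")       # send notification

# Each `await` marks where a fiber would return a continuation and later
# resume with its local bindings intact.
asyncio.run(workflow())
```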
What You Implement
To write a sailbox agent, you implement a handful of arms:
- ++process: The main fiber that handles incoming pokes. This is where your agent's logic lives, written in the fiber style.
- ++initial: The starting state for a fresh agent.
- ++migrate: Transform old state to new state on agent upgrade.
- ++make-sse-event and ++first-sse-event: Generate SSE events for connected clients.
- ++on-peek: Handle scry requests.
The ++agent function in sailbox wraps your implementation into a standard Gall agent, handling all the event routing and fiber scheduling internally.
The I/O Vocabulary
sailboxio provides the operations available inside a fiber:
State access: Read and replace the agent's state (a tarball).
Agent communication: Poke other agents, subscribe to paths, handle responses.
Arvo operations: Set timers, read from Clay, make HTTP requests.
HTTP responses: Send simple payloads, emit SSE events.
Tarball operations: Create directories, add files, delete entries.
When a fiber needs to wait for input, it returns a continuation - the next function to call when input arrives. The sailbox wrapper agent stores this continuation and invokes it when new input comes in.
Process Scheduling
Multiple fibers can be active simultaneously. sailbox maintains a "pipe" of running processes, each identified by a unique ID. When one fiber is waiting, another can make progress.
A mutex ("boar") prevents race conditions when fibers need exclusive access to shared state. A fiber can claim the mutex while performing a critical section, blocking other fibers from running until it releases.
HTTP and SSE
sailbox handles HTTP requests by spawning a fiber for each request. The fiber runs until it produces a response or fails. SSE connections are tracked separately, with a keep-alive timer and automatic cleanup.
sailbox extends the standard gall card type with two additions: %simple-payload for responding to HTTP requests and %sse for emitting events to connected SSE clients. %simple-payload is the fundamental mechanism by which fibers respond to web requests - it powers the Claude chat interface, the MCP API, the file browser, and all other HTTP endpoints. %sse enables real-time streaming updates for chat responses, progress indicators, and live state changes.
tarball
tarball is a filesystem-as-state abstraction. Instead of defining rigid structures for agent state, tarball represents state as a tree of directories and files - exactly like a conventional filesystem.
Here, too, the name reflects its origins: it was originally just a way to upload files and directories over the web and download directories as tarballs. That functionality is still there - the file browser at /master/ball lets you do exactly this. The library evolved as I began experimenting with making the entire state of an agent resemble a traditional Unix directory structure. Like sailbox, the name is likely to change in the future to more accurately reflect its current function.
The Core Idea
Agent state is an (axal lump) - a recursive tree structure where each node can contain files and subdirectories. Each file is a cage (a mark and a vase), wrapped with metadata like modification time. This means state can hold any typed Hoon data, organized hierarchically.
Why Filesystem Semantics?
Legibility: State is visible as paths and files. You can browse /chats/abc123/messages rather than inspecting opaque noun structures.
Portability: Export state as a tarball, back it up, move it to another ship, or restore from a backup. The file browser at /master/ball makes this tangible.
Graceful migration: Adding new features often means adding new paths. Existing paths remain valid. Individual data types can be migrated at the mark level - type migration happens per-mark rather than as a monolithic state migration. No need for complex version numbers and migration functions.
Mark integration: Files are cages, so the mark system handles serialization. A .json file is a %json cage; an image is %mime. The conversion machinery already exists.
The ++ba Interface
tarball provides a door (++ba) for manipulating the state:
Basic operations: get, put, del, has - read, write, delete, and check existence of files at paths.
Directory operations: mkd (make directory), lop (delete subtree), dip (descend into subdirectory).
Listing: lis (list files in directory), lss (list subdirectories).
Bulk operations: tap (flatten to list), gas (insert list), run (apply function to all files).
Typed access: got-cage, got-file, got-cage-as - retrieve files with expected types, crash if wrong.
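As a loose analogy (Python, not Hoon - the real door operates on an axal of cages and validates with marks), the shape of this interface is roughly path-keyed access into a nested dictionary:

```python
# A toy filesystem-as-state: directories are dicts, files are leaf values.
# This only mirrors the shape of the interface, not tarball's actual API.
state = {}

def put(tree, path, value):
    """Write a file at a path, creating intermediate directories."""
    *dirs, name = path
    for d in dirs:
        tree = tree.setdefault(d, {})
    tree[name] = value

def get(tree, path):
    """Read the node at a path, or None if missing."""
    for p in path:
        if not isinstance(tree, dict) or p not in tree:
            return None
        tree = tree[p]
    return tree

def lis(tree, path):
    """List entries in a directory."""
    node = get(tree, path) or {}
    return list(node) if isinstance(node, dict) else []

put(state, ["chats", "abc123", "title"], "groceries")
print(get(state, ["chats", "abc123", "title"]))  # "groceries"
print(lis(state, ["chats"]))                     # ["abc123"]
```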
Symlinks
tarball supports symlinks (road type) - either absolute paths or relative paths (with parent-directory traversal). This is primarily about completeness with respect to Unix-native tarballs: if you want to import and export real tarballs, you need to handle symlinks.
Validation
When you put a cage, tarball validates it using the mark system. If the mark has a dais, the value is normalized. This catches malformed data at write time rather than read time.
Integration with sailbox
sailboxio provides fiber-aware wrappers for tarball operations: put-cage, put-file, mkd, del, lop. These access the current state via get-state and update it via replace.
The combination means: your agent's state is a filesystem, your agent's logic is written as sequential async code, and the two connect through a consistent interface.
This concludes our summary of the fundamentals - not everything was covered in detail, but enough to orient further exploration. We are encouraged by the progress so far and believe there is more latent in these ideas than we have yet extracted. Personal AI on Urbit - AI that serves you, runs on your machine, and answers to no one else - remains a compelling vision, and this work is our attempt to move toward it.
clurd is stable and ready for use. urbit-master remains experimental and under active development, but functional for daily use and experimentation at the documented commit. Both repos are available for direct use and further independent development.
- clurd: github.com/niblyx-malnus/clurd
- urbit-master: github.com/gwbtc/urbit-master
Questions, feedback, and contributions are welcome.