StacyVM · live

The secure operating system for AI agents

Runs on your own infrastructure — private cloud, on-prem, or air-gapped. The agent OS for organizations that can't put their work in someone else's cloud. Nothing leaves their boundary.

Personal-first Permissioned Logged Approval-gated Compliance-ready
THE SHIFT

The bottleneck is no longer generation — it is trust.

Most companies will run AI agents in a vendor's cloud. Defense, banks, hospitals, and government legally can't put operational data there.

The centralized cloud "company brain" is disqualified before the first demo. Whichever agent OS those organizations adopt has to run on infrastructure they already own.

Generating software got cheap. Keeping it trustworthy didn't.

Once agents start producing real operational work, the only useful questions are about trust:

  • Who created this knowledge?
  • Can this agent use it?
  • Can it be revoked?
  • Can another machine trust it?
  • Can compliance audit it?

StacyOS is the secure agent OS built for those questions — running on the customer's own infrastructure, never on someone else's cloud.

StacyOS makes agent operations legible — without leaving the customer's boundary.

Meetings, tickets, workflows, dashboards, generated apps, operational decisions, and agent outputs become signed, permissioned artifacts the customer owns. Audit-ready by design. Compliance-ready by default.

01

AI agents now touch your most sensitive surfaces.

Inside regulated organizations, agents already read, call, and write across:

  • customer records
  • production systems
  • privileged credentials
  • internal documents
  • code repositories
  • tickets and decisions
  • model endpoints
  • third-party tools
02

Nothing about that activity is signed, permissioned, or auditable.

There is no canonical record of:

  • who authorized the run
  • which model was called
  • what data was read
  • which boundary it crossed
  • who approved the action
  • what was written or sent
  • where the evidence lives

Compliance teams reconstruct from scattered logs. Auditors are asked to trust screenshots.

03

StacyOS turns every agent action into permissioned, signed, auditable evidence.

Every run carries:

  • verified identity
  • policy decision before execution
  • model boundary recorded
  • data lineage with hashes
  • human approval as signed event
  • tamper-evident receipt chain
  • audit export the customer owns
// BEFORE

Fragmented operational intelligence

AI dashboard who made this? support ticket still valid? generated app trusted version? Slack message lost context workflow agent allowed? sales report duplicated
// AFTER

Coordinated through StacyOS

AI dashboard verifiable support ticket permissioned generated app composable Slack message reusable workflow revocable sales report auditable AI agent output provenance verified workflow execution audit receipted shared object consent enforced revoked artifact denied on next read
// what gets recorded

Eight primitives. One audit trail you own.

Compliance-ready, not automatically compliant. Evidence is generated as work happens; framework mapping is your auditor's call.

Identity
Agent, user, install, system, key-derived identity, delegated authority.
Authorization
Allowed / denied decision before data access, model call, tool call, write, send, or share.
Data lineage
Inputs, files, systems, objects, hashes, freshness, source labels.
Model provenance
Model / provider / version, deployment boundary, retention / training policy, prompt / response hash.
Human approval
Approval, denial, escalation, override, revocation, reviewer identity.
Execution history
Commands, tools, files, network boundaries, outputs, tests, cost, timing.
Tamper evidence
Hashes, signatures, receipt chains, external verifier package.
Audit export
Framework-mappable evidence packages customers own inside their boundary.

The future company is a coordination system.

For organizations whose operational data cannot leave their boundary — defense, financial services, healthcare, government, energy, pharma, high-frequency trading — AI-native means more than running models inside a VPC. It means an operating system that turns every agent action into permissioned, signed, auditable evidence.

Today, software coordinates humans. Tomorrow, agents coordinate humans, systems, and each other — inside the customer's network, under their policies.

The AI-native company is not just searchable. It runs a governed self-improvement loop.

Observe → Compare → Correct → Verify → Remember. Every interaction, ticket, workflow, dashboard, generated app, and operational decision feeds the next agent run with verified, signed, permissioned context. Humans supervise high-stakes and novel decisions — the system does not silently rewrite the company.

// the recursive AI loop

The future enterprise runs a closed feedback loop.

recursive
AI loops
hover a step
01
operational signals
raw inputs from people, systems, and sensors
02
workflows
deterministic processes that move work
03
policy systems
rules and guardrails agents must obey
04
deterministic tools
trusted functions agents are allowed to call
05
agent actions
what AI actually decides to do
06
evals
Compare: actual work vs. policy, controls, goals, and evals.
07
audits
verifiable record of what happened
08
learning
feeds back into signals and policy
01operational signalsraw inputs from people, systems, and sensors
02workflowsdeterministic processes that move work
03policy systemsrules and guardrails agents must obey
04deterministic toolstrusted functions agents are allowed to call
05agent actionswhat AI actually decides to do
06evalsCompare: actual work vs. policy, controls, goals, and evals.
07auditsverifiable record of what happened
08learningfeeds back into signals and policy
before

Humans route information between teams and tools.

after

Operational intelligence coordinates itself — across agents, humans, and machines.

StacyOS is the secure agent OS for organizations whose work cannot leave their boundary — every action permissioned, logged, and signed into an audit trail they own.

StacyVM StacyVM · live
The trusted execution layer of Stacy. Installable today.

StacyVM is the live execution layer of Stacy — self-hosted sandbox infrastructure for AI agents, with verifiable worker identity and signed Execution Receipts. Every AI action needs a safe place to run and a trustworthy way to prove it happened. StacyVM gives both.

As generated software becomes disposable and regeneratable, StacyVM gives every generated action a safe runtime and every execution a verifiable receipt.

Run generated code, agents, workflows, and previews in isolated disposable environments with verifiable worker identity. Choose your isolation: Firecracker microVMs (~28ms boot), Docker (runc, gVisor, or Kata), PRoot for restricted hosts, E2B, or a custom provider — same API across all. Runs in your own private cloud, on-prem, or air-gapped.

Open source. Self-hostable. Python and TypeScript SDKs. Live preview URLs. React web dashboard and a stacyvm tui terminal cockpit. Installable today.

01
Isolated sandboxes
Firecracker microVMs, Docker, gVisor, Kata, PRoot — same API, swap providers without rewrites.
02
Live previews
Spin up a generated app and get a public preview URL in seconds. Click, see, approve.
03
Worker identity
Every execution attestable to a verified worker. No bearer-token guessing.
04
Pool mode economics
100 users on ~20 VMs instead of 100. Cost bounded by the box, not the workload — ~60% less infrastructure vs. per-user provisioning.
05
Web Dashboard
React control surface — sandbox management, live terminal, file browser, logs. Ships with the OSS install.
06
Terminal cockpit (TUI)
stacyvm tui — full-screen terminal UI for operators who'd rather stay in the shell. Same primitives as the dashboard.
07
Signed Execution Receipts
Every action StacyVM runs produces a signed receipt — worker identity, timestamp, content hash, provenance graph entry. The next agent, the next team, the next system can verify what happened without trusting you. Sandbox infra logs. Stacy receipts attest.
$npx stacyvm-setup@latest
Self-hosted by default. Runs in your private cloud, on-prem, or air-gapped. Or use ./stacyvm serve.
the architecture

Customer-owned by design.
Audit-ready by architecture.

StacyOS makes every agent action permissioned, signed, and audit-ready — so each output an agent produces can be trusted, reused, and proven across the customer's own infrastructure. No central runtime. No shared tenant. The audit record lives where the work lived.

Hover or tap any component to see its current status, what it does, and where to find its code and docs.

// the seven-layer model

The OS stacks as seven layers. L0 is live. L1–L6 are designed.

  1. L0 Runtime Kernel Stacy Sandbox LIVE
  2. L1 Policy & Entitlements Stacy Policy designed
  3. L2 Model & Credential Governance Stacy Guard designed
  4. L3 Provenance & Evidence Spine Stacy Evidence designed
  5. L4 Evaluation & Corrective Loop Stacy Correct designed
  6. L5 Integration & Deployment Fabric Stacy Fabric designed
  7. L6 Federation & Applications Stacy Sync + Console designed

// invariant Evidence and Knowledge Objects live on the customer's infrastructure — never on Stacy Cloud. The customer owns the audit record.

Tier 01 A single Stacy install on one person's machine.
// stacy · install · single person StacyVM · live
A Arpanperson identity
Stacy CLI
terminal cockpit
Control Panel
visual cockpit
Agent / API
programmatic
Intent Compiler // intent → runnable plan
StacyVM
trusted execution · phase 14
01
Deterministic execution
02
Agent runtime
03
Live preview
04
Worker identity
05
Provenance
06
Provider abstraction
Stacy Sync
Coordination Intelligence Layer
shared schemas
provenance graph
permissions
compatibility
registry
receipts
Generated Software
object family · 01
App specs, modules, schemas, dependency graphs, deployment records — everything the agents build.
Personal Object Store
object family · 02
Knowledge Objects — conversations, decisions, generated artifacts, workflows, operational context, and machine-readable enterprise intelligence owned by this install.
// external world · reached via attested calls · every call signed and receipted
LLM Providers Anthropic · OpenAI · Local
MCP Servers Model Context Protocol
External APIs third-party services
Sovereignty Layer · wraps everything above
identity RBAC consent audit retention revocation
// public synchrony — research direction
optional substrates for cross-organization verification · explored, not shipped
Local-first default · no public substrate Every Stacy install runs sovereign by default. Public substrates are opt-in, per workload.
Data availability Celestia · Avail · research For workloads that need verifiable, censorship-resistant publication of app or context commitments.
Verifiable compute EigenCloud · TEEs · research For execution that needs cryptographic attestation beyond a single trusted host.
IP & provenance Story · registry · research For generated apps and Knowledge Objects that need public, programmable IP records.
// stacy is substrate-neutral · adapters are explored per workload, never required

Public coordination substrates are optional and workload-specific. StacyOS is sovereign and local-first by default.

Tier 02 Mini-brains federate into a coordination fabric.

The federation model · powered by Stacy Sync (research stage)

// stacy · federation · customer-owned boundary tier 02 · coordination mesh

Ten installs become ten mini-brains. Each person owns theirs. They federate through Stacy Sync — opt-in, granular, revocable end-to-end — into a coordination fabric on the customer's own infrastructure. The same protocol holds as the team grows. Same substrate. No central tenant. No vendor lock-in.

Most enterprise AI systems are building company memory.
StacyOS is the secure agent OS that lets that work be shared without leaving the customer's boundary.

+ 6 more
A
mini-brain
Arpan
M
mini-brain
Meera
R
mini-brain
Rohan
S
mini-brain
Sana
Coordination Fabric
tenant · stacy / acme
// scale arc · same substrate at every scale · long-term vision
SOLO
1 install
SMALL TEAM
long-term vision
DEPARTMENT
long-term vision
LARGER ENTERPRISE
long-term vision
Same substrate · Same primitives · Same consent model · No central tenant. Today's installs are solo-scale; the architecture is sized for the rest.
// the unit test for customer-owned federation

Two installs. Consent. Revocation. Audit chain.

One signed Knowledge Object. One consent grant. One revocation. One verifiable audit chain. No shared server. No centralized record.

N=2 is where federation becomes real: two installs can't cheat with a shared database, so every operation goes through the protocol. The cost is O(1) and pairwise — adding install number k needs no global consensus. Two installs are the unit test. Ten are a team. A hundred are a department. The same substrate scales beyond, on the customer's own infrastructure.

// protocol primitives Ed25519 identities canonical JSON content-addressed objects per-object consent grants producer-signed revocation tombstones replay-resistant messages hash-chained receipts signed verification reports
Working demo today · ~25s end-to-end · reproducible 3/3 Read the complete demo report
01 / OPT-IN
You choose what federates.
Default scope is personal. Federation requires explicit consent — per object, per connector, per rule.
02 / PROVENANCE
Every fact traces home.
Each Knowledge Object is signed by the worker that produced it. The collective never loses origin.
03 / REVOCABLE
Changed your mind?
Revocation propagates end-to-end. Consumers see it on next read. Receipts log the change.
Tier 03 What Stacy delivers.

When StacyOS lives in your network, every agent in your organization runs under one substrate.

For a bank, a hospital, a defense prime, or a government agency: once StacyOS is installed inside your VPC, on-prem, GovCloud, or air-gapped enclave, your developers, analysts, support team, clinicians, traders, and case officers stop running disconnected AI tools. Their agents start running under one identity model, one authorization policy, one approval workflow, one audit trail.

Over time the system accumulates company intelligence inside your boundary — not a "memory" in someone else's cloud, but a permissioned, signed, revocable graph of your own decisions, your own evidence, your own Knowledge Objects. The substrate your auditors, your security team, and your next-generation agents all draw from.

The durable asset is not the generated app or the agent of the week.

The durable asset is what stays on your infrastructure:

  • your identity model
  • your authorization policies
  • your data classification labels
  • your model boundary rules
  • your approval workflows
  • your execution history
  • your evidence chain
  • your Knowledge Objects

Generated agents and generated apps can be regenerated. The permissioned company intelligence they ran on cannot.

What you get when StacyOS lives in your network
Every agent runs under one permission model.
hover for detail
Company intelligence accumulates inside your boundary.
hover for detail
The audit trail lives on your infrastructure.
hover for detail
Cross-team & cross-organization sharing by consent.
hover for detail

Every output attested, every action audited, every object owned — inside the customer's boundary, under the customer's policies. StacyOS is the secure substrate your AI lives on.

stacy — live
build status

What ships today. What's next.

Stacy unfolds one layer at a time. StacyVM is live and self-hostable. Stacy CLI is in active development. Stacy Sync is research with three published papers. Federation demo working today — ~25s end-to-end, reproducible 3/3.

StacyVMruns AI work safely.
Stacy Synccoordinates the knowledge that work produces.
StacyOScombines execution, coordination, provenance, consent, revocation, and audit into one enterprise coordination substrate.

StacyVM

// trusted execution layer
Live · self-hostable

The execution layer of Stacy. Self-hosted sandbox infrastructure for AI agents — multi-provider runtime (Firecracker, Docker, gVisor, Kata, PRoot, E2B, or custom), live preview URLs, verifiable worker identity, Python and TypeScript SDKs, React web dashboard, and a stacyvm tui terminal cockpit. Live and installable today.

Stacy CLI

// intent layer entry point
In active development

Trust-first control plane for assigning, supervising, and auditing AI agents. Every agent has an owner, a budget, a workspace, a run history, logs, approvals, and a visible stop button. Connect your own Codex or Claude, assign tasks, watch live runs, track cost and risk, gate risky actions behind approvals. Built with Git worktree isolation per task, secrets redaction, local sandbox defaults, and full backup/restore tooling. Public command surface in active development.

Stacy Sync

// the Coordination Intelligence Layer
Research stage · 3 papers

The Coordination Intelligence Layer. The layer that coordinates everything Stacy generates. Generated software and Knowledge Objects become signed, content-addressed objects — each carrying provenance, ownership, content hash, and signature. Objects move under per-object consent, are revocable at read time, and leave hash-chained receipts. Permission lives in the object lifecycle, not a platform access table.

Contains: Knowledge Objects · consent · federation · receipts · synchrony.

A Knowledge Object is a signed operational artifact — meeting summary, dashboard, workflow, ticket, generated app, report, or agent output — carrying provenance, permissions, ownership, consent rules, and audit history.

The federation primitive — two installs sharing a signed Knowledge Object by consent, revoked end-to-end in ~25s, reproducible 3/3 — is a working demo today; public release in 90 days.

research

Three papers behind the architecture.

Stacy sits on top of published research. The operating system framing, the synchrony layer protocol, and the public synchrony infrastructure each have their own paper.

The research behind StacyOS is the trust, provenance, and evidence substrate for AI agents in regulated, sovereign environments: signed objects, per-object consent, revocation, hash-chained receipts, and federation that doesn't require a central tenant.

// pick your language

Drop in, in two lines. Python or TypeScript.

One client. Same calls both sides. Self-hosted compute sandboxes for AI agents — spawn, exec, destroy.

$pip install stacyvm
from stacyvm import Client

client = Client("http://localhost:7423")

sandbox = client.spawn(image="python:3.12")
result = sandbox.exec("python3 -c 'print(40 + 2)'")
print(result.stdout)        # "42\n"

sandbox.destroy()
$npm install stacyvm
import { Client } from "stacyvm";

const client = new Client("http://localhost:7423");

const sandbox = await client.spawn({ image: "node:20" });
const result = await sandbox.exec("node -e 'console.log(1+1)'");
console.log(result.stdout);   // "2\n"

await sandbox.destroy();

Built by a small team. Open to design partners. Reach us at accesstoarpan@gmail.com