Agent Cache· Forge

In-memory data, on every core.

Forge is an in-memory store rebuilt from a blank page in Rust for hardware with dozens of cores. It runs across all of them instead of one, speaks a lean binary protocol over raw TCP, or shared memory for programs on the same box, and still understands the commands your existing Redis tools already use.

300+ operations, Redis-class

3 transports: TCP, UDP, shared memory

All cores not single-threaded

Request access All core systems

Why Forge

The fastest stores still run on one core.

Redis and its forks are single-threaded and speak a text protocol designed for 2009-era hardware, so they leave most of a modern server idle and pay parsing cost on every request. Even a client on the same machine has to go through a network socket.

Forge keeps the familiar feature set but removes the structural limits: it uses every core, talks a compact binary protocol, and lets co-located programs skip the network entirely.

Capabilities

What it can do.

A full data-structure toolkit

Strings, counters, hashes, lists, sets, sorted sets, streams with consumer groups, and pub/sub, plus probabilistic cardinality and bitmaps, around three hundred operations in all.

Drop-in Redis compatibility

Any existing Redis client or tool, including redis-cli, connects unchanged. The binary protocol is the fast path; Redis is the compatibility path, on one engine.

Three transports, one engine

Binary over TCP for remote clients, UDP for fire-and-forget writes, and a lock-free shared-memory channel for same-machine programs that skips the network stack.

Server-side scripting

Run multi-step logic atomically next to the data with an embedded Lua engine and loadable function libraries.

Optional durability

Off by default for pure speed; switch on a checksummed write-ahead log plus background snapshots for crash recovery, with selectable fsync strength.

Secondary indexes

B-tree indexes on fields give exact-match and range lookups beyond the primary key, something standard Redis cannot do natively.

Transactions and queues

Optimistic-locking transactions and blocking pops for safe queues and work distribution.

Built-in web Studio

A management UI with a data browser, console, live dashboard, and Lua editor, plus an HTTP API and Prometheus metrics, embedded in the binary.

Built different

Why it is not just another cache.

Multi-threaded from line one

A work-stealing pool puts a thread on every core with per-thread sharding, where Redis and Valkey inherit a single-threaded design. It scales with cores instead of around them.
Shared memory for local clients

Same-machine programs talk over a lock-free shared-memory channel and skip syscalls and serialization entirely, a transport Redis simply does not have. A natural fit for an agent and its tools sharing one sandbox.
A binary protocol with no parsing

Fixed-layout frames keyed by a number, with native multiplexing over one connection, instead of text commands and pipelining workarounds.
Rust, no GC pauses

Memory-safe with lock-free structures and no garbage collector, so tail latency stays predictable instead of stalling.
Runs as a microVM image

A unikernel build boots Forge straight as a microVM with env-var config, made for per-sandbox agent infrastructure.

By the numbers

Engineered for microsecond access.

Architectural design targets, not yet independently benchmarked.

Sub-µsshared-memory path (target)

Single-µsTCP path (target)

Lua 5.4atomic server-side scripts

WAL + snapshotoptional durability

Agent-first

The hot-state layer for every sandbox.

Forge is where an agent platform keeps the state it touches constantly: session routing, idempotency keys, rate limits, short-lived locks, and working memory, all at microsecond access instead of a database round-trip. Because it runs as a microVM and offers a same-machine shared-memory channel, an agent reads and writes state with no network hop, and atomic transactions keep multi-step changes race-free.

Per-sandbox state with TTL expiry, so abandoned sessions clean themselves up.
Atomic counters and ephemeral locks over shared memory to coordinate concurrent actions.
Pub/sub and durable streams for fan-out events and agent work queues.

Put Agent Cache to work.

See HQ running in your own Slack or Teams, on the operating system we built for agents.

Request access Explore qOS