NYMPHDX1 — The AI card. Private, dedicated, always on.

What it does

Four things no GPU can give you.

Online & offline

In the cloud, or fully disconnected. Nothing leaves your machine.

Hardware-native models

Run new SSM/RWKV language models on silicon, at constant memory.

It frees your GPU

Audio, memory, retrieval and perception move to dedicated silicon.

Memory that lasts

64 GB persistent state. Your AI remembers — it never starts from zero.

What runs on it

One card. Every AI workload.

Language models

NYMPH AI — production-grade local model, on-card
Mamba, RWKV & SSM LLMs — native on the NPU
Llama, Qwen, Gemma & any open-source model, better

Vision

Detection · classification · segmentation
Monocular depth · OCR · pose
Thousands of FPS at single-digit watts

Speech & voice

Whisper speech-to-text, on-device
Neural text-to-speech
Real time, fully offline

Memory & retrieval

Embeddings · semantic search · reranking
Private RAG over your own data
Persistent state that survives reboots

Production-grade local AI, including NYMPH AI — plus an open toolchain to run and supercharge any open-source model. Bigger transformers run on your GPU under orchestration; the rest runs on the card.

GPU + NYMPH

Two processor classes. One compute fabric.

A patent-pending orchestration layer coordinates the card and your host —CPU, RAM, GPU— as one system. NYMPH carries memory, retrieval and perception; your GPU keeps 100% of its power for what it was bought for.

NYMPHDX1 architecture — dual AI engines, switch fabric, persistent memory

What doesn't exist today

Where data can't leave and context can't be forgotten.

An AI that remembers permanently. — not a chat, an identity. Memory that survives reboots and migrations.

A local digital person, 24/7. — hears, speaks, remembers, relates — nothing sent to the cloud.

Persistent agents at near-zero cost. — that live for days, months, years. No per-token bill.

Compatibility

Works with the AI you already use.

One OpenAI-compatible local endpoint and a native MCP server. Point your tools at NYMPH and they just work — now private, persistent and offline.

ChatGPT CodexClaude CodeOpenClawChromeCursorContinueLangChainOpen WebUIany OpenAI-compatible app

What NYMPH adds to any of them: persistent memory · private local RAG · offline fallback · a freed GPU. It also makes any open-source AI run better — local, private and always on.

For the tools you use

What NYMPH does for you.

Claude Code users

Your whole repo lives on the card: persistent project memory, private local RAG, decisions remembered, commands policy-gated. It stops re-reading your codebase every session — and your code never leaves the machine.

OpenClaw users

On-card skill routing, persistent agent memory and a safety gate. Your agent remembers across runs, validates its own actions, and keeps working offline — at near-zero cost.

ChatGPT & Codex users

Point Codex at a local OpenAI-compatible endpoint: private code and context, no per-token bill, and an offline fallback when you want it. Same workflow, on your hardware.

Chrome & everyday users

On-device AI for the browser and apps: local transcription, summarization, search and memory — fast, private, nothing sent out.

Who it's for

For everyone who uses AI — online or local.

Anyone using AI

Give ChatGPT, Claude or any model a private local memory, an offline fallback, and lower cost. Your assistant stops forgetting.

Developers

A resident coding engineer via MCP: your repo indexed on-card, private RAG, every command policy-gated in under 2 ms. Code never leaves your machine.

Gamers & creators

Real NPCs that come alive — characters that see, hear, remember you and react in real time, on dedicated silicon so your GPU keeps 100% for the game. Build AI-native games, live voice and overlays — all local.

Specifications

Single slot. Bus-powered. No extra cables.

31–56 TOPS

dedicated AI · INT8

13–20 W

bus-powered

64 GB

persistent memory

PCIe x16

Gen4 switch onboard

RK3588 + DX-M1

orchestration + 1–2× NPU

Windows · Linux

OpenAI-compatible API · MCP

Coming soon

Solo & Duo. Same platform, your scale.

DX1 SOLO

$590 USD

1× DX-M1 + RK3588 NPU · 31 TOPS
4 GB NPU · 64 GB persistent
~13 W · field-upgradable to Duo

DX1 DUO

$750 USD

2× DX-M1 + RK3588 NPU · 56 TOPS
8 GB NPU · 64 GB persistent
~18–20 W · parallel pipelines

Early access — August–September 2026. Pre-orders open closer to launch. Join the list to be notified.