A complete AI computer on a single PCIe card. Private, dedicated, always on.

In the cloud, or fully disconnected. Nothing leaves your machine.
Run new SSM/RWKV language models on silicon, at constant memory.
Audio, memory, retrieval and perception move to dedicated silicon.
64 GB persistent state. Your AI remembers — it never starts from zero.
Production-grade local AI, including NYMPH AI — plus an open toolchain to run and supercharge any open-source model. Bigger transformers run on your GPU under orchestration; the rest runs on the card.
A patent-pending orchestration layer coordinates the card and your host —CPU, RAM, GPU— as one system. NYMPH carries memory, retrieval and perception; your GPU keeps 100% of its power for what it was bought for.
One OpenAI-compatible local endpoint and a native MCP server. Point your tools at NYMPH and they just work — now private, persistent and offline.
What NYMPH adds to any of them: persistent memory · private local RAG · offline fallback · a freed GPU. It also makes any open-source AI run better — local, private and always on.
Your whole repo lives on the card: persistent project memory, private local RAG, decisions remembered, commands policy-gated. It stops re-reading your codebase every session — and your code never leaves the machine.
On-card skill routing, persistent agent memory and a safety gate. Your agent remembers across runs, validates its own actions, and keeps working offline — at near-zero cost.
Point Codex at a local OpenAI-compatible endpoint: private code and context, no per-token bill, and an offline fallback when you want it. Same workflow, on your hardware.
On-device AI for the browser and apps: local transcription, summarization, search and memory — fast, private, nothing sent out.
Give ChatGPT, Claude or any model a private local memory, an offline fallback, and lower cost. Your assistant stops forgetting.
A resident coding engineer via MCP: your repo indexed on-card, private RAG, every command policy-gated in under 2 ms. Code never leaves your machine.
Real NPCs that come alive — characters that see, hear, remember you and react in real time, on dedicated silicon so your GPU keeps 100% for the game. Build AI-native games, live voice and overlays — all local.

Early access — August–September 2026. Pre-orders open closer to launch. Join the list to be notified.
Coming soonPerformance figures are based on component-level benchmarks and pre-production engineering data; system performance varies with host and workload. Target MSRP; final pricing set at general availability. Specifications subject to change. NYMPH and S-QUANTUM are trademarks of Punky Tiger Labs, Inc. DX-M1 is a product of DEEPX Co., Ltd. RK3588 is a product of Rockchip. © 2026 Punky Tiger Labs, Inc.