.nai-sec{padding:5.5rem 0} .nai-alt{background:var(--card)} .nai-grid{margin-top:2.6rem} .nai-cta-row{display:flex;gap:1rem;flex-wrap:wrap;margin-top:2.6rem} .nai-code{background:#0d0d10;color:#e8e8ea;border-radius:var(--rl,16px);padding:1.6rem 1.8rem;overflow-x:auto;font-family:'JetBrains Mono',ui-monospace,monospace;font-size:13.5px;line-height:1.75;margin-top:2.2rem;border:1px solid #20202a} .nai-code .c{color:#7c7c86}.nai-code .s{color:#f9a35a}.nai-code .k{color:#ff8a3d} .nai-sec .section-sub a{color:var(--orange);text-decoration:none}

NYMPH AI — production-grade local intelligence that runs on your hardware

93.1%
GSM8K · Math
90.2%
HumanEval · Code
4.5B
Parameters
OpenAI
Compatible API
The model

A small model that punches far above its size.

4.5B parameters. Runs on one 24 GB GPU or a 16 GB Mac — no datacenter. A drop-in OpenAI-compatible API, so your existing apps and SDKs work unchanged. Private by design: your data never leaves your machine.

Best-in-class, small

Competes with 10–15× larger models in math, code and reasoning — at a fraction of the cost and power.

OpenAI-compatible

Point any OpenAI client at NYMPH. Zero code changes. Chat, tools and streaming, all supported.

Private & local

Run it on your own hardware. No cloud dependency, no per-token surprises, no data exposure.

Agent-ready

A multi-phase harness with tools, retrieval and verification — built for real agentic work, not just chat.

Quickstart

If you've used OpenAI, you're done.

Get a key at nymphtech.com/access, point your base URL at NYMPH, and call it like any OpenAI endpoint.

# pip install openai
from openai import OpenAI

client = OpenAI(
    base_url="https://api.nymphtech.com/v1",
    api_key="nymph_live_...",   # from nymphtech.com/access
)

resp = client.chat.completions.create(
    model="nymph-4.5b",
    messages=[{"role": "user", "content": "Refactor this function for readability."}],
)
print(resp.choices[0].message.content)
The mind + where it lives

NYMPH AI is the intelligence. NYMPH S-Quantum is its home.

Run NYMPH AI in the cloud through our API today — or on your own machine with the NYMPH S-Quantum card: persistent memory, full privacy, single-digit watts, always on. Same model, same API, your hardware.