The model you shipped last quarter doesn't know what happened last month.
Enterprise knowledge moves fast. Product documentation changes, internal policies are revised, new terminology emerges from customer conversations. Your model doesn't know any of it until the next retraining cycle completes.
A full retraining sprint means weeks of GPU time, ML team attention, eval coordination, and deployment windows. The overhead scales poorly as your knowledge base grows — and it has to repeat every time knowledge drifts.
In the gap between training cycles, your production model answers questions using stale context. For customer-facing deployments, that means wrong answers. For internal workflows, it means degraded accuracy on the queries that matter most.
Five steps. Continuous loop. Zero downtime.
The mechanism runs inside the 4MINDS platform. No separate fine-tuning infrastructure, training clusters, or deployment pipelines.
The eval gate runs configurable task benchmarks — domain-specific prompts from your actual use case. Pass threshold is set by your team. Failure means the shadow is discarded; production is unchanged. Every run is timestamped.
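For illustration, a minimal version of such a gate might look like the sketch below. `EvalGateConfig`, `shadow_passes`, and the benchmark names are assumptions for this sketch, not the 4MINDS API.

```python
from dataclasses import dataclass

# Hypothetical sketch of a team-configurable pass threshold; not the 4MINDS API.
@dataclass
class EvalGateConfig:
    pass_threshold: float = 0.90          # set by your team per deployment

def shadow_passes(benchmark_scores: dict[str, float], cfg: EvalGateConfig) -> bool:
    """True only if every domain-specific benchmark clears the configured threshold."""
    return all(score >= cfg.pass_threshold for score in benchmark_scores.values())

# Benchmark names are made up for the example. A failing shadow is discarded.
if not shadow_passes({"support_qa": 0.93, "policy_lookup": 0.88}, EvalGateConfig()):
    print("shadow discarded; production unchanged")   # fails on policy_lookup (0.88 < 0.90)
```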
Production swap creates a checkpoint. If a deployed model causes unexpected behavior, one command reverts to the prior version. History retained for audit.
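The shape of that guarantee, as a toy sketch: the `ModelRegistry` class and its methods are hypothetical illustrations, and the platform exposes this as a single command rather than a library.

```python
# Toy sketch of checkpointed swaps with instant rollback; not the platform's interface.
class ModelRegistry:
    def __init__(self, live_version: str):
        self.live = live_version
        self.checkpoints: list[str] = []  # rollback targets
        self.audit: list[str] = []        # every transition, retained for audit

    def swap(self, candidate: str) -> None:
        """Promote a candidate; the outgoing version becomes the rollback target."""
        self.checkpoints.append(self.live)
        self.audit.append(f"swap {self.live} -> {candidate}")
        self.live = candidate

    def rollback(self) -> str:
        """Revert to the prior version in one call."""
        prior = self.checkpoints.pop()
        self.audit.append(f"rollback {self.live} -> {prior}")
        self.live = prior
        return prior

registry = ModelRegistry("4minds-v2.4.1")
registry.swap("4minds-v2.4.2")            # deployed model misbehaves...
assert registry.rollback() == "4minds-v2.4.1"
```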
Ghost Weights training runs as a Kubernetes job inside your existing cluster. No separate GPU cluster or external training API. Air-gap compatible.
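For illustration, submitting such a job with the official Kubernetes Python client might look like the sketch below; the image name, namespace, and resource limits are assumptions, not 4MINDS defaults.

```python
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config() when running inside the cluster

# A training Job scheduled onto GPUs you already have; image and namespace are hypothetical.
job = client.V1Job(
    api_version="batch/v1",
    kind="Job",
    metadata=client.V1ObjectMeta(name="ghost-weights-train", namespace="ml"),
    spec=client.V1JobSpec(
        template=client.V1PodTemplateSpec(
            spec=client.V1PodSpec(
                restart_policy="Never",
                containers=[client.V1Container(
                    name="trainer",
                    image="registry.internal/ghost-weights-trainer:latest",  # hypothetical
                    resources=client.V1ResourceRequirements(
                        limits={"nvidia.com/gpu": "1"},
                    ),
                )],
            ),
        ),
    ),
)
client.BatchV1Api().create_namespaced_job(namespace="ml", body=job)
```

Because nothing in the job reaches out to an external service, the same pattern holds in air-gapped deployments.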
What a retraining sprint actually costs.
1. Detect model staleness via user complaints or accuracy drop
2. Scope and schedule a retraining sprint (1–2 weeks planning)
3. Collect, clean, and version training data (3–5 days)
4. GPU training run (1–7 days depending on model size)
5. Internal eval, regression testing, sign-off (1 week)
6. Coordinate deployment window and rollout (1–3 days)
How Ghost Weights replaces each step.
1. New data flows in automatically; no accumulation window needed
2. Shadow copy trains continuously; no sprint scheduling required
3. Data versioning handled by the platform; no manual prep
4. Training runs continuously in background; no dedicated window
5. Eval gate runs automatically on every candidate; no manual sign-off
6. Atomic swap with zero downtime; no deployment coordination (the full loop is sketched below)
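A minimal sketch of the loop those six items describe. Every name here is a hypothetical stand-in for a platform internal, not a 4MINDS interface.

```python
import random
import time

live_model = "4minds-v2.4.1"

def ingest_new_knowledge() -> list[str]:
    return ["new product doc"]          # 1. data flows in automatically

def train_shadow_copy(batch: list[str]) -> str:
    return "shadow-candidate"           # 2-4. background training on platform-versioned data

def eval_gate_passes(live: str, shadow: str) -> bool:
    return random.random() > 0.1        # 5. stand-in for the automatic benchmark gate

def atomic_swap(candidate: str) -> None:
    global live_model
    live_model = candidate              # 6. zero-downtime promotion

while True:
    candidate = train_shadow_copy(ingest_new_knowledge())
    if eval_gate_passes(live_model, candidate):
        atomic_swap(candidate)
    # a failing candidate is simply discarded; production is never touched
    time.sleep(3600)                    # no sprint scheduling, no deploy window
```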
Ghost Weights vs. the alternatives.
| Criterion | Ghost Weights | Manual Fine-Tuning | Static Model |
|---|---|---|---|
| Update latency | Hours | 3–8 weeks | Never |
| Downtime required | None | Yes (deploy window) | N/A |
| MLOps overhead | Minimal (automated) | High (sprint cycle) | None |
| Eval gate before swap | Yes (configurable) | Optional | N/A |
| Audit trail | Timestamped per swap | Manual record | None |
| Rollback capability | Instant | Manual redeploy | N/A |
| Air-gap compatible | Yes | Depends on infra | Yes |
Every model update is audited before it goes live.
The eval gate is not optional. It runs before every swap, applying the four checks below (composed in the sketch after the list).
- Benchmark task suite (domain-specific, configurable)
- Regression check: shadow must outperform live on benchmark
- Compliance flag detection (configurable keyword/pattern list)
- Manual approval gate (optional, for regulated environments)
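How the four checks might compose into a single verdict, as an illustrative sketch: the function and parameter names are assumptions, and the scores reuse the sample output shown next.

```python
# Illustrative composition of the four checks; all names are assumptions, not the 4MINDS API.
def eval_gate(shadow: dict, live: dict, compliance_flags: int,
              require_manual_ok: bool = False, manual_ok: bool = False) -> str:
    # `shadow` and `live` hold per-task scores from the benchmark task suite (check 1).
    if any(shadow[t] <= live[t] for t in live):   # check 2: shadow must outperform live
        return "FAIL"
    if compliance_flags > 0:                      # check 3: compliance flag detection
        return "FAIL"
    if require_manual_ok and not manual_ok:       # check 4: optional manual approval
        return "HOLD"
    return "PASS"

# Scores reused from the sample eval output shown below.
print(eval_gate({"domain_accuracy": 0.91, "regression_score": 0.95},
                {"domain_accuracy": 0.87, "regression_score": 0.94},
                compliance_flags=0))              # -> PASS
```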
```json
{
  "swap_id": "gw_20260429_0806",
  "model_from": "4minds-v2.4.1",
  "model_to": "4minds-v2.4.2",
  "eval_run_at": "2026-04-29T08:04:17Z",
  "eval_result": "PASS",
  "benchmarks": {
    "domain_accuracy": { "live": 0.87, "shadow": 0.91 },
    "regression_score": { "live": 0.94, "shadow": 0.95 },
    "compliance_flags": 0
  },
  "swapped_at": "2026-04-29T08:06:00Z",
  "operator": "automated",
  "rollback_to": "4minds-v2.4.1"
}
```

Eval gate output is built for audit trail requirements. Every swap, every version, every score is timestamped and retained. Your compliance team controls the log.
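Since each record is plain JSON, a compliance reviewer can verify retained swaps with the standard library alone. The file path here is hypothetical, and the field names mirror the sample above.

```python
import json

def verify_swap_record(record: dict) -> None:
    """Raise if a retained swap record would not satisfy a basic audit review."""
    assert record["eval_result"] == "PASS"
    assert record["benchmarks"]["compliance_flags"] == 0
    for name, scores in record["benchmarks"].items():
        if isinstance(scores, dict):              # skip the bare compliance counter
            assert scores["shadow"] > scores["live"], f"{name} regressed"

# Usage: load any retained record (path is hypothetical) and verify it.
with open("audit/gw_20260429_0806.json") as f:
    record = json.load(f)
verify_swap_record(record)
print(f'{record["model_from"]} -> {record["model_to"]} at {record["swapped_at"]}')
```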
See Ghost Weights on your data.
30 minutes with a 4MINDS engineer. We'll deploy a live Ghost Weights instance against a sample of your enterprise knowledge base and show you update latency end-to-end.