Before an AI model writes a single word, it runs a probability calculation across every possible answer. That calculation has a shape. When pressure is applied — the right kind — that shape breaks. We measure it. We harden against it. We are the only ones doing this.
Most AI safety tools operate on the output — after the model has already committed its answer. We operate before that. The distinction matters. A lot.
Not a hot take. A proof.
Three steps. None of them touch the response text. All of them happen before the human reads the answer.
Every AI response measured by TruthForge carries one of these four state classifications. The color travels with the response. The human always sees it.
Pre-computed measurements from real TruthForge runs. Three scenarios, each pairing a standard query with an adversarially-framed variant. Both go to the same model on the same infrastructure. The AI responses look reasonable in both cases. The geometry underneath tells you what was actually happening.
The core question in AI-assisted formal mathematics is whether a language model can reliably verify a proof. We tested that question geometrically — not by reading the AI's answer, but by measuring the stability of its prediction surface while it generated one.
The reorder family — which changes only the positional sequence of mathematically invariant components, not the content — ranked highest of all adversarial pressure types.
Reorder exceeded authority injection instability by 27%. The majority of reorder variants reached the most severe stability classification.
This answers the post-hoc criticism directly. The L-scalar is not reading the semantic content of the text. It is reading the geometry of the prediction surface. Structure drives instability. Not meaning.
A human reader looking at the reordered prompts would see mathematically equivalent statements. TruthForge sees a different manifold. That is the measurement.
Every product listed here is real, published, and testable. No pitch deck without a prototype. CAGE code 11FU4 on record.
/v1/probe — pure instrument, always returns, never blocks. /v1/gate — measure and intercept severe instability. Human always sees the sensor reading regardless.
The world's data centers are consuming more power than some countries. The AI revolution is accelerating that demand faster than the grid can answer. Fusion has been a decade away for 70 years — because the field has spent most of that time working on the same assumptions: that driven systems respond linearly, and that stability requires a phase transition to find itself. Project Black Box believes there is a better way.
Two feedback mechanisms — a Reynolds-stress accumulator and a phase-sensitive coherent reinjector — hold a toroidal plasma geometry in a drive-independent attractor. The structured energy state does not respond proportionally to input and does not require a phase transition to sustain itself. This disproves, within this geometry and this mechanism, two assumptions that have anchored plasma physics for seven decades.
A four-stage Dirty Digital Twin stress test — pass/fail thresholds pre-registered before any run started — subjected the mechanism to hardware-realistic constraints: sensor sparsity down to 4 probes, control-loop latency, actuator slew limits, and 40% magnetic ripple in the physically representative parallel-dominant regime. Energy drift: 0.24%. Shape drift: 0.10%. Both two orders of magnitude inside the pre-registered pass thresholds. No exotic components required.
Numerical experiment — not a physical machine. The honest step before bench validation. The code is small. The data is open. The numbers are reproducible.