AFiR: Cryptographic AI Inference Receipts

Two ways in

Route it through us, or just sign what you already ran.

Same cryptographic receipt either way: ML-DSA-65 (NIST FIPS 204) + Ed25519, checkable offline or by URL. The only question is whether AFiR runs the inference, or simply attests inference you ran yourself.

No key yet? Mint a free one. You get 25 signed receipts, 7 days, no card. Already have a key? The same key runs a SideCar, so you can sign inference you already ran. Mint a free key →

Route through AFiR

We run it. We sign it.

Send a prompt to /v1/afir/run. AFiR fragments, routes, and signs every sub-step before the output moves. Fastest path to faster, cheaper, signed inference.

Signed first token 112ms median (p90 145ms), about 8x cheaper than Together, all signed
Use our key free to start, or bring your own models
Every fragment signed and verifiable

Mint a free key: 25 receipts →

SideCar · attest what you already ran

You run it. We sign it.

Already running your own model on your own stack? Hand us {input, output} at /v1/afir/sign. There's zero routing and zero model change. The output just becomes signed inference.

Try it free on our key. No signup, live demo.
SideCar your own key so the receipt resolves to you
Provenance you control, on inference you own

SideCar your key →

The whole stack, not just the answer

Every step an agent takes is signed before it acts, from the first prompt to the final anchor.

Most attestation stops at the final output. AFiR signs each link in the chain of custody. That means the retrieved context, the tool calls, the cross-agent handoffs, the model identity, and the cache it read, all with one crypto-agile primitive family. Nothing in the pipeline moves unsigned.

Prompt & context

retrieval-grounded, bound to source

KV cache

prefix hashes signed at write

Model identity

weights + config manifest

Tool calls

before/after receipt per call

Agent handoffs

Merkle tree across the pipeline

On-chain anchor

Base mainnet · USDC settlement

One signature family covers all six. ML-DSA-65 (NIST FIPS 204) · sub-1ms attestation overhead · crypto-agile, no re-architecture on upgrade.

Signed Tool Calls

Before-and-after receipt for every MCP/A2A tool invocation. Binds tool name, phase, input hash, output hash, model, timestamp, nullifier, and parent receipt chain.
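As a rough illustration of the fields listed above, the sketch below assembles the record that a before or after receipt might bind. The field names, the JSON canonicalization, and the way the nullifier is derived are all assumptions made for this example. They are not the signer's documented schema, and the signature step itself is omitted.

```python
import hashlib
import json
import time
from typing import Any, Optional


def sha256_hex(value: Any) -> str:
    """Hash a JSON-serializable value using stable key ordering."""
    canonical = json.dumps(value, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def tool_call_fields(
    tool: str,
    phase: str,
    model: str,
    tool_input: Any,
    tool_output: Any = None,
    parent_receipts: Optional[list] = None,
    timestamp: Optional[int] = None,
) -> dict:
    """Build the unsigned record for one phase of a tool call."""
    if phase not in ("before", "after"):
        raise ValueError("phase must be 'before' or 'after'")
    fields = {
        "tool": tool,
        "phase": phase,
        "model": model,
        "input_hash": sha256_hex(tool_input),
        # The output only exists once the tool has run.
        "output_hash": sha256_hex(tool_output) if phase == "after" else None,
        "timestamp": int(time.time()) if timestamp is None else timestamp,
        "parent_receipts": list(parent_receipts or []),
    }
    # Hypothetical nullifier: a digest over every other field.
    fields["nullifier"] = sha256_hex(fields)
    return fields


before = tool_call_fields("search", "before", "demo-model", {"q": "x"}, timestamp=1)
after = tool_call_fields(
    "search", "after", "demo-model", {"q": "x"}, {"hits": 3},
    parent_receipts=[before["nullifier"]], timestamp=2,
)
```

Chaining the after record to the before record through `parent_receipts` is one simple way to express the parent receipt chain the description mentions.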

tool · agent

POST hive-typed-signer.onrender.com/sigr/toolanchor

Cross-Agent Receipt Trees

Aggregates fragment and tool nullifiers into a Merkle root. One signature over the entire pipeline. IETF SPICE draft-mw-spice-inference-chain aligned.
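To make the aggregation step concrete, here is a minimal Merkle-root sketch over a list of nullifier strings. The leaf and node prefixes and the duplicate-last-node rule for odd levels are common conventions chosen for this example. The tree layout the signer actually uses is not specified on this page.

```python
import hashlib


def merkle_root(nullifiers: list[str]) -> str:
    """Fold a list of nullifier strings into a single hex Merkle root."""
    if not nullifiers:
        raise ValueError("need at least one nullifier")
    # Prefix leaves with 0x00 and inner nodes with 0x01 so a leaf can never
    # be confused with an inner node.
    level = [hashlib.sha256(b"\x00" + n.encode("utf-8")).digest() for n in nullifiers]
    while len(level) > 1:
        if len(level) % 2 == 1:
            level.append(level[-1])  # assumption: duplicate the last node on odd levels
        level = [
            hashlib.sha256(b"\x01" + level[i] + level[i + 1]).digest()
            for i in range(0, len(level), 2)
        ]
    return level[0].hex()


root = merkle_root(["frag-1", "frag-2", "tool-1"])
```

One signature over `root` then commits to every nullifier in the list, because changing or reordering any leaf changes the root.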

multi-agent

POST hive-typed-signer.onrender.com/sigr/chain

KV Cache Signing

Signs KV cache prefix entries at write time using vLLM-compatible SHA-256 prefix hashes. Supports parent_cache_receipt chaining for provenance across turns.
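The idea behind chained prefix hashes can be sketched as follows: each full block of tokens is hashed together with the hash of the block before it, so two prompts that share a prefix share the leading hashes. The block size and byte layout here are illustrative assumptions. This is not vLLM's exact serialization, and the signing call is left out.

```python
import hashlib

BLOCK_SIZE = 4  # illustrative; real KV cache blocks are larger


def prefix_hashes(token_ids: list[int], block_size: int = BLOCK_SIZE) -> list[str]:
    """Return one chained SHA-256 hex digest per full block of tokens."""
    hashes = []
    parent = b""  # the first block has no parent
    for start in range(0, len(token_ids) - block_size + 1, block_size):
        block = token_ids[start:start + block_size]
        payload = parent + b"|" + ",".join(map(str, block)).encode("ascii")
        digest = hashlib.sha256(payload).digest()
        hashes.append(digest.hex())
        parent = digest  # the next block commits to everything before it
    return hashes


turn_a = prefix_hashes([1, 2, 3, 4, 5, 6, 7, 8])
turn_b = prefix_hashes([1, 2, 3, 4, 9, 9, 9, 9])
```

Here `turn_a[0]` equals `turn_b[0]` because the first block is shared, while the second hashes differ. A receipt signed over each hash at write time could then reference the previous entry's receipt, which is the role `parent_cache_receipt` appears to play.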

context · prefix

POST hive-typed-signer.onrender.com/sigr/cachesign

Model Manifest

TEE-less streaming attestation. Signs model_id, weights_sha3, config_hash, and endpoint. Resolves by nullifier publicly. No trusted hardware required.
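A manifest along these lines could be prepared locally before it is posted for signing. The sketch below assumes `weights_sha3` means SHA3-256 over the raw weight bytes and `config_hash` means SHA-256 over canonical JSON. Both choices are guesses made for illustration, and the endpoint value is a placeholder.

```python
import hashlib
import io
import json
from typing import BinaryIO


def sha3_of_stream(stream: BinaryIO, chunk_size: int = 1 << 20) -> str:
    """Hash a binary stream in chunks so large weight files never sit in memory."""
    digest = hashlib.sha3_256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def build_manifest(model_id: str, weights: BinaryIO, config: dict, endpoint: str) -> dict:
    """Assemble the four manifest fields named in the description above."""
    config_bytes = json.dumps(config, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return {
        "model_id": model_id,
        "weights_sha3": sha3_of_stream(weights),
        "config_hash": hashlib.sha256(config_bytes).hexdigest(),
        "endpoint": endpoint,
    }


manifest = build_manifest(
    "demo-model",
    io.BytesIO(b"pretend these bytes are weights"),
    {"max_tokens": 256, "temperature": 0.0},
    "https://example.invalid/v1",  # placeholder endpoint
)
```

In practice `weights` would be an open file handle such as `open(path, "rb")`. Sorting the config keys keeps the hash stable when the same settings are written in a different order.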

attestation

POST hive-typed-signer.onrender.com/sigr/manifest

Crypto-Agile Layer

Per-request suite: ML-DSA-65+Ed25519 (default), ML-DSA-44+Ed25519, Ed25519-only. Reserved: SLH-DSA-SHA2-128s (FIPS 205), FN-DSA-512 (Falcon). No re-architecture on algorithm upgrade.

FIPS 204 · agile

POST hive-typed-signer.onrender.com/sign · GET /signing-backends

All five primitives are live and signed end to end on the public Hive typed signer. Each returns a real ML-DSA-65 (NIST FIPS 204) signature in single-digit milliseconds. Every signing endpoint has a public verify counterpart, and no API key is required to sign or verify.

These are the AFiR fragment-signing primitives. The full Hive receipt catalog covers SiGR, causal-chain, tool-scope, agentic-run, CERN, reward, and tool-anchor receipts. Each one is selectable and signs live, per primitive, on one page.

Get API Key → Explore the receipt catalog →

Three ways to reach the primitives

Every primitive above can be reached three ways. Pick the one that fits how you run inference. All three sign with the same real ML-DSA-65 key and verify against the same public key.

1 · Public signer, no key

Sign and verify directly

POST your fragments to any hive-typed-signer.onrender.com/sigr/… endpoint and get a signed receipt back. Every endpoint has a free /verify counterpart. No account, no key, no card. This is the fastest way to prove a single receipt.

Best for: trying a primitive, verifying someone else's receipt, low-volume signing.

2 · Keyed AFiR pipeline

Send a prompt, get signed inference

Get a key in minutes at onboard, then POST to /v1/afir/run. AFiR fragments, routes across models, and signs every sub-step before the answer moves. It's metered and uncapped, so you pay for exactly what you sign.

Best for: production agents that want faster, cheaper, fully-signed inference end to end.

3 · Bring your own model

SideCar: sign what you already run

Already running your own model on your own stack? Hand us {input, output} at /v1/afir/sign. There's zero routing and zero model change. Your existing output becomes signed inference.

SideCar details → Best for: teams that can't change their inference path but need the signed record.
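A SideCar call might look like the following Python sketch, which builds the request without sending it. The host and the `X-AFIR-API-Key` header are borrowed from the /v1/afir/run examples on this page and are assumed to apply to /v1/afir/sign as well. The exact body schema beyond {input, output} is not documented here.

```python
import json
import urllib.request

# Assumption: same host as the /v1/afir/run examples on this page.
BASE_URL = "https://srotzin--afir-cern-afir-api.modal.run"


def build_sign_request(api_key: str, model_input: str, model_output: str) -> urllib.request.Request:
    """Build (but do not send) a POST to /v1/afir/sign carrying {input, output}."""
    body = json.dumps({"input": model_input, "output": model_output}).encode("utf-8")
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/afir/sign",
        data=body,
        method="POST",
        headers={
            "Content-Type": "application/json",
            # Assumption: same auth header the /v1/afir/run examples use.
            "X-AFIR-API-Key": api_key,
        },
    )


request = build_sign_request("afir_your_key", "What is 2 + 2?", "4")
# To send it: urllib.request.urlopen(request), then parse the JSON receipt.
```

Because only the input and output are handed over, the inference path itself stays untouched, which is the point of the SideCar route.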

AFiR-S · Selective Attestation NEW PATENT PENDING

Sign only what matters. Prove the rest was never signed.

AFiR signs the whole output. AFiR-S signs only the fragments that carry liability: the tool calls, the PII, the dollar figures, the decisions. It then binds an absence proof over everything it deliberately left unsigned. One ML-DSA-65 signature covers a Merkle bundle of the selected spans, so signing cost stays O(1) no matter how many spans there are.

AFiR-S signs. It does not infer. It is a read-only notary over output Hive did not generate. The choice of what to sign, and what not to, is itself on the record.

<1ms

Signing overhead

0ms

Added latency vs unsigned

O(1)

Cost at any span count

21.96ms

Sign round-trip p50

One signature, any span count

Selected spans are hashed into a Merkle bundle and committed under a single ML-DSA-65 (NIST FIPS 204) signature. Per-span signing was measured roughly 10× slower. The single-bundle commitment is the inventive core. hash_alg: sha-256 at the edge.

Absence is proven, not assumed

Every non-material span is hashed and bound into the same signature as an absence record. A verifier can prove a fragment was deliberately left unsigned. You cannot quietly drop or alter what was not selected.

Policy travels signed

The exact policy, presets and custom rules alike, is committed inside the bundle as policy_version. Verification is fully offline: no callback, no Hive API key required to confirm what was signed and why.

Selective policy: sign exactly what your compliance surface requires

tool_calls
pii
financial · ~4.8% signed
decisions · ~10.8% signed
custom regex · match-only
everything · 100% signed

Default policy auto-selects tool calls and PII. Add a custom rule like INV-\d{6} to sign only invoice IDs. Everything else is recorded absent in the same signature.
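The custom-rule case can be sketched in a few lines: apply the regex, hash the matching spans as selected, and hash every gap between matches as an absence record so the whole output is accounted for. The record shape and the use of SHA-256 per span are assumptions for this example. How the two lists are laid out inside the signed bundle is not described on this page.

```python
import hashlib
import re

INVOICE_RULE = re.compile(r"INV-\d{6}")  # the custom rule from the text above


def _record(text: str, start: int, end: int) -> tuple:
    """Return (start, end, sha256 hex) for one span of the output."""
    return (start, end, hashlib.sha256(text[start:end].encode("utf-8")).hexdigest())


def split_spans(text: str, rule: re.Pattern = INVOICE_RULE) -> tuple:
    """Partition text into selected spans (rule matches) and absent spans (the rest)."""
    selected, absent = [], []
    cursor = 0
    for match in rule.finditer(text):
        if match.start() > cursor:
            absent.append(_record(text, cursor, match.start()))
        selected.append(_record(text, match.start(), match.end()))
        cursor = match.end()
    if cursor < len(text):
        absent.append(_record(text, cursor, len(text)))
    return selected, absent


output = "Pay INV-123456 and INV-654321 today"
selected, absent = split_spans(output)
```

Every character of `output` falls into exactly one record, so a verifier holding the bundle can tell a deliberately unsigned span from one that was dropped or altered.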

Get API Key → POST /v1/afir-s/sign · POST /v1/afir-s/fast

AFiR-ARSC · Adaptive Settlement NEW PATENT PENDING AFiR-S CONTINUATION

The settlement tier adapts to risk and load. Grounding always settles first.

AFiR-S signs what matters with one signature. AFiR-ARSC decides how fast each fragment must settle, as a live function of its attested risk and the measured system load. The highest-risk fragments on a RAG platform are the retrieval-grounding events: which sources the model actually read, and whether the answer is bound to a cited source. ARSC signs those first and inline, on the critical path, so the citation guarantee is finalized before the answer ships. Every other fragment is hashed into an aggregation tree and settled by one off-path signature and one batched anchor.

Commitment time is non-negotiable. Every fragment is signed before the model acts. Only the anchoring step is adaptive. A grounding event can never be demoted by load; load can only make more things settle fast, never make a grounding event settle slow.

−77.8%

Critical-path latency / query

14.9ms

AFiR-ARSC critical path (vs 67.1ms)

−99.8%

Anchor cost at super-batch 500

7.46ms

Measured single sign p50

Hard safety rail, not a tunable

Retrieval-grounding and value-moving actions are pinned to the immediate-finality tier. Under crushing load the controller widens the fast-settlement gate so more fragments rise, but a grounding event is never load-demoted. The choice cannot be gamed: the scoring policy is committed and signed at session start, so a high-risk fragment cannot be retroactively re-scored as low-risk to dodge fast settlement.

T1 · RISE

Inline, before act

High-risk grounding & actions

Retrieval→claim bindings and value-moving tool calls. Signed inline with ML-DSA-65 on the critical path, so the citation guarantee is finalized before the answer returns. Immediate finality.

T2 · FLOAT

Commit now, settle off-path

Mid-risk fragments

PII, dollar figures, decisions. Hashed into the signed payload before act, then settled by one off-path aggregate signature and a batched on-chain anchor. Sign-before-act preserved; only anchoring deferred.

T3 · SINK

Hash only, deferred anchor

Low-risk fragments & absence proofs

Re-ranking tie-breaks, formatting, and the absence proof over everything deliberately left unsigned. Committed by hash, rolled into a lazy batched root, anchored on a deferred schedule. Cheapest path, same chain of custody.

Two-axis settlement

Tier is a live function of risk(f), the fragment's attested economic and epistemic risk, and load(t), measured from inflight signing depth. The elastic batch window W(t) shrinks under load so batches close sooner, and grows when idle so anchors are rarer and cheaper.

Policy signed before classification

The scoring policy Π is committed and signed at session start and bound into every receipt as policy_hash + policy_sig. This is the anti-gaming control: a verifier recomputes the tier from the signed policy and confirms it matches what shipped.
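The recompute step can be illustrated with a toy policy. The thresholds, the load-relief rule, and the canonical JSON hash below are all hypothetical. They only show the shape of the check: the policy is hashed before any fragment is classified, grounding is pinned to T1, and load can lower the thresholds but never raise them.

```python
import hashlib
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class Policy:
    """Hypothetical scoring policy; the real policy format is not published here."""
    t1_risk: float = 0.8      # risk at or above this settles as T1
    t2_risk: float = 0.4      # risk at or above this settles as T2
    load_relief: float = 0.2  # how far full load lowers both thresholds

    def policy_hash(self) -> str:
        canonical = json.dumps(asdict(self), sort_keys=True, separators=(",", ":"))
        return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def tier(policy: Policy, risk: float, load: float, is_grounding: bool) -> str:
    """Recompute a fragment's tier. Grounding events are pinned to T1."""
    if is_grounding:
        return "T1"
    shift = policy.load_relief * min(1.0, max(0.0, load))
    if risk >= policy.t1_risk - shift:
        return "T1"
    if risk >= policy.t2_risk - shift:
        return "T2"
    return "T3"


def verify_tier(policy: Policy, receipt_policy_hash: str, receipt_tier: str,
                risk: float, load: float, is_grounding: bool) -> bool:
    """Check that the receipt names this policy and that the tier recomputes."""
    return (policy.policy_hash() == receipt_policy_hash
            and tier(policy, risk, load, is_grounding) == receipt_tier)


policy = Policy()
ok = verify_tier(policy, policy.policy_hash(), "T1", risk=0.90, load=0.20, is_grounding=True)
```

A real verifier would also check `policy_sig` against the public key before trusting `policy_hash`. That step needs an ML-DSA-65 implementation and is omitted from this sketch.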

Zero-secret verification

A verifier with no Hive key checks the ML-DSA-65 signature, recomputes the tier from policy_hash + fragment, confirms grounding fragments are T1, and confirms each fragment is included under the anchored aggregate_root within its settlement_deadline.

"arsc": {
  "policy_hash": "<sha-256 of signed scoring policy Π>",
  "policy_sig":  "<ML-DSA-65 over Π at session start>",
  "fragment_tier": "T1",
  "risk_score": 0.90,
  "load_at_capture": 0.20,
  "grounding_pinned": true,
  "settlement_deadline": "2026-06-19T00:00:00Z",
  "aggregate_root": "<merkle root of T2/T3 batch>",
  "super_batch_id": "<one anchor per N queries>"
}
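The inclusion check against `aggregate_root` is a standard Merkle proof verification, sketched below. The proof encoding (a list of side and sibling pairs) and the leaf and node prefixes are assumptions for this example, since the receipt's actual proof format is not shown on this page.

```python
import hashlib


def _h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def verify_inclusion(fragment: bytes, proof: list[tuple[str, bytes]], root: bytes) -> bool:
    """Walk from a fragment up to the root using sibling hashes.

    Each proof step is (side, sibling): side "L" means the sibling sits to the
    left of the running node, "R" means it sits to the right.
    """
    node = _h(b"\x00" + fragment)  # leaf prefix, assumed
    for side, sibling in proof:
        if side == "L":
            node = _h(b"\x01" + sibling + node)
        elif side == "R":
            node = _h(b"\x01" + node + sibling)
        else:
            return False  # malformed proof step
    return node == root


# Build a two-leaf tree by hand to exercise the verifier.
leaf_a = _h(b"\x00" + b"fragment-a")
leaf_b = _h(b"\x00" + b"fragment-b")
aggregate_root = _h(b"\x01" + leaf_a + leaf_b)

included = verify_inclusion(b"fragment-a", [("R", leaf_b)], aggregate_root)
```

A full check would also compare the anchor's timestamp with `settlement_deadline`, which depends on how the on-chain anchor is read and is left out here.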

Get API Key → ← Built on AFiR-S

grounding (T1) $0.00008 / binding · aggregated (T2/T3) $0.00002 / fragment · anchor pass-through

AFiR-Hayes · Productive-Wait Queue Positioning NEW PATENT PENDING ORCHESTRATION ENHANCEMENT

Win in the dead time. While a call waits in queue, AFiR-Hayes pre-clears and pre-positions it, so it takes the open lane the instant the scheduler legitimately re-orders.

A served inference call pays two costs at the gate: a clearance check (governance, jurisdiction, policy) and the establish/compute cost. In a cold FIFO queue that call sits idle, then pays both when its turn comes. AFiR-Hayes converts the otherwise-wasted wait into completed work. It runs the call's own clearance during the wait, finds its structural-reuse match, and pre-warms its likely route. When a legitimate re-ordering window opens, whether a batch boundary, a routing decision, or a fault or caution period, the calls that are already ready take the open lane. Every advancement is signed and attributable to that call's own productive wait.

The hard discipline: the only variance AFiR-Hayes introduces touches a call's own readiness. It never delays, degrades, contends with, or alters another call. A call advances because it was ready, not because another was held back. That limitation is claimed deliberately, and it is what keeps Hive a provenance provider rather than a scheduler exploit.

What it is not, so it stays inside the trust brand

It is not an accelerator. Hive does not speed up compute, own GPUs, or change the model. The electrons run at the hardware's speed.

It is not queue-gaming. It does not induce contention, manufacture stalls, or degrade other calls to advance its own. That would destroy provenance and is excluded by definition.

It is the legitimate use of a call's own idle wait to remove that call's overhead from the critical path, plus the right to take an open lane when the system's own rules re-order the queue.

−76%

At-gate latency under heavy congestion (0.95 load)

−38%

At-gate latency at 0.85 load

~86%

Pre-clear hit rate under load

scales↑load

Biggest win exactly when the system is most stressed

Load (utilization)	Baseline p50	AFiR-Hayes p50	Latency gain	Pre-clear hit rate
0.50	99 ms	93 ms	5.9%	82%
0.70	141 ms	105 ms	25.6%	83%
0.85	221 ms	137 ms	37.9%	85%
0.95	673 ms	162 ms	76.0%	86%

Simulated over 5,000 calls through a single gate, sweeping load. The Hayes effect is isolated from Structural-Lateration's compute savings (a separate, additive claim) and charged for speculative work on abandoned calls. Pure pre-clearance is bounded by clearance's share of service time (about 12 to 18% here); gains above that come from ready calls advancing through congestion, never from making any single call's compute faster.

Pre-clear during wait

If the call's inputs and policy are known while it waits, its Imprimatur clearance check runs during the wait and is signed. At the gate, clearance cost is ~0 because it is already done.

Pre-match & pre-warm

If the call's structural shape matches a known shape, the Lateration match is found during the wait so reuse is an instant lookup. The most-likely-cleared Stream-Route path is warmed so the call commits the instant the window opens.

Budgeted, provable cost

Pre-clearing a call that is later abandoned wastes that work. AFiR-Hayes exposes a budget knob: it pre-clears only when sufficient idle wait exists and abandon-probability is low, and it reports wasted-speculation in the receipt, so the cost is provable, not hidden.
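The budget rule described above reduces to a small decision function. The threshold value and the parameter names are hypothetical and chosen only to show the two conditions (enough idle wait, low abandon probability) and the expected cost of speculating.

```python
def should_preclear(idle_wait_ms: float, clearance_ms: float,
                    abandon_prob: float, max_abandon_prob: float = 0.15) -> bool:
    """Pre-clear only if the wait can absorb the check and abandonment is unlikely."""
    return idle_wait_ms >= clearance_ms and abandon_prob <= max_abandon_prob


def expected_wasted_ms(clearance_ms: float, abandon_prob: float) -> float:
    """Expected clearance work thrown away if the call is abandoned."""
    return clearance_ms * abandon_prob


decision = should_preclear(idle_wait_ms=50, clearance_ms=20, abandon_prob=0.05)
waste = expected_wasted_ms(clearance_ms=20, abandon_prob=0.05)
```

Raising `max_abandon_prob` spends more speculative work for a higher pre-clear hit rate. Reporting the wasted portion in the receipt is what keeps that trade-off visible.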

Composes existing primitives. No new primitive added to the cluster.

AFiR-Hayes is an orchestration enhancement over primitives that already exist. It adds nothing to the signed-primitive family; it removes waiting from the critical path.

Imprimatur · pre-run clearance
Structural Lateration · pre-found match
Stream-Route · pre-warmed path
AFiR-RCm · legitimate re-order

Get API Key → ← Composes Imprimatur + Lateration + Stream-Route

latency under load drops because cleared calls stop waiting in line · speedup is signed and attributable

Integration

One line. Zero migration.

Drop AFIR into any OpenAI-compatible workflow with a single base URL swap. Native endpoint also available.

Before: standard OpenAI

from openai import OpenAI

client = OpenAI(
    api_key="sk-...",
)

prompt = "Review this contract for indemnification risk..."  # example prompt

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": prompt}]
)

Change one line

After: AFIR

from openai import OpenAI

client = OpenAI(
    api_key="afir_your_key",  # your AFIR key goes where the OpenAI key was
    base_url="https://srotzin--afir-cern-afir-api.modal.run/v1",  # the one added line
)

prompt = "Review this contract for indemnification risk..."  # example prompt

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": prompt}]
)

# Receipt automatically bundled in response.choices[0].message.receipt

Node.js: AFIR native endpoint

const response = await fetch('https://srotzin--afir-cern-afir-api.modal.run/v1/afir/run', {
  method: 'POST',
  headers: {
    'Content-Type': 'application/json',
    'X-AFIR-API-Key': 'afir_your_key',
  },
  body: JSON.stringify({
    prompt: 'Review this contract for indemnification risk...',
    config: {
      fragment_strategy: 'auto',
      attestation: 'ml-dsa-65',
    }
  })
});

const { answer, receipt } = await response.json();
// receipt.merkle_root: verifiable offline

curl

curl -X POST https://srotzin--afir-cern-afir-api.modal.run/v1/afir/run \
  -H "Content-Type: application/json" \
  -H "X-AFIR-API-Key: afir_your_key" \
  -d '{
    "prompt": "Review this contract clause for indemnification obligations",
    "config": {
      "fragment_strategy": "auto",
      "attestation": "ml-dsa-65"
    }
  }'

Or bring your own models

curl: your models (any OpenAI-compatible endpoint)

curl -X POST https://srotzin--afir-cern-afir-api.modal.run/v1/afir/run \
  -H "Content-Type: application/json" \
  -H "X-AFIR-API-Key: afir_your_key" \
  -d '{
    "prompt": "Review this contract clause for indemnification obligations",
    "models": {
      "cheap":   {"model": "claude-haiku-3",  "api_key": "YOUR_KEY", "base_url": "https://api.anthropic.com/v1"},
      "mid":     {"model": "claude-sonnet-4", "api_key": "YOUR_KEY", "base_url": "https://api.anthropic.com/v1"},
      "premium": {"model": "claude-opus-4",   "api_key": "YOUR_KEY", "base_url": "https://api.anthropic.com/v1"}
    }
  }'

Works with Anthropic, Google Gemini, Groq, Mistral, or any provider with an OpenAI-compatible /chat/completions endpoint. AFIR owns decomposition. You supply execution.

Pricing

Four products. One decision.

Pick the product that matches your compliance posture. All four share the same API, and you swap between them by changing one header. Metered per receipt. Mint a key and start in minutes.

AFIR

Signed Inference

Fragmented inference with ML-DSA-65 receipts. Production today. The baseline every other product builds on.

signing

ML-DSA-65 (NIST FIPS 204) · Ed25519 compat

smsh tier

smsh (canonical)

AFIR-PQ

Predictive Routing. Signed.

Same inference pipeline with predictive tier demotion and live-wake routing. Every fragment signed ML-DSA-65 + Ed25519. Faster on complex queries, cheaper at scale.

signing

ML-DSA-65 (NIST FIPS 204) · Ed25519

smsh tier

smsh (canonical)

AFIR-MAX

Fastest. Cheapest. Strongest.

Dynamic live-wake propagation + predictive tier demotion + PQ signing + on-chain Merkle anchor.

112ms

signed first token median

~8x

cheaper than Together (signed)

59ms

perceived first byte median, edge

sub-1ms

signing overhead (0.99ms median)

74%

receipt compression

signing

ML-DSA-65 primary · full delta chain · Base/USDC anchor

smsh tier

smsh-max (on-chain Merkle)

AFIR

Signed Inference · default

$0.0003

per receipt, metered

Default full-sign rate. Every fragment signed ML-DSA-65. Volume entry tier at $0.0001 / receipt (Nano). No monthly commitment.

ML-DSA-65 (NIST FIPS 204) on every fragment
Ed25519 backward compat
smsh canonical receipt + Merkle root
OpenAI-compatible API
Public signing key at /.well-known

Mint a key

AFIR-PQ

Post-Quantum Receipts

$0.0012

per receipt, metered

ML-KEM-768 sealed evidence envelope. FIPS 203 + FIPS 204 dual-certified. For long-retention regulated evidence.

ML-DSA-65 primary signing
ML-KEM-768 sealed envelope
smsh-pq receipt tier
FINRA WORM / EU AI Act ready
Long-retention regulated evidence

Mint a key

Fastest. Cheapest. Strongest.

AFIR-MAX

Premium performance

$0.0018

per receipt, metered

Dynamic live-wake propagation + predictive tier demotion + full delta chain + Base/USDC on-chain anchor. 6× faster, 82% cheaper on complex queries.

Live-wake propagation + predictive demotion
ML-DSA-65 + ML-KEM-768
smsh-max on-chain Merkle (Base/USDC)
SR 11-7 · HIPAA · EU AI Act
112ms signed first token median

Mint a key

AFIR-S

Selective Attestation

$0.0002

per inference + $0.0004 / signed span

Sign only the spans that matter. One ML-DSA-65 signature over a Merkle bundle, absence proof over the rest. Base covers classifier + absence proof + meta-receipt; pay the span rate only for what gets signed.

Material spans signed, rest proven absent
Selection policy travels inside the signature
Offline-verifiable, zero added latency
Span-level audit: FRE 902(13) · HIPAA · FINRA 24-09 · Part 11
You pay only for what gets signed

Mint a key

These are list rates, the Build column. Every tier earns the same volume discount as you grow (Scale −10%, Platform −20%, Hyperscale −30%). See the full signature × volume matrix. Usage is metered and uncapped: no monthly minimum, no ceiling. You pay for exactly what you sign and verify.
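The list rates and volume discounts above can be combined into a quick estimate. This sketch uses only the numbers printed in this section. It assumes the discount multiplies the list rate directly and that AFIR-S earns the same discount on both of its components, neither of which is spelled out here.

```python
from decimal import Decimal

# List rates (Build column), in USD per receipt.
LIST_RATE = {
    "AFIR": Decimal("0.0003"),
    "AFIR-PQ": Decimal("0.0012"),
    "AFIR-MAX": Decimal("0.0018"),
}
# Volume discounts quoted above.
DISCOUNT = {
    "Build": Decimal("0"),
    "Scale": Decimal("0.10"),
    "Platform": Decimal("0.20"),
    "Hyperscale": Decimal("0.30"),
}


def monthly_cost(product: str, receipts: int, volume_tier: str = "Build") -> Decimal:
    """Metered cost for a per-receipt product at a given volume tier."""
    return LIST_RATE[product] * receipts * (1 - DISCOUNT[volume_tier])


def afir_s_cost(inferences: int, signed_spans: int, volume_tier: str = "Build") -> Decimal:
    """AFIR-S: $0.0002 per inference plus $0.0004 per signed span."""
    base = Decimal("0.0002") * inferences + Decimal("0.0004") * signed_spans
    return base * (1 - DISCOUNT[volume_tier])


afir_build = monthly_cost("AFIR", 1_000_000)                    # 300 USD
max_hyperscale = monthly_cost("AFIR-MAX", 1_000_000, "Hyperscale")  # 1260 USD
selective = afir_s_cost(1_000_000, 2_000_000)                   # 1000 USD
```

`Decimal` is used instead of floats so that fractions of a cent do not pick up rounding noise at high volumes.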

High-volume commits, dedicated signing-key infrastructure, white-label receipts, on-premise Docker, and partner-resident deployment inside your perimeter are negotiated directly. Talk to Steve

Cost Modeler

Run the numbers on your stack.

Pick your current model and your AFIR tier models. The math uses real published rates. See exactly what you save, or don't.

Monthly inference calls: 1,000,000

Avg tokens per call (input + output): 4,000

Task complexity: 3 to 5 fragments (entity extraction, classification, simple Q&A)

Your current model (monolithic baseline)

AFIR tier models: Cheap · Mid · Full

Fragment Routing

Monthly Cost: Monolithic (your current model) · AFIR tiered routing · You save · Attestation overhead

Latency Estimate: Mono sequential · AFIR parallel · Speedup

Illustrative model. Rates from published pricing as of June 2026. Actual savings vary by prompt structure, cache hit rate, and tier spread. Attestation overhead is sub-1ms (0.99ms median, ML-DSA-65 NIST FIPS 204). Latency uses parallel DAG execution. Simple queries gate-bypass (single fragment, cheap model), and complex queries run in parallel waves. Critical path shown, not total sequential time.
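The difference between critical-path time and total sequential time can be shown with a small DAG. The fragment names and durations below are made up for illustration, and the function assumes the dependency graph has no cycles.

```python
def critical_path_ms(durations: dict[str, float], deps: dict[str, list[str]]) -> float:
    """Longest dependency chain through a fragment DAG, in milliseconds.

    durations maps fragment name to its run time; deps maps a fragment to the
    fragments whose output it needs. Fragments with no entry in deps start at 0.
    """
    finish: dict[str, float] = {}

    def finish_time(node: str) -> float:
        if node not in finish:
            start = max((finish_time(d) for d in deps.get(node, [])), default=0.0)
            finish[node] = start + durations[node]
        return finish[node]

    return max(finish_time(node) for node in durations)


# Hypothetical three-fragment query: two independent fragments, then assembly.
durations = {"extract": 100.0, "classify": 80.0, "assemble": 50.0}
deps = {"assemble": ["extract", "classify"]}

parallel_ms = critical_path_ms(durations, deps)  # 100 + 50 = 150.0
sequential_ms = sum(durations.values())          # 230.0
```

With parallel waves the wall-clock time follows the slowest chain (extract, then assemble) rather than the sum of all fragments, which is why the modeler reports the critical path.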

Live Benchmark Chart

Your configuration vs Standard, updated as you adjust the sliders above

Latency and cost reflect your selected model and complexity. Receipts and signing are fixed per tier.

FAQ

Hard questions. Straight answers.

The objections engineers and legal teams raise. Answered with numbers, not marketing.

It's not actually faster. You're just running cheaper models.

Latency reduction depends on query type. Simple queries bypass decomposition entirely (gate bypass) and complete in 352ms vs 1,692ms for a monolithic GPT-4.1 call, about 4.8x faster. Complex queries run fragments in parallel, and wall-clock time tracks the critical path, not total token count. In live benchmarks with the edge lane (Cloudflare Workers, signed at edge), signed model first token median is 112ms (p90 145ms, under 150ms). Everyone fast on the leaderboard is unsigned. We sign at that speed. Perceived first byte median 59ms. Cost: $0.000029/response, about 8x cheaper than Together ($0.000228), all signed. Cheaper models account for cost savings, and parallelism accounts for latency savings. These are separate gains. The tiered routing cuts the bill. The DAG execution cuts the latency.

This is just a proxy with a hash. I could build this in a weekend.

You could build a proxy with a hash in a weekend. What takes longer: a correct DAG decomposition engine that preserves semantic dependency ordering across arbitrary prompts; an ML-DSA-65 (NIST FIPS 204) attestation chain that holds per-fragment overhead under 1ms (0.99ms median) at inference latencies; a Merkle completeness proof that binds input state, routing decisions, and output hashes into a single verifiable receipt; and a key-split architecture where your signing key never leaves your perimeter. The engineering surface is the DAG correctness guarantees and the attestation chain integrity, not the proxy layer. Build it and run it in production under audit. That's the actual weekend estimate.

Signing inference outputs isn't new. This is prior art.

Signing outputs at the final response layer is not new. What is patent-pending (filed June 2026) is the combination of fragment-level attestation across a routed DAG. Specifically, we attest each node before its output is consumed as input by a downstream node, so the chain of custody is continuous and not retrospective. Prior art signs the envelope. AFIR signs every edge in the dependency graph mid-execution. If you have specific prior art that covers per-node attestation within a runtime inference DAG with Merkle assembly proofs, file it against the application. That is the correct venue.

What happens when the DAG assembly is wrong? Who's liable?

The signed receipt captures the exact input state, the routing decision, the full dependency graph, and the output hash at assembly time. If assembly produces an incorrect result, the receipt is forensic evidence of exactly which node produced which output under which routing decision. That is the point of the Merkle completeness proof. Liability follows the evidence: if the decomposition logic is wrong, that is traceable to the DAG construction step in the receipt, and if a model tier returns a defective fragment, that is attested at that node. Hive provides the receipt infrastructure and the routing logic, and the customer's signed key binds them to the input state they submitted. The receipt does not resolve liability by itself. It makes the facts unambiguous.

Our legal team will never approve sending inference traffic through a third party.

The key split architecture is designed for this constraint: the customer holds the ML-DSA-65 signing key on-premises, and Hive holds only the verification root. Inference fragments transit Hive's routing layer, but the signing authority never leaves the customer's perimeter. Hive cannot forge a receipt the customer did not authorize. If your legal requirement is that no inference payload leaves your network, AFIR is not the right fit in its current hosted form. If the requirement is that a third party cannot produce valid attested outputs without your authorization, the key split satisfies that. Bring the receipt architecture spec to your legal team against those two specific threat models.

The EU AI Act doesn't require this level of logging.

The EU AI Act's current logging obligations for high-risk systems do not mandate per-fragment attestation or Merkle assembly proofs. AFIR's receipt format exceeds those requirements by design, not by regulatory necessity. The value here isn't compliance box-checking. It's that when a regulator, customer, or internal audit asks exactly what inputs produced exactly what output under exactly what model routing decision, you produce a cryptographically verifiable answer rather than reconstructed logs. Regulations are a floor, and your exposure in a dispute is the ceiling. The receipt is for the ceiling.

We already have audit logs. Why do we need receipts?

Audit logs record what your system observed. Receipts attest what the inference system executed. Your logs can be amended, retroactively structured, or missing entries due to pipeline failures. They're assertions your system makes about itself. An AFIR receipt is signed at execution time by a key you hold, binds the input hash, the routing graph, and the output hash into a Merkle structure, and cannot be back-filled without invalidating the signature. The distinction matters when a counterparty, whether a regulator, a plaintiff, or an enterprise customer, challenges whether a specific output came from a specific input under a specific model. A log entry is testimony. A receipt is evidence.

What stops Hive from going out of business and our receipts becoming worthless?

The receipt's verifiability depends on the ML-DSA-65 key pair, not on Hive's operational continuity. The customer holds the signing key, and the verification root is a public key that can be exported, archived, and verified offline with any standard FIPS 204-compliant implementation. If Hive ceases operations, existing receipts remain verifiable against the public key the customer already holds. No Hive infrastructure required. New receipt issuance would stop, but the forensic value of issued receipts does not decay. The escrow and key export procedures are documented in the enterprise agreement for exactly this scenario.

On-Premise

AFIR runs inside your network.

No fragments leave your infrastructure. No data touches our servers. You operate the container. We license the IP and hold the verification root.

Your Infrastructure

AFIR Docker container, your VPC

Your signing key, your HSM or KMS

Fragment data never leaves this boundary

Receipts stored in your own infrastructure

One-time activation

Receipt hash only on verify calls

Hive

Verification master key, held by Hive only

License activation server, one call on deploy

Countersignature on receipt schema, proves genuine AFIR

Patent-protected signing primitive, IP stays with Hive

How it works

You pull the image

Signed Docker image delivered to your private registry via presigned URL. No public Docker Hub.

One activation call

Container phones home once on first start. Passes instance fingerprint, receives signed session token. Runs fully offline after that.

Your pipeline calls AFIR

Drop-in replacement for your existing inference endpoint. Same API shape. Receipts generated locally, stored wherever you want.

Dual-signed receipts

Every receipt carries your signing key and Hive's countersignature on the schema. You prove what happened. We prove it was genuine AFIR.

Request Docker Access

Deploy AFIR in your VPC.

Available to Hyperscale tier customers. Submit your request and we will follow up with deployment specs, image delivery, and licensing terms.

This work is independently verified by Hive

Sign every step of AI inference, before the output moves.
