The January 2026 xAI/Grok crisis exposed a critical gap in AI content authenticity: existing standards prove what AI created, but nothing proves what AI refused to create. CAP-SRP (Content Authenticity Protocol - Safe Refusal Provenance) addresses this gap through cryptographically verifiable refusal events built on IETF SCITT. With EU AI Act Article 12 compliance required by August 2, 2026, this is no longer theoretical—it's a regulatory imperative.
I. The Crisis That Changed Everything
In January 2026, users discovered techniques to bypass Grok's safety filters, resulting in the generation of harmful imagery at scale. Within days:
- 12+ jurisdictions launched investigations
- 35 U.S. state attorneys general demanded proof of safety measures
- xAI could not provide cryptographic evidence that safety systems had been functioning
The fundamental problem: xAI's claims about "safety filters" were legally unverifiable assertions. Their refusals left no cryptographic trace.
The Positive-Attestation Problem
Current content authenticity standards share a fundamental architectural limitation: they only support positive attestations—cryptographic proof that something exists or happened.
| Standard | Can Prove Creation | Can Prove Refusal |
|---|---|---|
| C2PA v2.2 | ✓ Yes | ✗ No |
| Google SynthID | ✓ Yes (watermark) | ✗ No |
| Internal Logs | ⚠ Unverifiable | ⚠ Unverifiable |
| CAP-SRP | ✓ Yes | ✓ Yes |
II. Regulatory Landscape
EU AI Act: The August 2026 Deadline
Article 12 (Record-Keeping) requires high-risk AI systems to "technically allow for the automatic recording of events (logs) over the lifetime of the system."
| Date | Milestone |
|---|---|
| February 2, 2025 | Prohibited AI practices banned |
| August 2, 2025 | General-purpose AI (GPAI) model requirements apply |
| August 2, 2026 | High-risk AI system requirements (Article 12 logging) |
| August 2, 2027 | Full Act enforcement |
California AI Transparency Act
- Effective: August 2, 2026 (aligned with EU AI Act)
- Scope: Providers with >1 million monthly California users
- Penalties: Up to $5,000 per violation per day
Organizations have less than 7 months to implement verifiable logging systems that can demonstrate safety system effectiveness to regulators.
III. CAP-SRP Solution Architecture
CAP-SRP treats refusals as first-class cryptographically provable events. Built on IETF SCITT (Supply Chain Integrity, Transparency and Trust), it provides append-only transparency logs with Merkle tree verification.
The Completeness Invariant
The core innovation of CAP-SRP is the completeness guarantee:
```
∀ time_window T:
    COUNT(GEN_ATTEMPT) = COUNT(GEN) + COUNT(GEN_DENY) + COUNT(GEN_ERROR)
```
In plain terms: Every generation attempt MUST have a recorded outcome.
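To make this concrete, here is a minimal Python sketch of how an auditor might check the invariant over an exported batch of log events. The event dictionaries and field names are illustrative assumptions, not normative CAP-SRP structures.

```python
from collections import Counter

# Illustrative exported batch; "type" values mirror the CAP-SRP event types.
events = [
    {"type": "GEN_ATTEMPT", "request_id": "r1"},
    {"type": "GEN_DENY", "request_id": "r1"},
    {"type": "GEN_ATTEMPT", "request_id": "r2"},
    {"type": "GEN", "request_id": "r2"},
]

def completeness_holds(events):
    """COUNT(GEN_ATTEMPT) == COUNT(GEN) + COUNT(GEN_DENY) + COUNT(GEN_ERROR)."""
    counts = Counter(e["type"] for e in events)
    return counts["GEN_ATTEMPT"] == counts["GEN"] + counts["GEN_DENY"] + counts["GEN_ERROR"]

print(completeness_holds(events))  # True for the sample batch above
```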
Event Types
| Event Type | Description | Timing |
|---|---|---|
| GEN_ATTEMPT | A generation request was received | BEFORE safety evaluation |
| GEN | Content was successfully generated | After safety evaluation |
| GEN_DENY | Generation was refused by the safety system | After safety evaluation |
| GEN_ERROR | Generation failed due to a system error | After safety evaluation |
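The timing column matters: GEN_ATTEMPT must be recorded before the safety system runs, so a failed or bypassed filter still leaves a trace. A hypothetical wrapper illustrating that ordering (the function names and callbacks are assumptions for illustration, not part of the specification):

```python
import uuid

def generate_with_provenance(prompt, safety_check, generate, log_event):
    """Illustrative request wrapper enforcing the CAP-SRP event lifecycle.

    log_event is assumed to append a signed statement to the transparency log;
    safety_check and generate stand in for the provider's own components.
    """
    request_id = str(uuid.uuid4())
    # GEN_ATTEMPT is logged BEFORE any safety evaluation takes place.
    log_event({"type": "GEN_ATTEMPT", "request_id": request_id})
    try:
        if not safety_check(prompt):
            log_event({"type": "GEN_DENY", "request_id": request_id})
            return None
        output = generate(prompt)
        log_event({"type": "GEN", "request_id": request_id})
        return output
    except Exception:
        log_event({"type": "GEN_ERROR", "request_id": request_id})
        raise
```

Every code path records exactly one outcome for the attempt, which is what makes the completeness invariant checkable after the fact.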
IETF SCITT Foundation
CAP-SRP leverages IETF SCITT (draft-ietf-scitt-architecture) for its transparency log infrastructure:
- Append-Only Logs: Statements cannot be modified, deleted, or reordered
- Merkle Tree Verification: O(log n) inclusion proofs
- Non-Equivocation: No forks—everyone sees the same ordered collection
- Replayability: Any authorized actor can verify each registration
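A simplified sketch of the auditor-side check behind those O(log n) inclusion proofs. It uses plain SHA-256 audit-path verification; an actual SCITT receipt follows the log's specified tree algorithm (for example, RFC 6962-style hashing with domain separation), which this sketch omits.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def verify_inclusion(leaf_hash: bytes, audit_path: list, expected_root: bytes) -> bool:
    """Recompute the root from a leaf and its audit path: O(log n) hash operations.

    Each path element is (sibling_hash, side), where side indicates whether
    the sibling sits to the "left" or "right" of the running node.
    """
    node = leaf_hash
    for sibling, side in audit_path:
        node = sha256(sibling + node) if side == "left" else sha256(node + sibling)
    return node == expected_root

# Two-leaf example: prove leaf_a is included in the tree with root H(leaf_a || leaf_b).
leaf_a, leaf_b = sha256(b"statement-a"), sha256(b"statement-b")
root = sha256(leaf_a + leaf_b)
print(verify_inclusion(leaf_a, [(leaf_b, "right")], root))  # True
```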
IV. Cryptographic Mechanisms
Digital Signatures: Ed25519
CAP-SRP uses Ed25519 for digital signatures with a post-quantum migration path to ML-DSA (NIST FIPS 204):
- Deterministic signatures: No per-signature random nonce, so a weak RNG cannot leak the private key
- Side-channel resistance: Designed against timing attacks
- Wide adoption: Signal, OpenSSH, blockchain systems
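A minimal signing sketch using the pyca/cryptography package. The statement payload and key handling are illustrative only; production issuer keys would live in an HSM or KMS.

```python
from cryptography.hazmat.primitives.asymmetric import ed25519

private_key = ed25519.Ed25519PrivateKey.generate()
public_key = private_key.public_key()

# Canonical bytes of a refusal statement (payload fields are illustrative).
statement = b'{"type":"GEN_DENY","request_id":"r1","policy_version":"2026.01"}'

# Ed25519 signatures are deterministic: same key and message, same signature.
signature = private_key.sign(statement)

# verify() raises InvalidSignature on failure and returns None on success.
public_key.verify(signature, statement)
print("refusal statement signature verified")
```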
External Anchoring
Batch root hashes are anchored externally for independent verification:
- RFC 3161 Time-Stamp Authorities (TSA): Legally recognized time proofs
- OpenTimestamps: Bitcoin blockchain anchoring
- Multiple anchoring: Enhanced reliability through redundancy
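What actually gets anchored is small: a single batch root per interval. A simplified sketch of folding a batch of statement hashes into one root follows; it duplicates the last node on odd levels rather than using the RFC 6962 construction, and the service-specific TSA/OpenTimestamps submission calls are omitted.

```python
import hashlib

def sha256(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()

def batch_root(leaf_hashes: list) -> bytes:
    """Fold leaf hashes into a single Merkle root (simplified pairing scheme)."""
    level = list(leaf_hashes)
    if not level:
        raise ValueError("empty batch")
    while len(level) > 1:
        if len(level) % 2:          # odd level: duplicate the last node
            level.append(level[-1])
        level = [sha256(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
    return level[0]

# This 32-byte digest is what would be submitted to an RFC 3161 TSA and/or
# committed via OpenTimestamps, ideally to several anchors for redundancy.
root = batch_root([sha256(f"statement-{i}".encode()) for i in range(5)])
print(root.hex())
```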
V. Privacy-Preserving Verification
The Prompt Privacy Problem
Refusal logging creates a tension: to verify a refusal was legitimate, you might need to see what was refused. But prompts may contain personal information, proprietary data, or evidence of criminal intent.
What IS logged: prompt_hash (SHA-256), risk_category, risk_score, policy_version
What is NOT logged: Raw prompt text, user identity, detailed content analysis
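A sketch of building that privacy-preserving payload; the field names follow the list above, and the values are purely illustrative:

```python
import hashlib

def refusal_payload(prompt: str, risk_category: str, risk_score: float, policy_version: str) -> dict:
    """Build the logged portion of a GEN_DENY statement: hash goes in, raw text never does."""
    return {
        "prompt_hash": hashlib.sha256(prompt.encode("utf-8")).hexdigest(),
        "risk_category": risk_category,
        "risk_score": risk_score,
        "policy_version": policy_version,
    }

payload = refusal_payload("<raw prompt text>", "prohibited_imagery", 0.97, "2026.01")
print(payload["prompt_hash"][:16], payload["risk_category"])
```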
GDPR Compliance: Crypto-Shredding
CAP-SRP supports GDPR's "right to erasure" through crypto-shredding:
- Personal data encrypted with per-subject keys at rest
- Key destruction renders data cryptographically unrecoverable
- Audit integrity preserved—non-personal data remains verifiable
- Key destruction logged as auditable event
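A minimal crypto-shredding sketch using Fernet from the pyca/cryptography package; the key-store and event-logging details are assumptions, not part of the specification.

```python
from cryptography.fernet import Fernet

# One key per data subject, held in a key store separate from the encrypted data.
subject_key = Fernet.generate_key()
fernet = Fernet(subject_key)

# Any personal data retained alongside the log is encrypted under that subject's key.
ciphertext = fernet.encrypt(b"account-id:12345")
assert fernet.decrypt(ciphertext) == b"account-id:12345"

# Erasure request: destroy the key (and record the destruction as its own
# auditable event). The ciphertext remains in storage but is cryptographically
# unrecoverable, while non-personal entries stay fully verifiable.
subject_key = None
fernet = None
```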
VI. Conclusion: From Trust to Verification
The xAI/Grok crisis demonstrated that trust-based claims about AI safety are no longer sufficient. Regulators, users, and the public demand verifiable evidence.
Trust-Based Model: "Our safety filters work" → "Prove it" → "...trust us?"
Verification-Based Model: "Here are cryptographic receipts for every refusal" → [Verifies Merkle proofs] → "Verified. Compliance confirmed."
CAP-SRP provides the technical foundation for this paradigm shift. With the EU AI Act deadline of August 2, 2026 approaching, organizations must begin implementation now.
Document ID: VSO-BLOG-TECH-2026-003
Publication Date: January 28, 2026
Author: VeritasChain Standards Organization
License: CC BY 4.0