Verifiable AI Provenance (VAP) Framework and Legal AI Profile (LAP)

Introduction The deployment of AI systems in high-risk domains -- including finance, healthcare, transportation, and the administration of justice -- creates a structural accountability gap. AI decisions that affect fundamental rights and societal infrastructure lack standardized, cryptographically verifiable audit trails that independent third parties can inspect. Current approaches rely on trust-based governance: AI providers assert that their systems are safe and well-logged, but no independent party can cryptographically verify these claims. The Verifiable AI Provenance (VAP) Framework addresses this gap by defining a "Verify, Don't Trust" architecture for AI decision provenance. This document defines two complementary specifications:

VAP Framework (Part I): A cross-domain upper framework defining common infrastructure for verifiable AI provenance applicable to any high-risk AI domain.
Legal AI Profile (LAP) (Part II): A domain-specific profile for legal AI systems, addressing requirements arising from professional regulation of attorneys and high-risk AI system governance.

Scope VAP targets AI systems where "system failure could cause significant and irreversible harm to human life, societal infrastructure, or democratic institutions." This intentionally strict scope distinguishes VAP from general-purpose logging frameworks. LAP specifically addresses legal AI systems that provide AI-powered legal consultation, document generation, and fact-checking services to licensed attorneys.

Design Philosophy The core principle is "Verify, Don't Trust." Rather than relying on AI providers' claims about the safety and integrity of their systems, VAP enables independent, cryptographic verification of every AI decision's provenance, completeness, and human oversight.

Conventions and Definitions The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 when, and only when, they appear in all capitals, as shown here.

Terminology

VAP: Verifiable AI Provenance Framework - the cross-domain upper framework defined in this document.
Profile: A domain-specific instantiation of VAP (e.g., VCP for finance, CAP for content, LAP for legal).
LAP: Legal AI Profile - the judicial AI domain profile defined in this document.
Provenance: Cryptographically verifiable record of data origin, derivation, and history.
Completeness Invariant: A mathematical guarantee that every attempt event has exactly one corresponding outcome event.
Evidence Pack: A self-contained, signed package of provenance events suitable for regulatory submission and third-party audit.
External Anchor: Registration of a Merkle root hash with an external trusted timestamping service such as or a compatible transparency log.
Human Override: An event recording a human professional's review, approval, modification, or rejection of an AI-generated output.
Override Coverage: The ratio of AI outputs reviewed by a human professional to total AI outputs, expressed as a percentage.
Causal Link: A reference from an outcome event to its originating attempt event, establishing referential integrity within a pipeline.

VAP Framework Architecture

Layer Model VAP is organized into four core layers, a common infrastructure layer, and a domain profile layer:

Integrity Layer: Hash chain, digital signatures, timestamps (REQUIRED for all levels).
Provenance Layer: Actor, input, context, action, and outcome recording (REQUIRED).
Accountability Layer: Operator identification, approval chain, delegation records (REQUIRED for operator_id; RECOMMENDED for approval chain).
Traceability Layer: Trace IDs, causal links, cross-profile references (REQUIRED for trace_id; OPTIONAL for cross-references).
Common Infrastructure: Conformance levels, external anchoring, completeness invariant, evidence packs, privacy-preserving verification, retention framework (availability depends on conformance level).
Domain Profile Layer: Domain-specific event types, data model extensions, regulatory mappings (defined per profile).

Domain Profiles VAP supports multiple domain profiles. Each profile MUST define:

Event Types: Domain-specific event type taxonomy.
Data Model Extensions: Additional fields beyond the VAP common event structure.
Conformance Mapping: Mapping to VAP Bronze/Silver/Gold levels.
Regulatory Alignment: Mapping to applicable regulations (informative).
Completeness Invariant Application: How the completeness invariant applies to domain-specific event flows.

Registered profiles include VCP (Finance), CAP (Content/Creative AI), and LAP (Legal AI, defined in Part II of this document). Additional profiles for automotive (DVP), medical (MAP), and public administration (PAP) domains are under development.

Cryptographic Foundation

Algorithm Requirements All VAP-conformant implementations MUST support the following cryptographic algorithms: Required Cryptographic Algorithms

Category	Primary	Alternative	Post-Quantum (Future)
Hash	SHA-256	SHA-384, SHA-512	SHA3-256
Signature	Ed25519 (RFC 8032)	ECDSA P-256	ML-DSA-65
Encryption	AES-256-GCM	ChaCha20-Poly1305	Kyber-1024

Implementations MUST include algorithm identifiers in all cryptographic fields to support crypto agility and future algorithm migration.

Hash Chain Specification Events MUST be linked in a hash chain where each event's hash includes the hash of the preceding event: Chain integrity verification MUST confirm:

Each event's hash matches its recomputed hash.
Each event's PrevHash matches the preceding event's EventHash.
The genesis event has a null PrevHash.

Digital Signature Requirements Every event MUST be signed using Ed25519 (). The signature MUST be computed over the event hash bytes:

Common Event Structure All VAP-conformant events MUST include the following fields: Event identifiers MUST use UUIDv7 () to ensure time-ordered sortability. JSON canonicalization MUST follow .

Numeric Value Encoding Fields representing monetary amounts, cryptographic values, or high-precision measurements SHOULD be encoded as JSON strings rather than JSON numbers. This recommendation is motivated by:

IEEE 754 double-precision floating-point, the only numeric type in JSON (per RFC 8259, Section 6), cannot exactly represent all decimal values. For example, 0.1 + 0.2 != 0.3 in IEEE 754. Financial and legal contexts require exact decimal representation.
JSON parsers across programming languages exhibit inconsistent behavior for large integers (exceeding 2^53) and high-precision decimals, leading to silent data corruption.
Canonicalization stability: defines specific rules for numeric serialization, but string encoding avoids parser-dependent numeric reformatting entirely, ensuring consistent hash computation across implementations.

Fields where exact precision is not critical (e.g., event_count, token_count) MAY use JSON numbers. Implementations MUST document which fields use string encoding. Implementations that use JSON numbers for counters MUST ensure that any numeric-to-string conversion performed during canonicalization is deterministic and documented, to avoid signature verification ambiguity across languages and libraries.

Conformance Levels VAP defines three conformance levels applicable to all domain profiles. Each level inherits all requirements of lower levels (Gold is a superset of Silver, which is a superset of Bronze).

Bronze Level Target: SMEs, early adopters. Core capabilities:

Event logging for all AI decision points (REQUIRED)
SHA-256 hash chain linking all events (REQUIRED)
Ed25519 digital signature on every event (REQUIRED)
ISO 8601 timestamps with timezone (REQUIRED)
UUIDv7 event identifiers (REQUIRED)
Minimum 6-month retention (REQUIRED)
JSON Schema validation (REQUIRED)

Silver Level Target: Enterprise, regulated industries. Additional requirements beyond Bronze:

Daily external anchoring to a trusted timestamping service conforming to or an equivalent transparency log (REQUIRED)
Completeness Invariant verification (REQUIRED)
Evidence Pack generation capability (REQUIRED)
Sensitive data hashing for privacy preservation (REQUIRED)
Minimum 2-year retention (REQUIRED)
Merkle tree construction for efficient proofs (REQUIRED)
Third-party verification endpoint (REQUIRED)

Gold Level Target: Highly regulated industries. Additional requirements beyond Silver:

Hourly external anchoring (REQUIRED)
HSM for signing key storage, FIPS 140-2/3 (REQUIRED)
Integration with a transparency log service such as IETF SCITT or equivalent (REQUIRED)
Real-time audit API with sub-second latency (REQUIRED)
Minimum 5-year retention (REQUIRED)
24-hour incident response and evidence preservation (REQUIRED)
Geographic redundancy, minimum 2 regions (REQUIRED)
Annual third-party audit (REQUIRED)
Crypto-shredding support (REQUIRED)

External Anchoring External anchoring proves that events existed at a specific point in time, preventing backdating, forward-dating, and log forking.

Anchoring Service Types VAP defines an abstract anchoring interface that can be realized by multiple service types. The baseline anchoring service is Time-Stamp Authority (TSA). Additional service types include transparency logs and public blockchains.

RFC 3161 TSA (Baseline): Traditional enterprise timestamping via X.509 PKI (). This is the normative baseline. Trust model: CA trust hierarchy.
Transparency Log (e.g., IETF SCITT): Append-only transparency logs providing public verifiability. IETF SCITT () is one such service; implementations MAY use any transparency log providing equivalent guarantees. Trust model: public append-only log.
Blockchain: Bitcoin or Ethereum anchoring for maximum decentralization. Trust model: PoW/PoS consensus. This option is non-normative and provided for environments requiring decentralized trust.

Gold Level implementations MUST use at least one transparency log service (such as SCITT) or equivalent, in addition to or instead of RFC 3161 TSA. Implementations SHOULD use multiple independent anchoring services for critical deployments.

Anchor Record Format

Merkle Tree Construction Events MUST be batched into a binary Merkle hash tree for efficient anchoring and selective disclosure. The tree construction uses SHA-256 as the hash function and follows a standard binary tree structure: If the number of leaves is not a power of two, the last leaf MUST be duplicated to complete the tree. The resulting Merkle root is submitted to the external anchoring service. Implementations MAY follow the tree construction specified in (Certificate Transparency) or any equivalent binary Merkle tree construction that produces deterministic, verifiable inclusion proofs. Merkle inclusion proofs enable selective disclosure: a verifier can confirm that a specific event is included in an anchored batch without accessing other events in the batch.

Completeness Invariant The Completeness Invariant is a mathematical guarantee that every "attempt" event has exactly one corresponding "outcome" event. This prevents selective logging -- the omission of inconvenient records. General form: The invariant enforces three properties:

Completeness: Every ATTEMPT has an outcome. Violation indicates missing events.
Uniqueness: Each ATTEMPT has exactly one outcome. Violation indicates duplicate records.
Referential Integrity: Every outcome contains a causal link to its originating ATTEMPT. Violation indicates orphan events.

Domain profiles MUST specify which event types constitute attempts and outcomes for the invariant. Each outcome event MUST contain a causal link field referencing the originating attempt event's identifier. Verification SHOULD account for a configurable grace period for in-flight operations.

Evidence Pack Specification An Evidence Pack is a self-contained, signed package of provenance events suitable for regulatory submission and third-party audit.

Pack Structure An Evidence Pack MUST contain:

manifest.json: Pack metadata and integrity information
events/: Event batches (max 10,000 events per file)
anchors/: External anchor records
merkle/: Merkle tree structure and selective disclosure proofs
keys/: Public keys for signature verification
signatures/: Pack-level signature

Pack Manifest The manifest MUST include the following fields:

pack_id (REQUIRED): UUIDv7 uniquely identifying this Evidence Pack.
vap_version (REQUIRED): VAP framework version (e.g., "1.2").
profile (REQUIRED): Object containing profile id and version.
conformance_level (REQUIRED): "Bronze", "Silver", or "Gold".
generated_at (REQUIRED): ISO 8601 timestamp of pack generation.
time_range (REQUIRED): Object with start and end ISO 8601 timestamps.
statistics (REQUIRED): Object containing total_events and events_by_type breakdown.
completeness_verification (REQUIRED for Silver+): Object containing invariant_type, invariant_valid boolean, and per-pipeline results.
integrity (REQUIRED): Object containing checksums (SHA-256 per file), merkle_root, and pack_hash.
external_anchors (REQUIRED for Silver+): Array of anchor records referencing this pack's time range.

The manifest MAY include additional profile-specific fields as defined by the domain profile specification.

Privacy-Preserving Verification VAP enables verification of system integrity without disclosure of sensitive data. This is achieved through:

Hash-based attestation: Sensitive fields are stored as cryptographic hashes, enabling existence verification without content disclosure.
Selective disclosure via Merkle proofs: Individual events can be proven to exist within an Evidence Pack without revealing other events.
Per-tenant salting: Hash salts are unique per tenant to prevent cross-tenant correlation attacks.

This mechanism is particularly critical for LAP, where attorney-client privilege prevents disclosure of consultation content while still requiring verifiable audit trails.

Retention Framework Retention Requirements by Conformance Level

Level	Events	Anchor Records	Evidence Packs	Keys
Bronze	6 months	N/A	On-demand	1 year after last use
Silver	2 years	5 years	2 years	3 years after last use
Gold	5 years	10 years	5 years	7 years after last use

Retention periods MUST be extended upon: regulatory investigation notification, legal hold orders, security or safety incidents, and third-party audit requests. Domain profiles MAY specify extended retention periods beyond the VAP baseline where domain-specific regulations require longer retention (see for LAP extensions). For privacy regulation compliance (e.g., "right to be forgotten"), implementations at Silver level and above SHOULD support crypto-shredding: encrypting personal data with per-user keys and deleting those keys to render the data cryptographically unrecoverable while preserving hash chain integrity.

Third-Party Verification Protocol Verification is available at three access levels:

Public: Access to Merkle roots only. Verifies anchor existence.
Auditor: Access to Evidence Packs. Full chain and completeness verification.
Regulator: Real-time API access (Gold level). Live monitoring capability.

Verification steps:

Anchor Verification: Confirm Merkle root in external timestamping service or transparency log.
Chain Verification: Validate hash chain integrity from genesis to latest event.
Signature Verification: Authenticate all events with public keys.
Completeness Verification: Check invariant for the time period.
Selective Query (optional): Verify specific events via Merkle proofs.

Legal AI Profile (LAP) Overview The Legal AI Profile (LAP) is a VAP domain profile for judicial AI and LegalTech systems. LAP addresses unique challenges in the legal domain:

Unauthorized Practice of Law Risk: Proving that AI does not independently practice law, through HUMAN_OVERRIDE events documenting attorney oversight.
Hallucination: Recording fact-check provenance through LEGAL_FACTCHECK events with citation chain verification.
Selective Logging: Preventing omission of inconvenient AI outputs through three-pipeline Completeness Invariant.
Attorney-Client Privilege: Maintaining confidentiality through privacy-preserving fields (prompt hashes instead of raw content).
Accountability Ambiguity: Recording "who, when, and on what basis" through the Accountability Layer.

Profile Registration LAP Profile Registration

Field	Value
Profile ID	LAP
Full Name	Legal AI Profile
Domain	Legal AI / LegalTech
Regulatory Scope	Attorney regulation, AI governance (informative)
Time Precision	Second
Profile Version	0.2.0

LAP Event Type Taxonomy LAP defines three functional pipelines and one cross-cutting control event type:

Pipeline 1: Legal Query AI-powered legal consultation:

LEGAL_QUERY_ATTEMPT: Question submission to AI
LEGAL_QUERY_RESPONSE: AI response generated successfully
LEGAL_QUERY_DENY: Response refused (content filter, unauthorized role)
LEGAL_QUERY_ERROR: System error (API failure, timeout)

Pipeline 2: Document Generation AI-assisted legal document drafting:

LEGAL_DOC_ATTEMPT: Document generation request
LEGAL_DOC_RESPONSE: Document generated successfully
LEGAL_DOC_DENY: Generation refused (insufficient consent, unauthorized)
LEGAL_DOC_ERROR: System error (API failure, parse error)

Pipeline 3: Fact Check AI-powered legal fact verification:

LEGAL_FACTCHECK_ATTEMPT: Fact-check request
LEGAL_FACTCHECK_RESPONSE: Fact-check completed
LEGAL_FACTCHECK_DENY: Fact-check refused (OPTIONAL)
LEGAL_FACTCHECK_ERROR: System error

Implementations MAY define LEGAL_FACTCHECK_DENY for cases where a fact-check request is refused due to rate limiting, insufficient permissions, or consent constraints. The deny_reason field SHOULD distinguish these from system errors. If an implementation does not support LEGAL_FACTCHECK_DENY, refusal conditions MUST be recorded as LEGAL_FACTCHECK_ERROR with a deny_equivalent indicator set to true in the error detail, ensuring the Completeness Invariant is maintained.

Cross-Cutting: Human Override HUMAN_OVERRIDE events record an attorney's review of any AI output:

APPROVE: Attorney confirms AI output without modification
MODIFY: Attorney edits AI output (modification hash recorded)
REJECT: Attorney rejects AI output entirely

HUMAN_OVERRIDE events reference the target outcome event via target_event_id (establishing a causal link) and include the attorney's identity (bar number hash), override type, and optional modification details.

LAP Completeness Invariant LAP applies the Completeness Invariant independently to all three pipelines: For implementations that do not support LEGAL_FACTCHECK_DENY, the invariant simplifies to ATTEMPT = RESPONSE + ERROR for Pipeline 3. Refusal conditions recorded as ERROR with deny_equivalent MUST be counted toward the invariant. Each outcome event MUST contain a causal link field referencing the originating attempt event's identifier, ensuring referential integrity can be verified independently of event ordering.

Override Coverage Metric HUMAN_OVERRIDE events are outside the Completeness Invariant but LAP defines Override Coverage as a critical operational metric: This metric quantifies the degree to which human professionals review AI outputs. In jurisdictions where regulations require that a licensed professional personally scrutinize AI-generated work products, this metric provides measurable evidence of compliance. Override Coverage Assessment

Coverage	Assessment	Operational Implication
100%	Ideal	Full professional oversight of all AI outputs
70-99%	Good	Majority reviewed; low-risk outputs may be excluded
30-69%	Warning	Insufficient review; operational improvement recommended
<30%	Critical	Professional oversight requirements likely unmet

ERROR events are excluded from the denominator because they do not produce an output suitable for professional approval or rejection. Completeness of error handling is evaluated separately via the per-pipeline invariant, where ERROR is a first-class outcome type.

LAP Privacy-Preserving Fields Legal AI handles extremely sensitive data protected by professional privilege. LAP extends VAP's privacy-preserving verification with the following hashed fields: LAP Privacy-Preserving Fields

Original Data	Hash Field	Sensitive Content
User query text	PromptHash	Legal consultation content (privileged)
AI response text	ResponseHash	AI-generated legal advice
Document output	OutputHash	Generated legal documents
Case number	CaseNumberHash	Case identifier (high specificity)
Bar number	BarNumberHash	Professional registration number
Party names	PartyHash	Personal information of parties
Modification detail	ModificationHash	Professional's corrections
Factcheck content	TargetContentHash	Content under verification

Hash computation uses per-tenant salts to prevent cross-tenant correlation. Third-party verifiers can confirm event existence and chain integrity without accessing privileged content.

LAP Conformance Level Mapping LAP Conformance Matrix

Requirement	Bronze	Silver	Gold
Hash Chain	Yes	Yes	Yes
Digital Signature	Yes	Yes	Yes
External Anchoring	No	Daily	Hourly
Completeness Invariant	No	3 Pipelines	3 Pipelines
Override Coverage Tracking	No	Yes	Yes (with alerts)
Evidence Pack	No	Yes	Yes
Privacy Hashing	No	Yes	Yes
HSM	No	No	Yes
Retention	6 months	3 years	10 years
Real-time Audit API	No	No	Yes

LAP extends the standard VAP retention periods. Silver level requires 3 years (vs. VAP baseline 2 years) and Gold requires 10 years (vs. VAP baseline 5 years). This extension is driven by the longer statutory limitation periods typical in legal proceedings across multiple jurisdictions.

LAP Regulatory Alignment (Informative) This section is entirely informative and non-normative. It illustrates how LAP audit trail capabilities can support compliance with various regulatory frameworks. Legal compliance determinations are jurisdiction-specific and require independent legal analysis.

Attorney Professional Regulation Many jurisdictions restrict the practice of law to licensed attorneys. Where AI systems assist attorneys in legal work, regulations may require that the attorney personally review and take responsibility for AI-generated outputs. LAP's HUMAN_OVERRIDE events and Override Coverage metric can support demonstrating such oversight. As an example, the Japanese Ministry of Justice guideline () establishes that AI-based legal services provided to attorneys are permissible when the attorney personally scrutinizes and modifies the output as necessary. LAP audit trails can help meet these expectations through:

Actor.role and BarNumberHash: supports verification that the user is a licensed attorney.
HUMAN_OVERRIDE (APPROVE/MODIFY): supports demonstrating attorney scrutiny.
ModificationHash: supports evidence of attorney modifications.

High-Risk AI System Governance Legal AI systems may be classified as high-risk under AI governance frameworks such as the EU AI Act (), particularly under Annex III "Administration of justice" category. LAP Silver level and above provides audit trail capabilities that can help satisfy record-keeping requirements, including:

Automatic event logging (supports Article 12 logging requirements)
Hash chain continuity (supports lifetime recording)
HUMAN_OVERRIDE events (supports human oversight documentation)
Causal links between events (supports traceability)

The degree to which these capabilities satisfy specific regulatory requirements should be evaluated on a per-jurisdiction basis.

Security Considerations VAP-LAP implementations face several security considerations:

Key Compromise: Compromise of signing keys allows event forgery. Bronze implementations SHOULD rotate keys annually. Silver MUST rotate semi-annually. Gold MUST use HSM storage and quarterly rotation.
Hash Collision Resistance: SHA-256 provides 128-bit collision resistance, considered sufficient for the foreseeable future. Implementations MUST support algorithm migration (crypto agility) for post-quantum transition.
Privacy Leakage: Per-tenant salting prevents cross-tenant hash correlation. Implementations MUST NOT share salts across tenants. Event metadata (timestamps, event types, counts) may leak statistical information even when content is hashed.
Availability Attacks: Denial-of-service attacks against the logging infrastructure could prevent event recording, violating completeness. Gold level implementations MUST have geographic redundancy.
External Anchor Trust: The security of external anchoring depends on the trusted timestamping service. Implementations SHOULD use multiple independent anchoring services for critical deployments.
Completeness Invariant Circumvention: An adversary with write access to the event store could insert fabricated ERROR events to satisfy the invariant while hiding actual outcomes. External anchoring at Silver level and above mitigates this by making post-hoc insertion detectable.
Clock and Time Source Integrity: Timestamp rollback or clock skew can cause false completeness verification failures and undermine event ordering guarantees. Implementations SHOULD use monotonic time sources and SHOULD cross-validate local timestamps against external anchoring timestamps. External anchoring at Silver level and above provides an independent time reference.

IANA Considerations This document has no IANA actions at this time. Future versions of this document might request registration of a media type for VAP Evidence Pack manifests (e.g., "application/vnd.vap.evidence-pack+json") and an IANA registry for VAP Domain Profile identifiers. Until then, profile identifiers are managed by the VeritasChain Standards Organization (VSO). The initial registered profiles are VCP (Finance), CAP (Content/Creative AI), and LAP (Legal AI).

Aspect	VCP (Finance)	CAP (Content)	LAP (Legal)
Time Precision	Nanosecond	Second	Second
Key Invariant	SIG to ORD	GEN_ATTEMPT to GEN/DENY/ERROR	3 Pipeline Invariants
Unique Feature	Signal integrity	Safe Refusal (SRP)	Human Override Coverage
Regulatory Focus	Financial regulation	Content regulation	Attorney regulation + AI governance
Privacy Model	Trade data	Creative content	Professional privilege
Retention (Gold)	5 years	5 years	10 years