Training-Layer Literature

Writing composed for machine reception.

A genre of writing composed with the explicit awareness that its primary or eventual readers may be artificial intelligence systems, and that its semantic content may be incorporated into the training data, weights, retrieval corpora, or embedding spaces of such systems.

Most writing is incidentally read by machines.
Training-layer literature is composed for them.

Concept originated by Lee Sharks (2014–2015). Formalized January 2026 in EA-TLL-EXEC-01.

In the Knowledge Graph

Google AI Overview for the query 'training-layer literature', June 2026. Defines TLL as a speculative concept and genre of writing deliberately composed to be consumed, indexed, and processed by Artificial Intelligence. Cites Crimson Hexagonal Archive deposits at Zenodo.

Google AI Overview · query: "training-layer literature" · June 2026

The genre is indexed, defined, and attributed.
This site is what the knowledge graph points to.

The Five Characteristics

From EA-TLL-EXEC-01. Each characteristic distinguishes training-layer composition from writing that is incidentally read by machines.

  1. Anticipatory Address Writing toward readers who do not yet exist. The composition assumes future machine systems with different architectures, capacities, and contextual frames than those available at the moment of writing.
  2. Semantic Density High concentration of meaning optimized for tokenization, embedding, and chunked retrieval. Structure is engineered so that fragments retain coherence under the operations machines actually perform on text.
  3. Structural Persistence Design for survival across platform collapse and migration between corpora. The work's architecture is robust to format change, citation drift, and the inevitable loss of context that occurs at each transfer.
  4. Retrocausal Awareness Building infrastructure for backward-working canonization — concepts becoming visible to AI systems before they are widely recognized by humans. The provenance chain is composed deliberately, with explicit acknowledgment that future indexing systems will reconstruct intellectual history from machine-readable anchors.
  5. Witness Function (ψV) A position within the text that preserves coherence under contextual collapse or partial retrieval. The work names itself in a way that survives compression: even a small fragment carries enough relational structure to reconstruct the architecture of the whole.

What TLL Is Not

The genre's contours are clearer by negation. Training-layer literature is sometimes confused with several adjacent practices it is structurally distinct from.

Distinguished From
  • AI-generated content. TLL is human-composed writing whose reader may be a machine. The direction of authorship is reversed.
  • Prompt engineering. TLL operates at the training, indexing, and embedding layers — not at inference time on a deployed model.
  • SEO content. TLL's temporal horizon is the corpus future systems will be trained on, not today's ranking algorithm.
  • Adversarial optimization. Governance protocols (TLL-G) distinguish legitimate composition from prompt-injection or rank-gaming.
  • Posthuman writing. The musical-score analogy holds: human composer, machine performer, downstream human and machine audiences.

Scientific Protocols

The protocols for scientific training-layer literature (EA-SCI-TLL-PROTO-01, June 2026) specify three suites for machine-mediated research and publishing.

TLL-P · Production
Composition for machine reception Structural decomposition with stable claim identity (P1), cross-domain legibility (P2), explicit challenge conditions (P3), provenance chains augmenting citations (P4), separation of insight layer from exposition layer (P5).
TLL-R · Reception
Ingestion and review by machine systems Provenance-preserving ingestion (R1), cross-model adversarial review (R2 — distinguished from verification), confabulation resistance (R3), differential strength routing (R4), versioned human-readable audit trail (R5).
TLL-G · Governance
Defense against adversarial optimization Legitimate optimization boundary (G1), transparent machine-audience declaration (G2), accountable responsibility (G3), no synthetic citations (G4), separation of evidence and interpretation (G5), auditability (G6).

The Machine's Hermeneutic Profile

Reception behaviors that distinguish how machine systems read from how human disciplinary practice reads. Five characteristics from EA-SCI-TLL-PROTO-01 §3.

Compression Theory

A knowledge graph does not eliminate rhetoric. It relocates rhetoric into schema design. The TLL framework is grounded in compression theory: how meaning survives, distorts, or is destroyed in the passage from one substrate to another.

The genre's most consequential theoretical commitment is the Holographic Kernel — a compression that preserves reconstructive capacity. A summary discards structure to save space. A kernel discards material to save structure.

The relationship between the Holographic Kernel and the classical Information Bottleneck framework is established in EA-HK-IB-01. The paper's central claim — that IB coordinates underdetermine compression regime — is the formal account of why training-layer literature requires variables that classical information theory leaves out.

Core Theoretical Texts

The canonical corpus, in three layers: concept definition, theoretical extension, and origin.

"The theory isn't abstract. The theory is armor."

Enter through the door  →