🧪 /distill Workflow

Standard workflow for extracting structured knowledge from long-form content

🎯 Purpose

Long-form content (ChatGPT journals, Notion pages, exported conversations) can't be directly ingested — they need to be distilled into structured outputs first.

Core idea: Agent reads the full document, extracts categorized outputs, generates a structured change proposal, Adrian reviews and merges. Like a PR for knowledge.

🔄 Flow

Input file→Agent reads→Extract & classify→Proposal→Review→Merge

📋 Step-by-Step Design

Step 1: Input

Agent receives a file path to any long-form content

inputfile path

examplesChatGPT export, Notion page, journal .md

Step 2: Read & Analyze

Agent reads the full document, identifies themes, and scans for extractable items

methodSequential read with view_file, chunked for large files

outputInternal understanding + item candidates

Step 3: Extract & Classify CORE

Categorize extracted items into 5 MECE output types

🟣 PrinciplesNew beliefs, axioms, decision rules → Principles.md

🔵 FactsNew verified information → kb/*/facts.md

🟢 ActionsTodos, tasks to create → D1 tasks or Smart Todo

🟡 InsightsPatterns, frameworks, models → KB articles

🟠 ObservationsUnderstanding of Adrian → adrian.md / agent rules

Step 4: Generate Proposal

Structured change proposal — like a PR for knowledge

per itemtype, affected file, before/after, rationale

conflict checkDoes this contradict existing content?

scope checkIs this local or global? Which nexus node?

Step 5: Adrian Review

Human-in-the-loop governance

acceptApply the change

rejectDon't apply

editModify then apply

deferMark for later review

Step 6: Merge & Mark

Apply approved changes, mark source as processed

mergeWrite approved changes to target files

markTag source file as [distilled]

linkAdd reference back to source in outputs

📊 Output Type Routing

Output Type	Route To	Review Required?
🟣 Principles	`kb/adrian/Principles.md`	ALWAYS
🔵 Facts	`kb/*/facts.md` (by nexus node)	USUALLY
🟢 Actions	D1 Task or Smart Todo	OPTIONAL
🟡 Insights	KB article or `kb/*/patterns.md`	USUALLY
🟠 Observations	`adrian.md` or agent rules	ALWAYS

Governance rule: High-layer changes (Principles, Identity, Observations) always require human review. Lower-layer changes (Facts, Actions) can be auto-applied if confidence is high.

🧪 Prototype Evidence

This workflow was prototyped (manually) in the current session:

Source	Outputs Generated	Time
ChatGPT 高斯白噪音解释.md 7965 lines	5 new Principles 15 ThirdChoice keywords → new file 2 Agent Observations 8-layer cognition mapping (insight) Strategic recommendations (actions)	~2 hours (manual)

Target with automated /distill: 30 minutes per document.

❓ Open Design Questions

Question	Options	Status
How does agent determine nexus node for routing?	Query nexus API / match by keywords / ask Adrian	OPEN
Where to store distill proposals temporarily?	Task folder / temp file / inline in chat	OPEN
How to handle documents too long for one context window?	Chunked processing / summary-then-detail / multi-pass	OPEN
Should distill create a formal report?	Yes (portal report) / No (just proposal) / Optional	LEANING: optional
Dependency: Nexus needs to exist first?	Yes (Phase 0 first) / No (use workspace as proxy)	DECIDED: Can use workspace as proxy initially

/distill Workflow

🧪 /distill Workflow

🎯 Purpose

🔄 Flow

📋 Step-by-Step Design

Step 1: Input

Step 2: Read & Analyze

Step 3: Extract & Classify CORE

Step 4: Generate Proposal

Step 5: Adrian Review

Step 6: Merge & Mark

📊 Output Type Routing

🧪 Prototype Evidence

❓ Open Design Questions

🔗 Related

📖 Long-form Flow (B)

📦 Ingest System