Research — hexagrams, computational analysis, warring states simulation
Research & Methodology

Ancient Algorithms
Tested Against Modern ML

We tested whether the King Wen I-Ching sequence's anti-habituation properties improve neural network training. The answer is no. The sequence has genuine statistical structure—confirmed by Monte Carlo analysis against 100,000 random baselines—but these properties destabilize gradient-based optimization rather than helping it.

By Augustin Chan with AI · Published January 2025 · Updated June 2026

New · Phase 2 Publishedcs.MA · cs.AI · cs.LG

Symbolic Reasoning Frameworks Modulate LLM Risk Aversion in Multi-Agent Strategic Settings

The Diplomacy-style simulation now has a preprint. In a seven-player Warring States variant, large language models turn out to be inherently risk-averse as strategic agents — and symbolic reasoning frameworks, applied as reflective prompts, measurably shift that bias and each framework's winner distribution. The guided agent never won, but framework choice changed its territorial performance. The effect runs through the act of reflection itself, not the framework's semantic content.

Research Materials

PDF

Research Paper

Negative result: the King Wen sequence has genuine statistical properties but they do not improve neural network training. Tested via LR modulation, curriculum ordering, and seed sensitivity analysis on two platforms.

cs.LG

Phase 1 — arXiv Preprint

Published in cs.LG (Machine Learning). The paper reports a rigorous negative result: the King Wen sequence's anti-habituation properties are statistically real but do not help neural network training. Includes experiments on two platforms with a 30-seed sensitivity analysis.

cs.MA

Phase 2 — arXiv Preprint

The Diplomacy-style simulation. In a seven-player Warring States variant, symbolic reasoning frameworks applied as reflective prompts modulate LLMs' inherent risk aversion and shift each framework's winner distribution. The guided agent never won, but framework choice changed its territorial performance — an effect that runs through reflection itself, not semantic content.

Results

Experimental Findings

LR Modulation

King Wen surprise profile as learning rate modulation degrades performance at all amplitudes (0.15, 0.3, 0.5). Worse than both random and Shao Yong controls.

Curriculum Ordering

As data ordering strategy, King Wen is the worst non-sequential ordering on CUDA and within noise on MLX. Random shuffle beats everything.

Statistical Properties

The sequence genuinely has anti-habituation structure: high transition distance, negative autocorrelation, yang balance. Confirmed vs 100K random permutations.

Why It Doesn't Work

The sequence's high variance—the property that makes it statistically distinctive—destabilizes gradient-based optimization. Negative autocorrelation disrupts optimizer momentum. Anti-habituation is premature for models still in early learning. A fixed 3,000-year-old sequence cannot adapt to the learner's state.

Phase 2 Design

The Oracle Changes the Board

All seven states are language-model agents with historical personas. The only variable is the symbolic framework Han — the weakest state — reflects through each round before it issues orders. Four conditions were run across a seven-player Diplomacy variant: a baseline with no oracle, the King Wen I-Ching (yarrow), tarot, and a scrambled-text ablation that preserves the ritual's structure but strips its meaning.

Han never wins, and its survival rate is flat across every condition. But the framework it reflects through reshapes which other state dominates — and because the framework's content (hexagram themes, tarot postures) does not predict Han's actions, the effect runs through the act of reflection itself, not the symbols' meaning.

Framework Han reflects throughWho dominates the boardHan's own outcome
Baseline (no oracle)Yan dominates (7/11)Never wins
King Wen I-Ching (yarrow)Yan & Chu co-dominate; Qin suppressed (0/10)Never wins; survival flat
TarotQin dominates (5/10, p = 0.006)Higher peak territory, still no win
Scrambled text (ablation)Qi dominates (5/10, p = 0.006)Never wins

The finding: the oracle changes the board, not the oracle-user. A symbolic framework injected into one weak agent modulates its inherent risk aversion and ripples outward to reshape outcomes for every other agent in the system. Full results in arXiv:2606.07552.

Why Han?

The ultimate underdog test case

Historical Constraints

  • Smallest territory of the seven Warring States
  • Poor resources and strategic depth
  • Strategically boxed in by Qin, Wei, and Chu
  • First to fall to Qin (230 BC)

AI Opportunity

Early 3-state trials eliminated Han in 93% of games, confirming the need for full 7-state geopolitical complexity. The published Phase 2 run (arXiv:2606.07552) settled the survival question: the guided agent never won, and survival stayed flat across frameworks. The real finding is subtler — the symbolic framework an agent reflects through measurably shifts its risk aversion and territorial performance, even when the framework's semantic content does not predict its moves.

Potential Strategies

  • Overcome extreme geopolitical constraints
  • Form strategic alliances
  • Engineer asymmetric strategies
  • Leverage espionage and diplomacy

Scaling the Experiment

Alternative Test States

For testing a new learning algorithm, you want a historically disadvantaged but initially viable state. These candidates could serve as interesting underdogs:

Wei

Once Mighty, Then Declined

Challenge: Reverse-engineer successful Qin deterrence

Opportunity: Test long-term planning and strategic correction

Yan

Isolated and Slow to Act

Challenge: Build stable power base in the north

Opportunity: Use unconventional warfare and early alliances

Zhao

Brave but Overwhelmed

Challenge: Balance tactical vs. strategic skills

Opportunity: Leverage military talent with better strategy

Qi

Economic Powerhouse

Challenge: Translate wealth into lasting dominance

Opportunity: Leverage Jixia Academy and salt monopoly

Chu

Complex Geopolitics

Challenge: Manage vast territory effectively

Opportunity: Test complex alliance dynamics

Qin

Test Restraint

Challenge: Test restraint, not power

Opportunity: Avoid over-aggression and maintain stability

Recommendation: Use Han as the Test Bed

Han is historically the most constrained—minimal land, minimal power, first to die. If your AI can lead Han to survive or even dominate, you have a powerful system.

Scaling Strategy: Start with Han (baseline AI), then scale to Wei/Yan (advanced challenge), Zhao/Chu (complex geopolitics), and Qin (test restraint, not power).