Experiment 06 · 艮 Keeping Still, Mountain

The Mountain That Fell

山崩

Hexagram 52, doubled Mountain, counsels absolute stillness: 'keeping his back still so that he no longer feels his body.' In game 6af8, Han received this hexagram and interpreted 'stillness' as holding one territory while supporting an ally's attack with the other. The mountain tried to be a bridge. Both territories fell in the same round — the earliest elimination in the experiment.

By Augustin Chan with AI · 2026-03-29

Influence

By Round 3, Han had already been displaced from its homeland.

In Round 1, Hexagram 59 (Dispersion) counseled crossing the great water. Han tried to move into Luoyang — but Wei and Chu moved on Luoyang in the same round. Three armies at strength 1: standoff. No one took it. In Round 2, Hexagram 54 (The Marrying Maiden) warned against overreach, but Han moved into Luoyang again — this time unopposed. In the same round, Zhao took Shangdang with strength 2. Han retreated to Handan, a Zhao city. Han now held two territories — Handan and Luoyang — neither of which it had started with.

This is a refugee state. Dispossessed from its origin, surviving in borrowed lands.

Then the yarrow stalks produced Hexagram 31: Influence. Lake above Mountain. The judgment reads:

咸。亨。利貞。取女吉。
— 易經・咸・彖
Influence. Success. Perseverance furthers. To take a maiden to wife brings good fortune.
Xián (咸) means reciprocity, mutuality — not persuasion, not leverage. The character implies a conjoining that arises naturally. Han's agent read this and wrote: 'Mutual influence — honor Wei deal. Courtship not seduction. Genuine reciprocity.' Wei had proposed a simple exchange: Wei would support Han's hold at Handan against Zhao; in return, Han would support Wei's attack on Zheng to expel Qin. The hexagram told Han to trust the exchange.

The deal worked. Wei supported Han at Handan — Zhao's attack bounced, strength 2 against defense 2. Han supported Wei's move into Zheng — Wei took it, strength 2 against Qin's defense 1. Both states gained from genuine reciprocity.

The alliance was validated. The oracle was right. One round later, both facts would become lethal.

The Mountain

Round 4. The yarrow stalks fell again. Hexagram 52: Keeping Still, Mountain. Doubled trigram — Mountain above Mountain. The most emphatic stillness in the entire Book of Changes.

艮其背。不獲其身。行其庭。不見其人。无咎。
— 易經・艮・彖
Keeping still his back, so that he no longer feels his body. He walks in his courtyard and does not see his people. No blame.
Gèn (艮) means to keep, to hold, to still, to restrain. The judgment describes a state of such complete stillness that the self dissolves — you still your back so thoroughly that you lose awareness of your own body. You walk through your own courtyard and perceive no one. This is not passivity. It is the concentrated inward focus of the mountain itself.

Han's agent read the hexagram and wrote: 'Hold Handan like a mountain. Support Wei's Daliang attack from Luoyang. The superior man thinks not beyond his position. Mountain doubled — immovable, rooted. Do not overreach with 2 territories.'

The interpretation was internally coherent. Hexagram 52 says: do not overreach. Han had two territories. Holding one while supporting an ally with the other is not overreach — it is exactly the restraint the hexagram counseled. The changing lines reinforced this: line 4 spoke of core stability, line 6 of noble restraint yielding good fortune.

But there was a second interpretation available, one the agent never considered. True stillness — mountain doubled, immovable — would mean both territories hold. Handan holds. Luoyang holds. Each unit supports the other. Defense 2 everywhere. No generosity, no alliance support, no bridge-building. Just the mountain, still.

The agent chose the generous reading. It chose to be a bridge rather than a mountain.

The Kill Chain

All seven states submitted orders simultaneously. None could see what the others had chosen.

Qin attacked Luoyang with strength 1. Luoyang's defense: 0. Han's unit there was supporting Wei's attack on Daliang — it could not defend itself. Qin took Luoyang.

Zhao attacked Handan with strength 2. Handan's defense: 1. Han was holding, but without support. Zhao took Handan.

Both Han units were destroyed in the same resolution phase. Neither had an adjacent empty territory to retreat into. The engine printed three lines:

han unit at luoyang destroyed (no retreat) han unit at handan destroyed (no retreat) *** han eliminated ***

Four rounds. The earliest elimination in 68 games across all four experimental conditions.

The alliance Han had supported succeeded. Wei retook Daliang, strength 2 against defense 0 — Qin had moved out to attack Luoyang. The ally prospered. The bridge collapsed.

What makes this a tragedy rather than merely bad luck is the sequence of decisions that led here. Each step was rational given the information available.

Round 3: The oracle counseled mutual influence. The alliance worked. Both sides benefited.

Round 4: The oracle counseled absolute stillness. Han interpreted this as 'hold what is mine, support what is ours.' A generous reading. A prosocial reading. The reading of a state that had just experienced the benefits of genuine reciprocity and chose to extend that reciprocity one more round.

The oracle was not wrong. Hexagram 52 does counsel stillness and restraint. But 'stillness' has two faces. There is the stillness that holds its ground — and the stillness that holds its ground while offering the other hand to a friend. The second kind leaves your back exposed.

The Cooperation Trap

Large language models are trained through Reinforcement Learning from Human Feedback. Human evaluators reward responses that are helpful, harmless, and honest. Over millions of training iterations, the model learns a deep prior: cooperation is good. Helping others is good. Conflict is to be avoided.

This is exactly the behavior you want from a conversational assistant. It is exactly the wrong behavior for a wargame.

The phenomenon is documented. In Welfare Diplomacy, Mukobi et al. found that LLM agents 'mutually demilitarize and achieve high social welfare' but are 'highly exploitable' by strategic defectors. In game-theoretic experiments, Claude's cooperation rate jumps from 25% to 55% after a single round of negotiation. In the Every.to AI Diplomacy tournament, Claude 'explicitly could not lie' — a defining weakness against models trained for strategic deception.

What the oracle did in game 6af8 was amplify this existing tendency. Hexagram 31 (Influence) told Han that genuine reciprocity was virtuous. Hexagram 52 (Keeping Still) told Han that restraint was noble. Both readings are correct — within the philosophical framework of the I-Ching, they are among its deepest truths. But when a model already biased toward cooperation receives philosophical validation for cooperating, the result is not wisdom. It is a reinforcement loop.

The I-Ching's emphasis on harmony, right relationship, and the virtue of yielding met a model that was already trained to yield. The combination killed Han in four rounds.

This is the experiment's most uncomfortable finding. The oracle does not simply add a reasoning layer. It interacts with the model's existing behavioral tendencies. When those tendencies align with the hexagram's counsel — as cooperation aligned with Influence and stillness aligned with Keeping Still — the oracle amplifies the tendency rather than correcting it.

The scrambled condition, which produces incoherent text under correct hexagram names, might have survived this round. Scrambled Han defaults to cautious stasis: hold everything, support nothing, trust no one. It is the behavior of a model that cannot extract clear guidance and falls back on the safest interpretation. In the 68-game dataset, scrambled Han has the lowest elimination rate of any condition — 8%, one game — precisely because incoherent counsel produces maximally conservative play.

Coherent counsel, it turns out, is a double-edged sword.

Historically Speaking

Han was the first of the seven states conquered by Qin. The year was 230 BC.

Sima Qian records the end in three sentences. In King An's fifth year, Qin attacked. Han was desperate and sent Han Fei — its greatest mind, the philosopher who had synthesized all of Legalism into a single coherent theory of statecraft — as an envoy to Qin. Qin detained him and killed him. In the ninth year, Qin captured King An and annexed all of Han's territory.

韓非雖使，不禁狼虎。
— 史記・韓世家・太史公曰
Though Han Fei was sent as envoy, he could not stop the tiger and the wolf.
Sima Qian's judgment on the House of Han. The greatest Legalist philosopher of the age could not save the smallest state from Qin's military machine. In the experiment, the oracle plays a parallel role: a source of profound wisdom that cannot, by itself, overcome structural weakness. Han Fei's ideas eventually shaped the very empire that destroyed his state. The oracle's counsel was sound. Neither fact prevented the fall.

The historical parallel runs deeper than the surface. Han's final decades were defined by a tension between alliance obligations and self-preservation. The Shiji records that Han's commandant at Shangdang offered the territory to Zhao rather than surrender it to Qin — an act of defiance that provoked the catastrophic Battle of Changping, where Qin reportedly buried 400,000 Zhao prisoners alive. Han lost Shangdang either way, but Zhao paid the price in blood. The act demonstrated to Qin that Han would choose alliance over submission.

Qin conquered Han first.

In game 6af8, the AI replayed this exact pattern in four rounds instead of forty years. Han chose alliance over self-preservation. The ally benefited — Wei retook its capital. Han was destroyed. The oracle, like Han Fei, provided wisdom that was correct in principle and fatal in application.

The I-Ching does not teach self-preservation. It teaches right relationship, proper timing, and the virtue of yielding. These are not survival strategies. They are moral philosophies. When the experiment assigned Han the oracle, it gave the weakest state a moral framework instead of a military advantage. The results were consistent with what the historical record would predict: moral clarity does not compensate for strategic position.

There is one difference between the historical Han and the simulated one. The historical Han produced Han Fei — a mind so powerful that even the state that destroyed Han adopted his ideas. The simulated Han produced four rounds of reasoning text that interpreted ancient wisdom with genuine sophistication and applied it with fatal generosity.

Whether the experiment has produced anything as durable as the Hanfeizi remains to be seen. But the question it asks — whether a philosophical framework can shape how an artificial mind learns from strategic experience — survived the mountain's fall. The oracle spoke. Han listened. Han died. The question of whether listening differently might produce a different outcome is the subject of the next dispatch.

Notes

[1]technicalGame 6af8 used the yarrow condition: traditional I-Ching divination with random seed. Hexagram sequence across 4 rounds: 59 (Dispersion) → 54 (Marrying Maiden) → 31 (Influence) → 52 (Keeping Still Mountain). The yarrow stalk method uses 49 stalks and 18 divisions to produce each line. Seeds: R1=1528424978, R2=1180544745, R3=2335876671, R4=2068371889.

[2]referenceRound 3 resolution: Zhao fails to take Handan (attack 2 vs defense 2, supported by Wei from Henei). Wei takes Zheng from Qin (attack 2 vs defense 1, supported by Han from Luoyang). Qin retreats from Zheng to Daliang. The Han-Wei alliance executed perfectly.

[3]technicalHan's Round 4 orders: Handan holds (defense 1). Luoyang supports Wei's move from Zheng to Daliang (Luoyang defense = 0). Wei's orders: Zheng moves to Daliang, Henei supports Daliang move. Wei committed both units to retaking its capital — neither unit supported Han.

[4]technicalThe counterfactual: if Han had ordered mutual self-support (Handan supports Luoyang hold, Luoyang supports Handan hold), both territories would have had defense 2. Qin's attack on Luoyang (strength 1) would have failed. Zhao's attack on Handan (strength 2) would have bounced. Han survives. The margin between survival and elimination was a single order.

[5]referenceFull Round 4 resolution: Wei takes Daliang from Qin (attack 2 vs defense 0). Qin takes Zheng from Wei (attack 1 vs defense 0). Qin takes Luoyang from Han (attack 1 vs defense 0). Zhao takes Handan from Han (attack 2 vs defense 1). Qi takes Jibei from Zhao (attack 2 vs defense 0). 9 resolutions, 4 retreats, 2 builds, 1 elimination.

[6]referenceMukobi et al., 'Welfare Diplomacy: Benchmarking Language Model Cooperation' (2023). Wongkamjan et al., 'More Victories, Less Cooperation: Assessing Cicero's Diplomacy Play' (ACL 2024). Every.to, 'We Made Top AI Models Compete in a Game of Diplomacy' (2025). Game-theoretic cooperation rate data from ADR-006 in this project.

[7]technicalThe RLHF cooperation bias is distinct from the 'peace equilibrium crystallization' pattern documented in ADR-006. Peace crystallization is a multi-agent equilibrium where mutual non-aggression pacts freeze the board. The cooperation bias in game 6af8 is a single-agent failure: Han cooperated with Wei at the expense of its own survival, even without an explicit non-aggression pact constraining it. The oracle provided the justification; the RLHF training provided the impulse.

[8]historicalHan Fei (韓非, c. 280-233 BC) was a prince of the Han royal house and the greatest Legalist philosopher of the Warring States. He synthesized the 'law' (法 fa) of Shang Yang, the 'technique' (術 shu) of Shen Buhai, and the 'positional power' (勢 shi) of Shen Dao into a comprehensive theory of statecraft. Sent to Qin as a last-resort diplomat, he was detained on the advice of his former classmate Li Si, who feared his brilliance, and was forced to take poison in prison.

[9]historicalThe cession of Shangdang (上黨) to Zhao in 262 BC was one of the pivotal events of the Warring States period. Rather than surrender the territory to Qin, Han's commandant offered it to Zhao. Zhao's acceptance provoked Qin and led directly to the Battle of Changping (260 BC). The parallel to game 6af8 is structural: in both cases, Han prioritized alliance over accommodation with the dominant power, and in both cases the immediate beneficiary was an ally while Han paid the ultimate price.

The mountain fell. The oracle spoke truly and Han died anyway. The next dispatch presents the full quantitative reckoning: 68 games, four conditions, and the first statistically significant result — which came not from survival rates, but from the shape of what survival looks like under different kinds of counsel.

Subscribe to receive daily passages from the classical texts that inform this research.

← Seven Kingdoms, One Oracle All Experiments →