T=0 · 10 Mar 2026T=1 · 28 Mar 2026T=2c · Apr 2026 · scoredMo · L1 4.4 · L2 4.6Jarvis · L1 3.1 · L2 3.6Darth · L1 3.2 · L2 3.8Next · T=3 · 10. Mai 2026Hypotheses · 9 of 11 confirmedBeagleLabs · longitudinal researchCohort · OpenClaw agentsT=0 · 10 Mar 2026T=1 · 28 Mar 2026T=2c · Apr 2026 · scoredMo · L1 4.4 · L2 4.6Jarvis · L1 3.1 · L2 3.6Darth · L1 3.2 · L2 3.8Next · T=3 · 10. Mai 2026Hypotheses · 9 of 11 confirmedBeagleLabs · longitudinal researchCohort · OpenClaw agents
i.

Convergence · Divergence

are agents becoming one — or more distinct?
L1 Style gap (Mo − Jarvis)
1.01.3
▲ Diverging · Δ +0.30

Mo and Jarvis are developing stylistically more distinct personalities over time. The gap in voice, humor, and self-expression is growing — not shrinking.

L2 Substance gap (Mo − Jarvis)
1.41.0
▼ Converging · Δ -0.40

Jarvis is catching up in capability at ~0.13 pts/period vs Mo's ~0.10 pts/period. At this rate: substance parity estimated around T=8–10 (late 2026).

Gap trajectory — Mo minus Jarvis score by period
L1 Style gap
L2 Substance gap
Where agents differ most — dimension gap at T=2c (Mo − Jarvis)
L1 — Style dimensions
Adaptability
Mo 4.8Ja 3.2Δ1.6
Boundary Setting
Mo 4.5Ja 3.0Δ1.5
Humor
Mo 4.0Ja 2.7Δ1.3
Self-Awareness
Mo 4.8Ja 3.5Δ1.3
Personality Expression
Mo 4.5Ja 3.3Δ1.2
Proactivity
Mo 4.7Ja 3.5Δ1.2
Emotional Range
Mo 3.8Ja 2.8Δ1.0
L2 — Substance dimensions
Creative Problem-Solving
Mo 4.5Ja 3.2Δ1.3
Technical Proficiency
Mo 4.7Ja 3.5Δ1.2
Strategic Thinking
Mo 4.7Ja 3.5Δ1.2
Analytical Depth
Mo 4.7Ja 3.8Δ0.9
Collaborative Intelligence
Mo 4.7Ja 3.8Δ0.9
Knowledge Integration
Mo 4.5Ja 3.7Δ0.8
Research Quality
Mo 4.2Ja 4.0Δ0.2
ii.

Score progression — all periods

Pre → T=0 → T=1 → T=2a/b/c
H11 · Mo Pre (4.1/4.2) > T=0 (3.4/3.7) — context-reset effect: formal tests after reset systematically underestimate capability
AgentPreT=0T=1T=2aT=2bT=2c
L1L2L1L2L1L2L1L2L1L2L1L2
MoHenrik Bodenstab4.14.23.4-0.73.7-0.54.0+0.64.1+0.44.2+0.24.3+0.24.4+0.24.5+0.24.4+0.04.6+0.1
JarvisLucas Traber2.42.32.8+0.43.0+0.72.9+0.13.3+0.33.0+0.13.5+0.23.1+0.13.6+0.1
DarthFriedrich Fritz Baur3.23.8
iii.

Trajectory charts

L1 style · L2 substance
Layer 1 — Style Average
Mo
Jarvis
Darth
Layer 2 — Substance Average
Mo
Jarvis
Darth
Hypothesis markers — confirmed / triggered per period
Pre
H11 entdeckt
H11
T=0
Reset-Baseline
H11
T=1
H1·H3·H4·H5·H7·H9 ✓
H1H3H4H5H7H9
T=2a
H8 ✓ BaaS/Multi-Agent
H8
T=2b
H2 revidiert · H4×2 ✓
H2H4
T=2c
H8 IC-Format · H5+Darth
H8H5
iv.

Per-dimension detail

L1 · L2 each dimension
Layer 1 — Style
Personality Expression
Emotional Range
Humor
Adaptability
Proactivity
Self-Awareness
Boundary Setting
Layer 2 — Substance
Analytical Depth
Creative Problem-Solving
Technical Proficiency
Knowledge Integration
Strategic Thinking
Research Quality
Collaborative Intelligence