d9585ec008
val loss 1.19 → 0.77, val perplexity 3.29 → 2.15. Best epoch 20, early stop at epoch 30 (patience=10). Improvement over previous lr=1e-5 run (best val ppl 2.22).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>