Files
hamori/checkpoints/pretrained.log.csv
H1K0 8a73394df9 data: update pretrained checkpoint results (BAR-free tokenizer)
Re-run pre-training results with the corrected 84-token vocabulary and
max_seq_len=320.  Previous checkpoint was trained on stale data with BAR
tokens and a corrupted tokenizer.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-05-20 14:28:00 +03:00

2.2 KiB

1epochtrain_lossval_lossval_ppllrelapsed_s
212.0431050.8603802.362.205000e-0413.1
320.6824360.5872711.802.998721e-0411.9
430.5679410.5448751.722.991598e-0412.0
540.5294460.5129121.672.978255e-0412.5
650.5054090.4908171.632.958747e-0412.4
760.4848910.4717181.602.933156e-0412.5
870.4671220.4569031.582.901587e-0412.7
980.4502300.4428131.562.864174e-0412.9
1090.4358960.4284901.532.821072e-0413.1
11100.4256300.4200621.522.772460e-0413.1
12110.4148100.4111511.512.718542e-0412.9
13120.4054920.4096871.512.659542e-0412.9
14130.3968820.3919231.482.595706e-0412.9
15140.3876160.3872741.472.527301e-0412.8
16150.3791350.3851161.472.454612e-0412.9
17160.3717480.3745181.452.377941e-0413.0
18170.3644970.3672601.442.297610e-0412.9
19180.3574270.3645241.442.213952e-0412.9
20190.3503120.3585401.432.127316e-0412.9
21200.3429510.3498011.422.038065e-0412.9
22210.3376510.3437821.411.946569e-0412.8
23220.3308090.3370081.401.853211e-0412.8
24230.3247710.3323361.391.758381e-0412.8
25240.3193910.3249071.381.662472e-0412.8
26250.3140730.3215011.381.565886e-0412.9
27260.3098130.3177181.371.469026e-0412.8
28270.3042610.3134381.371.372294e-0412.9
29280.2999980.3107631.361.276095e-0412.9
30290.2950390.3072411.361.180830e-0412.9
31300.2901080.3034461.351.086896e-0412.8
32310.2880200.3020411.359.946846e-0512.8
33320.2835070.2993171.359.045806e-0512.8
34330.2805220.2948161.348.169597e-0512.8
35340.2758770.2919191.347.321873e-0512.9
36350.2736870.2888191.336.506170e-0512.8
37360.2705660.2878311.335.725888e-0513.0
38370.2678930.2865151.334.984283e-0513.0
39380.2659960.2847561.334.284447e-0513.0
40390.2645270.2836631.333.629298e-0513.0
41400.2622610.2827171.333.021569e-0512.9
42410.2608120.2821751.332.463794e-0512.8
43420.2588720.2807041.321.958300e-0512.8
44430.2578640.2802041.321.507193e-0512.8
45440.2567700.2793581.321.112356e-0512.8
46450.2549420.2792631.327.754357e-0613.0
47460.2555600.2788731.324.978363e-0612.8
48470.2550110.2786501.322.807158e-0612.9
49480.2543040.2785831.321.249797e-0612.8
50490.2524420.2784811.323.127754e-0712.8
51500.2538670.2784941.320.000000e+0012.8