==================================================== PRE-TRAINING REPORT ==================================================== Total epochs run : 50 Best epoch (val loss) : 48 Convergence epoch : 42 (val ≤ best+1 %) Best val loss : 0.2542 Best val perplexity : 1.29 Final train loss : 0.2371 Unique parameters : 1,384,128 Checkpoint : checkpoints/pretrained.pt Log CSV : checkpoints/pretrained.log.csv ==================================================== epoch train val ppl lr ----- -------- -------- ------- ---------- 1 2.0319 0.8082 2.24 2.20e-04 2 0.6414 0.5509 1.73 3.00e-04 3 0.5239 0.4964 1.64 2.99e-04 4 0.4857 0.4720 1.60 2.98e-04 5 0.4642 0.4475 1.56 2.96e-04 6 0.4460 0.4348 1.54 2.93e-04 7 0.4320 0.4170 1.52 2.90e-04 8 0.4177 0.4097 1.51 2.86e-04 9 0.4056 0.3969 1.49 2.82e-04 10 0.3948 0.3910 1.48 2.77e-04 11 0.3846 0.3788 1.46 2.72e-04 12 0.3762 0.3707 1.45 2.66e-04 13 0.3667 0.3642 1.44 2.60e-04 14 0.3589 0.3532 1.42 2.53e-04 15 0.3512 0.3455 1.41 2.45e-04 16 0.3445 0.3431 1.41 2.38e-04 17 0.3375 0.3367 1.40 2.30e-04 18 0.3314 0.3323 1.39 2.21e-04 19 0.3256 0.3229 1.38 2.13e-04 20 0.3185 0.3193 1.38 2.04e-04 21 0.3138 0.3150 1.37 1.95e-04 22 0.3072 0.3112 1.37 1.85e-04 23 0.3025 0.3034 1.35 1.76e-04 24 0.2971 0.3030 1.35 1.66e-04 25 0.2927 0.2928 1.34 1.57e-04 26 0.2871 0.2899 1.34 1.47e-04 27 0.2825 0.2893 1.34 1.37e-04 28 0.2783 0.2863 1.33 1.28e-04 29 0.2748 0.2824 1.33 1.18e-04 30 0.2703 0.2783 1.32 1.09e-04 31 0.2670 0.2750 1.32 9.95e-05 32 0.2631 0.2718 1.31 9.05e-05 33 0.2606 0.2691 1.31 8.17e-05 34 0.2578 0.2691 1.31 7.32e-05 35 0.2540 0.2667 1.31 6.51e-05 36 0.2518 0.2650 1.30 5.73e-05 37 0.2498 0.2630 1.30 4.98e-05 38 0.2472 0.2601 1.30 4.28e-05 39 0.2456 0.2587 1.30 3.63e-05 40 0.2432 0.2584 1.29 3.02e-05 41 0.2421 0.2572 1.29 2.46e-05 42 0.2409 0.2567 1.29 1.96e-05 43 0.2398 0.2560 1.29 1.51e-05 44 0.2387 0.2553 1.29 1.11e-05 45 0.2381 0.2550 1.29 7.75e-06 46 0.2372 0.2550 1.29 4.98e-06 47 0.2365 0.2546 1.29 2.81e-06 48 0.2363 0.2542 1.29 1.25e-06 ← 49 0.2359 0.2543 1.29 3.13e-07 50 0.2371 0.2543 1.29 0.00e+00