hamori

tech/hamori

Fork 0

Commit Graph

Author	SHA1	Message	Date
H1K0	8a73394df9	data: update pretrained checkpoint results (BAR-free tokenizer) Re-run pre-training results with the corrected 84-token vocabulary and max_seq_len=320. Previous checkpoint was trained on stale data with BAR tokens and a corrupted tokenizer. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-20 14:28:00 +03:00
H1K0	329952b02e	data: add pre-training results from Google Colab run Includes log CSV (50 epochs), loss-curve plot, and report. Training ran on Colab GPU (T4). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-20 13:10:34 +03:00

Author

SHA1

Message

Date

H1K0

8a73394df9

data: update pretrained checkpoint results (BAR-free tokenizer)

Re-run pre-training results with the corrected 84-token vocabulary and
max_seq_len=320.  Previous checkpoint was trained on stale data with BAR
tokens and a corrupted tokenizer.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-20 14:28:00 +03:00

H1K0

329952b02e

data: add pre-training results from Google Colab run

Includes log CSV (50 epochs), loss-curve plot, and report.
Training ran on Colab GPU (T4).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-05-20 13:10:34 +03:00

2 Commits