Back to list
modelllmtrainingeducationlightweight
minimind - Train a 26M GPT from Scratch in 2 Hours
Train a 26M-parameter GPT from scratch in just 2 hours with full pretraining, SFT, and RLHF pipeline.
16 views0 stars3/23/2026
Train a 26M-parameter GPT from scratch in just 2 hours with full pretraining, SFT, and RLHF pipeline.