canbingol/gemma3_1B_base-tr-cpt-only_4th_stage_data Text Generation • 1.0B • Updated 15 days ago • 674
canbingol/gemma3_1B_base-tr-cpt-only_3rd_stage_data Text Generation • 1.0B • Updated 16 days ago • 392
canbingol/gemma3_1B_base-tr-cpt-only_2nd_stage_data Text Generation • 1.0B • Updated 18 days ago • 1.07k
canbingol/exp6_flash_attn_1epoch_lr1e4_500k_vngr_corpus_10layers Text Generation • Updated Feb 23 • 9
canbingol/exp8_sdpa_1epoch_lr1e4_500k_vngr_corpus_1-epoch_kaggle Text Generation • Updated Feb 11 • 2
canbingol/exp7_sdpa_1epoch_lr1e4_500k_vngr_corpus_2-epoch-kaggle Text Generation • Updated Feb 11 • 2