Post
1440
I was curious about the Block Diffusion hybrid model and tried retraining it on a DNA tokenizer + dataset 🧬. Too early to evaluate, but it generates sequences (AAATGG TTATTG CAAATC...) and was improving on the validation set during training
Model: monsoon-nlp/dna-blockdiff-papaya
Original paper: Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (2503.09573)
Model: monsoon-nlp/dna-blockdiff-papaya
Original paper: Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (2503.09573)