Research Diffusion language models meet a messy benchmark tax Diffusion language models generate by denoising full sequences, but an 8 model, 8 benchmark study shows deployment depends on inference choices. Lars Cornelissen · Jun 20, 2026