LLaDA 1.5: Variance-Reduced Preference Optimization for Diffusion LLMs

3 points by gok 3 days ago | 0 comments