DPO: Direct Preference Optimization

3 points by Garcia98 2 years ago | 0 comments