[NeurIPS 2025] Preference Optimization by Estimating the Ratio of the Data Distribution (BPO)
Official PyTorch implementation of BPO
Yeongmin Kim, Heesun Bae, Byeonghu Na, Il-Chul Moon
| Paper |
Code will be released soon.