
KTO: Model Alignment as Prospect Theoretic Optimization
