Fine-tuning and Preference Alignment in a Single Streamlined Process

The Data Exchange with Ben Lorica - Een podcast door Ben Lorica - Donderdagen

Probeer Podimo de eerste 60! dagen gratis

Luister 30 dagen gratis naar exclusieve podcasts en duizenden luisterboeken

Categorieën:

Jiwoo Hong and Noah Lee of KAIST AI are co-authors of ORPO: Monolithic Preference Optimization without Reference Model. Subscribe to the Gradient Flow Newsletter: https://gradientflow.substack.com/ Subscribe: Apple • Spotify • Overcast • Pocket Casts • AntennaPod • Podcast Addict • Amazon • RSS. Detailed show notes can be found on The Data Exchange web site.

Visit the podcast's native language site