https://medium.com/@monocosmo77/how-direct-preference-optimization-works-part6-machine-learning-2024-de758dcd3ddb