https://medium.com/@monocosmo77/how-direct-preference-optimization-works-part7-machine-learning-2024-251a6c6feeb8