https://medium.com/@monocosmo77/how-policy-optimization-works-part2-artificial-intelligence-159c82bb6a60