https://medium.com/top-python-libraries/top-10-pyspark-functionalities-every-data-engineer-should-know-5abbda1bfe3c