https://medium.com/swlh/data-processing-stack-overflow-data-using-apache-spark-on-aws-emr-3e889784ba70