https://towardsdatascience.com/a-project-driven-approach-to-learning-pyspark-4533c85f52b3