This homework need implement Batch Gradient Descent to fit a line into a two dimensional data set. You will implement a set of Spark jobs that will learn parameters for such line from the New York City Taxi trip reports in the Year 2013.
All the homework demand has been uploaded on file. Please only write the pyspark code for small data that I uploaded. I will run the final code on the my large data set. I only need py file for all the task.