Q.1: Use Spark MLlib or Sklearn to complete this question.1. [2 pts] Find a dataset in kaggle or any other source.
2. [2 pts] Write a detailed description of the dataset.
3. [6 pts] Preprocess the dataset.
4. [2 pts] Divide the dataset into training and testing.
5. [12 pts] Build a regression model using Linear regression.
6. [4 pts] Test the model and compute its accuracy.
Q.2: Use Spark MLlib or Sklearn to complete this question.
7. [2 pts] Find a dataset in kaggle (other than Q1) or any other source.
8. [2 pts] Write a detailed description of the dataset.
9. [6 pts] Preprocess the dataset.
10. [2 pts] Divide the dataset into training and testing.
11. [12 pts] Build a classification model using KNN.
12. [4 pts] Test the model and compute its accuracy.
Deliverable:
•
One pdf file which contains:
o [1 pt] A cover page which contains your names, Title, and date of submission.
o The solution of each of the above questions.
code
output