CIS 607 Unit 5

  1. Pick any dataset relevant to your major that you would like to analyze. Avoid use the same or similar datasets to one you use for your final project.
  2. Randomly divide it into two chunks 80% and 20% of records.
  3. Select input variables (2 min) that you will use for cluster analysis. Provide reasoning for the selection.
  4. Use SPSS or other tool to apply appropriate cluster analysis method to clusters the larger part of the dataset.
  5. Identify clusters and describe their centroids and business meaning.
  6. If classes are poorly identified by the analysis or their business meaning is hard to describe. Change your variable selection and go to the step 3.
  7. For at least 5 records from the remaining smaller part of the dataset identify the closest cluster centroid. That will be a prediction which cluster those records belong too. Note that they have not been used in cluster identification, therefore this prediction will qualify as an example of predictive analytics.
  8. Submit a Word report describing each step and a result of this process, include relevant scripts and outputs produced by the tool you use.

Save Time On Research and Writing
Hire a Pro to Write You a 100% Plagiarism-Free Paper.
Get My Paper
Still stressed with your coursework?
Get quality coursework help from an expert!