Uncategorized

Clustering

Select a dataset for clustering. The specific requirements for the assignment are as follows:

• Choose a dataset that is of interest to you and is well suited for clustering
• Give a brief description of the dataset
• Test at least 2 clustering algorithms using the software of your choice.
• Use a cluster quality metric to assess the clustering quality of each algorithm.
• Use the elbow method to determine the number of clusters
• Compare the results from each algorithm.
• Write a report that describes your experiment and results. The report should include an introduction to the data mining question, a description of the dataset (include some EDA for a high score), a description of the classifiers, and the results of the experiment as well as any interesting conclusions that can be drawn from your analysis.
• There are many resources available on clustering in R or Python including the following:
o http://michael.hahsler.net/SMU/EMIS7332/R/chap8.html
o http://www.rdatamining.com/
o http://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Clustering
o https://www.ibm.com/developerworks/library/os-weka2/
o http://www.cs.ccsu.edu/~markov/ccsu_courses/datamining-ex3.html

 

ORDER THIS ESSAY HERE NOW AND GET A DISCOUNT !!!