What are the characteristics of data?
Compare the following clustering types: prototype-based, density-based, and graph-based.
What is a scalable clustering algorithm?
How do you choose the right clustering algorithm?
What are the characteristics of anomaly detection?
What are the anomaly detection problems and methods?
What are the statistical approaches to anomaly detection?
Compare and contrast proximity-based and clustering-based approaches.