Statistical Data Mining
Statistical Data Mining
Paper details:
This project is going to be a statistical analysis of http://archive.ics.uci.edu/ml/datasets
default+of+credit+card+clients Defaulted credit card clients
This paper should include and analysis write up of the data AND use of R Programming (a Statistical coding
language)
Paper sections MUST INCLUDE 1. Introduction Introduction: give some background and context about the
domain of application. provide the rationale for the type of analysis. and state the objective clearly.
THE ANALYSIS IS ABOUT THE DATA - Analysis: describe the data both qualitatively and quantitatively
through exploratory analysis. perform necessary preprocessing activities. give some intuition about the
algorithm and core parameters, demonstrate the model building steps along with parameter tuning, and
explain all your assumptions.
The RESULTS - - Result: explain the result and interpret the model output using terms that reflect the
application area, perform model evaluation using the appropriate metrics, and leverage visualization.
Conclusion: summarize your main findings, discuss experimental limitations related to the data and/or
implementation of the algorithm, and suggest improvement areas as a potentiation future work.
Pictures and Graphs go in the APPENDIX not in the paper. Pictures are produced automatically.
Watch Video https://www.youtube.com/watch?v=qWKx7SMj1CE&feature=youtu.be