Data Mining and Business Intelligence
Paper details:
General Instructions:
Download all the datasets onto your machine and save them in a single folder. You can use this folder as your library when
you are building models.
Submit a Word file with snapshots of the models (min 3 per question) you have built with detailed explanations of the
process.
Do not use the “Automatically Fit Model” option in SAS.
Explain which model is the best fit and your rationale for model selection. Explore the RMSE, the residual, autocorrelation, and
white noise plots, and the other fit metrics to pick the best model.
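For illustration only, the selection criteria above can be sketched outside SAS. The Python below is a minimal sketch, not part of the required SAS workflow: the function names are hypothetical, and it assumes you already have the observed values, the fitted values, and the residuals from a candidate model. It computes the RMSE, the residual autocorrelations, and a Ljung-Box-type Q statistic, which is one common way to check whether residuals behave like white noise.

```python
import math


def rmse(actual, fitted):
    """Root mean squared error between observed and fitted values."""
    if len(actual) != len(fitted) or not actual:
        raise ValueError("actual and fitted must be non-empty and equal in length")
    squared_errors = [(a - f) ** 2 for a, f in zip(actual, fitted)]
    return math.sqrt(sum(squared_errors) / len(actual))


def autocorrelations(residuals, max_lag):
    """Sample autocorrelations of the residuals for lags 1..max_lag."""
    n = len(residuals)
    mean = sum(residuals) / n
    centered = [r - mean for r in residuals]
    denom = sum(c * c for c in centered)
    return [
        sum(centered[t] * centered[t - k] for t in range(k, n)) / denom
        for k in range(1, max_lag + 1)
    ]


def ljung_box_q(residuals, max_lag):
    """Ljung-Box Q statistic: n(n+2) * sum over lags k of r_k^2 / (n - k)."""
    n = len(residuals)
    acf = autocorrelations(residuals, max_lag)
    return n * (n + 2) * sum(r ** 2 / (n - k) for k, r in enumerate(acf, start=1))
```

A lower RMSE indicates a closer fit. Residual autocorrelations near zero at every lag, together with a Q statistic that is small relative to a chi-square distribution with `max_lag` degrees of freedom (less the number of estimated model parameters), suggest that the model has captured the structure in the series.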
1. Open the ‘GNP.sas7bdat’ file. This dataset contains quarterly USA Gross National Product (GNP) in billions of dollars and three
other variables that may affect it. Fit the best model that explains GNP and interpret the factors that influence it. Describe your
best model and your rationale in detail. (25 points)
2. Open ‘HouseSales.sas7bdat’. This dataset contains monthly new house sales for a USA town from 2008 to 2016. Fit
the best model and explain your rationale in detail. (25 points)
3. Open ‘EmployeeSales2.sas7bdat’. This dataset represents an employee's sales records from 2002 to 2016. In August 2008
the US market crashed. As a result, his sales numbers dropped significantly. Fit the best forecast model to accurately capture all
aspects of the time series data and explain your choice in detail. (25 points)
4. Open the ‘ProductB.sas7bdat’ file. This dataset contains monthly demand for a firm's Product B. Fit the best model that explains
the demand for Product B. Describe your best model and your rationale in detail. (25 points)
The following statement must be present with your submission: “The work contained and presented here is my work and my work
alone.”