1. Use this Kaggle dataset
https://www.kaggle.com/datasets/uciml/student-alcohol-consumption
2. Select a variable to be used as the Response Variable. List it here.
3. Select three continuous variables to serve as explanatory variables
4. Generate a Numerical Summaries table, Correlation table, and histogram on each interval. (Interpret Model 2 row in JASP)
5. Run the multiple regression and report the 3 regression output tables
6. Interpret R, Adjusted R square values
7. Interpret p-value from Regression table
8. Interpret p-values from Coefficient table
9. Check VIF (collinearity diagnostics) and Durbin-Watson and interpret
10.Check residuals vs predicted plot. Interpret
11.Check residuals histogram. interpret
12.Use a variable selection process (stepwise, forward, etc …) , indicate which process you used, to remove explanatory variables.
13.Show all 3 final regression output tables after the variable selection process has completed. Interpret relevant statistics within the tables