Steps for Completion:
- Use the LoanStats dataset and make a subset using the following variables:
dfn <- df3[,c("home_ownership","loan_amnt","grade")]
- Clean the dataset (removing the NONE and NA cases), using the following code:
dfn <- na.omit(dfn)
dfn <- subset(dfn, !dfn$home_ownership %in% c("NONE"))
- Create a boxplot showing the loan amount versus home ownership.
- Differentiate the boxes by color according to credit grade.
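The last two steps can be sketched as follows. This is a minimal illustration rather than the book's own solution: since the LoanStats data is not loaded here, the block builds a small, made-up stand-in for `dfn` with the same three columns, and the axis labels are assumptions. With the real data, only the `ggplot()` call would be needed.

```r
library(ggplot2)

# Hypothetical stand-in for the cleaned `dfn` subset.
# The values are random and for illustration only; they are not LoanStats data.
set.seed(1)
dfn <- data.frame(
  home_ownership = sample(c("MORTGAGE", "OWN", "RENT"), 300, replace = TRUE),
  loan_amnt      = round(runif(300, 1000, 35000)),
  grade          = sample(LETTERS[1:7], 300, replace = TRUE)
)

# Loan amount versus home ownership, with one box per credit grade
# inside each home-ownership category (grade mapped to the fill color).
p <- ggplot(dfn, aes(x = home_ownership, y = loan_amnt, fill = grade)) +
  geom_boxplot() +
  labs(x = "Home ownership", y = "Loan amount", fill = "Credit grade")

print(p)
```

Mapping `grade` to `fill` colors the inside of each box; mapping it to `color` instead would color only the outlines.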
Outcome:
Refer to the following URL for the output: https://goo.gl/RheL2G.
The answers to question 5 are as follows:
- Loan amounts are highest for credit grades F and G and lowest for credit grades A and B.
- Loan amounts are higher for a person who has a mortgage.
- The median value for A is 2,000, and the median ...