NEC Lending Club Data Programming Worksheet

Nutrition Case Study

The main objective is to write a fully executed R-Markdown program performing regression prediction for the response variable using the best models found for kNN, Random Forest and XGBoost techniques predicting the response variable in the Nutrition case study. Make sure to describe the final hyperparameter settings of all algorithms that were used for comparison purposes.

You are required to clearly display and explain the models that were run for this task and their effect on the reduction of the Cost Function.

Points will be deducted in case you fail to explain the output.

Please note that all code assignments must be submitted as a screenshot with a slice of your desktop showing the timestamp.

If the time and date are not visible, you will be graded 0.

Week 9: Lending Club
We will revisit the Lending Club data for this week’s assignment. The company has existed since 2007 and
have provided millions of personal loans since then. Lending Club announced IPO in December 2014, since
when the company came in the limelight for negative publicity. Lending club officials were accused of
taking aggressive risks by lending money to those with risky credit worthiness. You are asked to study this
phenomenon and determine if data provides clues of the authenticity of the claim that Lending Club behaved
irresponsibly.
You are given a single combined file of “approved” loans data from six years, which are supposedly the pre and
post periods of the controversy.
Step 1 (30 Points)
The first step is create two new columns as follows:
a) Comb_Risk_One: Create a binary column by combining categories A and B (Low Risk) into one
category and all the remaining categories in another (High Risk).
b) Comb_Risk_Two: Create a binary column by combining categories A, B and C (Low Risk) into one
category and all the remaining categories in another (High Risk).
Now, break the file into two files filtering out data for 2012, 13, and 14 in one file and 2015, 16 and 17 in
another file.
Step 2 (70 Points)
The primary objective is to use classification techniques learnt so far. Each loan is graded (A to G) based on the
risk, with A being least risky and G being the highest risk category. You are asked to predict Low and High-risk
categories (for the two new response variables) using various modeling techniques like Naïve Bayes’, KNN,
Logistic Regression, and CART model. Make sure to look for the following:
Instructor: Prashant Mittal.
a. Outliers based on the independent columns (predictors)
b. Multicollinearity
c. Scaling and standardization of the predictors
d. Train-Test split for both files and compare the confusion matrices on the Test.
Produce a “well documented and explained” R Markdown knit file analyzing the data with findings on the
model with the highest classification ability. Also describe the features of the categories that are not classified
correctly. Create a confusion matrix to answer the last question and run descriptive statistics on the
misclassified categories. Provide any necessary EDA and visuals to enhance understanding of your analysis.
Instructor: Prashant Mittal.

Calculate your order
275 words
Total price: $0.00

Top-quality papers guaranteed

54

100% original papers

We sell only unique pieces of writing completed according to your demands.

54

Confidential service

We use security encryption to keep your personal data protected.

54

Money-back guarantee

We can give your money back if something goes wrong with your order.

Enjoy the free features we offer to everyone

  1. Title page

    Get a free title page formatted according to the specifics of your particular style.

  2. Custom formatting

    Request us to use APA, MLA, Harvard, Chicago, or any other style for your essay.

  3. Bibliography page

    Don’t pay extra for a list of references that perfectly fits your academic needs.

  4. 24/7 support assistance

    Ask us a question anytime you need to—we don’t charge extra for supporting you!

Calculate how much your essay costs

Type of paper
Academic level
Deadline
550 words

How to place an order

  • Choose the number of pages, your academic level, and deadline
  • Push the orange button
  • Give instructions for your paper
  • Pay with PayPal or a credit card
  • Track the progress of your order
  • Approve and enjoy your custom paper

Ask experts to write you a cheap essay of excellent quality

Place an order