If you are assigned to teach this section of the course, select one of
the assignments, complete it, and show it to the instructor to confirm that
you have done it correctly. Then prepare your slides, narrate them,
remove excess words from the narrated slides, convert them to a file
format that can be uploaded, upload the file, and email the URL of the file
to everyone in the class. Complete all of these tasks ahead of the
scheduled class session. Your peers will appreciate receiving your advice
on how to solve a class assignment as early as possible, well before the
last day prior to the class session.
Individual assignments must not be completed in teams. Submit
your work in Blackboard. Do not discuss the work with other students.
Question 1: Use the attached data to create a receiver operating
characteristic (ROC) curve. The file contains two columns: the predicted
probability and the actual classification.
(a) Sort the predicted probabilities and generate cutoff values as the
average of each pair of consecutive predicted values.
(b) Classify the model predictions at each cutoff.
(c) Calculate the sensitivity and specificity of the model predictions at
each cutoff and list them in order of the cutoff values.
(d) Draw the ROC curve.
(e) Calculate the area under the ROC curve.
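The steps in Question 1 can be sketched as follows. This is only an illustration of the arithmetic in Python, not the required solution; the function names `roc_points` and `auc` and the four-row sample data are hypothetical. The sketch assumes that a case is classified as positive when its predicted probability is above the cutoff, and that the actual classification is coded as 1 or 0.

```python
def roc_points(predicted, actual):
    """Return (cutoff, sensitivity, specificity) rows, ordered by cutoff."""
    # (a) Cutoffs are the averages of consecutive distinct sorted predictions.
    values = sorted(set(predicted))
    cutoffs = [(lo + hi) / 2 for lo, hi in zip(values, values[1:])]
    positives = sum(actual)              # assumes actual is coded 1 / 0
    negatives = len(actual) - positives  # assumes both classes are present
    rows = []
    for cutoff in cutoffs:
        # (b) Assumption: predict positive when the probability exceeds the cutoff.
        tp = sum(1 for p, y in zip(predicted, actual) if p > cutoff and y == 1)
        tn = sum(1 for p, y in zip(predicted, actual) if p <= cutoff and y == 0)
        # (c) Sensitivity = TP / positives; specificity = TN / negatives.
        rows.append((cutoff, tp / positives, tn / negatives))
    return rows


def auc(rows):
    """(e) Trapezoidal area under sensitivity plotted against 1 - specificity."""
    # Add the end points (0, 0) and (1, 1), which midpoint cutoffs may not reach.
    pts = sorted([(0.0, 0.0), (1.0, 1.0)]
                 + [(1 - spec, sens) for _, sens, spec in rows])
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# Hypothetical sample data, not the attached file.
predicted = [0.1, 0.4, 0.35, 0.8]
actual = [0, 0, 1, 1]
rows = roc_points(predicted, actual)
print(auc(rows))  # 0.75
```

For step (d), plot sensitivity on the vertical axis against 1 - specificity on the horizontal axis, using the rows returned by `roc_points`, in whichever charting tool you prefer.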
Work in teams of two. Do not pair with someone with whom you have
previously submitted a team assignment.
Upon submission, indicate the name of your teammate.
Each member of the team should submit a separate assignment.
Do not copy code from each other, but feel free to learn from each other.
The data reported by team members must be the same; the SQL code
can be different. Come to an agreement on the findings
and help each other arrive at the same findings.
If a team assignment is completed individually, the student loses 10% of the grade.
Clean the data as you or your teammate did in previous weeks.
Estimate the likelihood ratios as you or your teammate did in
previous weeks.
Verify that both
team members are working with the same set of cleaned data and the same set of
likelihood ratios.
Randomly set aside 80% of the data for training and 20% for validation. Use
the validation data set in the following calculations.
Use naive Bayes to predict the probability of the outcome.
Report the accuracy of predictions using the receiver operating characteristic (ROC) curve.
Draw the ROC curve.
Report the area under the ROC curve.
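The split and prediction steps above can be sketched as follows. The assignment asks for SQL; this Python version is only meant to make the calculation concrete. It assumes binary (1/0) predictors and outcome, and it multiplies the prior odds by one likelihood ratio per predictor to get the posterior odds. The field name `outcome`, the function names, and the add-one smoothing are assumptions of this sketch, not course requirements.

```python
import random


def split(records, train_fraction=0.8, seed=0):
    """Randomly assign records to a training set and a validation set."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)  # fixed seed so teammates can match
    cut = int(round(train_fraction * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


def likelihood_ratios(train, features):
    """Estimate (LR if present, LR if absent) per feature, plus prior odds."""
    pos = [r for r in train if r["outcome"] == 1]
    neg = [r for r in train if r["outcome"] == 0]
    ratios = {}
    for f in features:
        # Add-one smoothing (an assumption here) avoids division by zero.
        p_pos = (sum(r[f] for r in pos) + 1) / (len(pos) + 2)
        p_neg = (sum(r[f] for r in neg) + 1) / (len(neg) + 2)
        ratios[f] = (p_pos / p_neg, (1 - p_pos) / (1 - p_neg))
    # Prior odds of the outcome; assumes both classes occur in the training set.
    return ratios, len(pos) / len(neg)


def predict(record, ratios, prior_odds):
    """Naive Bayes: posterior odds = prior odds x product of likelihood ratios."""
    odds = prior_odds
    for f, (lr_present, lr_absent) in ratios.items():
        odds *= lr_present if record[f] == 1 else lr_absent
    return odds / (1 + odds)  # convert odds to a probability
```

A typical use would be to estimate the ratios on the training set and call `predict` on each validation record. The resulting probabilities and the actual outcomes are then the two inputs needed for the cutoff procedure described in Question 1.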
To complete this team assignment, paste your SQL code, the receiver
operating characteristic (ROC) curve, and the area under the curve into a Word document. Then upload the document to Blackboard. Each student must upload their own
document by Sunday, 11:55 PM EST.