This exercise is a classic machine learning exercise popularized by Kaggle and often used as a ‘machine leaning 101’ exercise. I’m currently working through the book, ‘Machine Learning with R Cookbook’ by Yu-Wei and David Chiu and this is the first exercise in the book.

To produce this post, I output the html from the R Notebook that I used to perform the analysis.


Add a new chunk by clicking the Insert Chunk button on the toolbar or by pressing Ctrl+Alt+I.

Assesing Performance with the ROC Curve

Prepare the Probability Matrix

Create an ROCR prediction from probabilities & Create ROC Curve