Metrics of Eval
Accuracy¶
Accuracy = \(\cfrac{TP + TN}{N}\)
Limitations¶
- Two-class problem where # of class 0 = 9990 and # of class 1 = 10
- If everything is predicted to be class 0, accuracy is \(9990/10000 = 0.999\) => misleading!
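A minimal sketch in plain Python reproducing the imbalanced example above (class counts taken from these notes, the classifier is a degenerate "always predict class 0" model):

```python
# 9990 instances of class 0, 10 of class 1
y_true = [0] * 9990 + [1] * 10
y_pred = [0] * 10000  # degenerate classifier: always predicts class 0

correct = sum(t == p for t, p in zip(y_true, y_pred))
accuracy = correct / len(y_true)
print(accuracy)  # 0.999 -- looks great, yet every class-1 instance is missed
```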
Cost matrix¶
\(c(i|j) =\) cost of predicting class \(i\) when the true class is \(j\)
Total cost = sum of confusion-matrix counts weighted by the corresponding cost-matrix entries (an accuracy-like score where each type of error carries its own weight)
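A sketch of how the cost matrix weights errors. The cost values and confusion counts below are made up for illustration (they are not from the notes); here a missed class-1 instance is assumed to be 100× as costly as a false alarm:

```python
# Hypothetical cost matrix: cost[(i, j)] = cost of predicting class i
# when the true class is j. Correct predictions cost 0.
cost = {(0, 0): 0, (0, 1): 100, (1, 0): 1, (1, 1): 0}

# Hypothetical confusion counts: counts[(predicted, actual)]
counts = {(0, 0): 9985, (0, 1): 4, (1, 0): 5, (1, 1): 6}

total_cost = sum(cost[key] * counts[key] for key in counts)
print(total_cost)  # 4*100 + 5*1 = 405
```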
Precision¶
Correctly predicted positives over all instances predicted positive
Precision = \(\cfrac{TP}{TP + FP}\)
Recall¶
Correctly predicted positives over all actual positives
Recall = \(\cfrac{TP}{TP + FN}\)
F1 Measure¶
F1 = \(\cfrac{2rp}{r + p} = \cfrac{2 \times TP}{2 \times TP + FP + FN}\), where \(r\) is recall and \(p\) is precision
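A small sketch computing all three metrics from raw counts; the TP/FP/FN values are arbitrary, chosen only for illustration:

```python
# Illustrative counts (not from the notes)
TP, FP, FN = 8, 2, 4

precision = TP / (TP + FP)  # 8 / 10 = 0.8
recall = TP / (TP + FN)     # 8 / 12 ≈ 0.667
f1 = 2 * precision * recall / (precision + recall)

# Equivalent closed form from the F1 formula above
assert abs(f1 - 2 * TP / (2 * TP + FP + FN)) < 1e-12
print(precision, recall, f1)  # 0.8 0.666... 0.727...
```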
Methods of perf eval¶
Depends on:
- Class distribution
- Cost of misclassification
- Size of train/test sets

Dataset roles:
- Train: dataset used to train the model
- Validation: dataset used to tune hyperparameters
- Test: dataset used to test the final model
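A sketch of one way to carve a dataset into these three roles. The 60/20/20 ratio, the toy data, and the use of scikit-learn's `train_test_split` are assumptions for illustration, not from the notes:

```python
# Assumes scikit-learn is installed; ratios (60/20/20) are an arbitrary choice.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(100).reshape(-1, 1)  # toy feature matrix
y = np.arange(100) % 2             # toy labels

# Peel off the test set first, then split the rest into train/validation.
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_rest, y_rest, test_size=0.25, random_state=0)

print(len(X_train), len(X_val), len(X_test))  # 60 20 20
```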
Methods of Estimation¶
- Holdout: ⅔ train, ⅓ test
- k-fold cross validation: split the data into k folds, train on k−1 folds and test on the held-out fold, repeat k times and average (or take the majority of) the k runs; used to tune hyperparameters, choose a model, and validate the significance of one model (see the sketch after this list)
- Leave-one-out (LOO) cross validation: k-fold with k = N, so each test set is a single instance
- Random subsampling: like k-fold cross validation, but instead of contiguous splits the test instances are chosen randomly (without replacement) each time
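A minimal k-fold cross-validation sketch in plain Python. The helper name, the toy sizes, and the dummy score are illustrative assumptions; `shuffle=True` corresponds to the random (non-contiguous) split described above, and k = N gives leave-one-out:

```python
import random

def k_fold_indices(n, k, shuffle=False, seed=0):
    """Yield (train_idx, test_idx) pairs for k-fold cross validation.

    shuffle=False -> contiguous folds (basic k-fold);
    shuffle=True  -> indices are randomly permuted first, i.e. the test
                     instances are chosen randomly without replacement.
    """
    idx = list(range(n))
    if shuffle:
        random.Random(seed).shuffle(idx)
    fold_size = n // k
    for i in range(k):
        test_idx = idx[i * fold_size:(i + 1) * fold_size]
        train_idx = idx[:i * fold_size] + idx[(i + 1) * fold_size:]
        yield train_idx, test_idx

# Usage: train/evaluate once per fold and average the k scores.
scores = []
for train_idx, test_idx in k_fold_indices(n=12, k=4):
    # train on train_idx, evaluate on test_idx -- here a stand-in score
    scores.append(len(test_idx) / 12)
print(sum(scores) / len(scores))
```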