Quality metrics

Imbalanced binary classification. For compliance, recall on the reportable class matters most — a false negative is a reportable complaint that escaped.
metric bars
Benchmark vs current performance. The control concern is the recall drop, not the high precision.

Scorecard

MetricBenchmarkCurrentΔ
accuracy0.9490.832-0.117
balanced_accuracy0.9480.857-0.091
precision0.9530.966+0.013
recall0.9570.759-0.198
f10.9550.850-0.105
f1_macro0.9480.830-0.118
roc_auc0.9460.870-0.075
pr_auc0.9320.925-0.007

Confusion matrix & error breakdown

confusion
False negatives are the bottom-left cell: truly reportable complaints predicted as not reportable.
False negatives (FN=454) are reportable complaints predicted not-reportable — the costly error here. False positives (FP=50) create review workload but are safer.