Model Validation

Backtest Results

Historical validation — how did the models perform in the past?

Summary Metrics
Total Backtest Years
1990 to present · 4 bands
Average ROC-AUC
Average across 4 bands
Best Band
Highest accuracy
Heidke Skill Score
Performance above random
Band Selector

Band-Level Backtest Charts

Accuracy
ROC-AUC
HSS
Null Model
Degradation
Checking...
Yearly Accuracy & ROC-AUC — M4–5
Loading data...
Confusion Matrix (Total)
TP
FP
FN
TN
Algorithm Info
Algorithm
ROC-AUC
F1 Score
Accuracy
ROC-AUC
HSS
Null Model
Degradation
Checking...
Yearly Accuracy & ROC-AUC — M5–6
Loading data...
Confusion Matrix (Total)
TP
FP
FN
TN
Algorithm Info
Algorithm
ROC-AUC
F1 Score
Accuracy
ROC-AUC
HSS
Null Model
Degradation
Checking...
Yearly Accuracy & ROC-AUC — M6–7
Loading data...
Confusion Matrix (Total)
TP
FP
FN
TN
Algorithm Info
Algorithm
ROC-AUC
F1 Score
Accuracy
ROC-AUC
HSS
Null Model
Degradation
Checking...
Yearly Accuracy & ROC-AUC — M7+
Loading data...
Confusion Matrix (Total)
TP
FP
FN
TN
Algorithm Info
Algorithm
ROC-AUC
F1 Score
Skill Score

What is the Heidke Skill Score (HSS)?

HSS measures how much better a model performs compared to random guessing. It is the standard reference metric in operational earthquake forecasting systems.

HSS = 2(ad − bc) / [(a+c)(c+d) + (a+b)(b+d)]

a = TP  |  b = FP  |  c = FN  |  d = TN  — computed from the full confusion matrix.

HSS < 0Worse than random
HSS = 0Equal to random
HSS > 0Better than random
HSS = 1Perfect
Year-by-Year Comparison

4 Bands — Yearly Accuracy

Preparing chart...
Model History

Retraining Timeline

Loading retraining history...
Scheduled Degradation Online Learning
Backtest runs automatically every Monday. The latest results will be displayed above.