Benchmarks¶
Explore shared benchmark families, aligned external ecosystems, supported tasks, and model compatibility across PyHazards.
At a Glance¶
4
Shared evaluator families available through the benchmark runner.
12
External benchmark or data ecosystems linked from the public docs.
7
Hazard tasks covered across the family-level benchmark contracts.
27
Unique smoke configs referenced by the benchmark family cards.
Benchmark Families¶
These four cards summarize the benchmark families exposed through the shared runner. Each card condenses a family's core tasks, metrics, support level, and coverage counts so the catalog can be scanned quickly.
Shared PyHazards evaluator family for wildfire danger and wildfire spread experiments.
Wildfire · Danger, Spread · Synthetic-backed
View Details: Wildfire Benchmark
Shared PyHazards evaluator family for earthquake phase-picking and wavefield-forecasting runs.
Earthquake · Phase Picking, Wavefield Forecasting · Synthetic-backed
View Details: Earthquake Benchmark
Shared PyHazards evaluator family for streamflow forecasting and inundation prediction.
Flood · Streamflow, Inundation · Synthetic-backed
View Details: Flood Benchmark
Shared PyHazards evaluator family for tropical cyclone and hurricane track-intensity forecasting.
Tropical Cyclone · Track + Intensity · Synthetic-backed
View Details: Tropical Cyclone Benchmark
Coverage Matrix¶
Use the matrix below for side-by-side comparison of hazard coverage, family-level tasks, primary metrics, linked-model counts, and support status without opening the detail pages first.
| Hazard | Benchmark Family | Tasks | Primary Metrics | Linked Models | Support Status |
|---|---|---|---|---|---|
| Wildfire | Wildfire Benchmark | Danger, Spread | Accuracy, Macro F1, AUC, PR-AUC, +5 more | 8 models | Synthetic-backed |
| Earthquake | Earthquake Benchmark | Phase Picking, Wavefield Forecasting | P-pick MAE, S-pick MAE, Precision, Recall, +3 more | 5 models | Synthetic-backed |
| Flood | Flood Benchmark | Streamflow, Inundation | MAE, RMSE, NSE, KGE, +3 more | 6 models | Synthetic-backed |
| Tropical Cyclone | Tropical Cyclone Benchmark | Track + Intensity | Track Error, Intensity MAE | 8 models | Synthetic-backed |
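The flood row lists MAE, RMSE, NSE, and KGE among its primary metrics. As a reading aid, here is a minimal sketch of the standard textbook definitions of those four metrics in plain Python. It is not the PyHazards evaluator code, and the function names are illustrative assumptions. KGE follows the common 2009 formulation built from correlation, variability ratio, and bias ratio.

```python
import math
from statistics import fmean
from typing import Sequence


def mae(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Mean absolute error between observed and simulated values."""
    return fmean(abs(o - s) for o, s in zip(obs, sim))


def rmse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Root mean squared error."""
    return math.sqrt(fmean((o - s) ** 2 for o, s in zip(obs, sim)))


def nse(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency: 1 is perfect, 0 matches the observed mean."""
    mean_obs = fmean(obs)
    residual = sum((o - s) ** 2 for o, s in zip(obs, sim))
    variance = sum((o - mean_obs) ** 2 for o in obs)
    return 1.0 - residual / variance


def kge(obs: Sequence[float], sim: Sequence[float]) -> float:
    """Kling-Gupta efficiency from correlation r, std ratio alpha, mean ratio beta."""
    mean_obs, mean_sim = fmean(obs), fmean(sim)
    std_obs = math.sqrt(fmean((o - mean_obs) ** 2 for o in obs))
    std_sim = math.sqrt(fmean((s - mean_sim) ** 2 for s in sim))
    cov = fmean((o - mean_obs) * (s - mean_sim) for o, s in zip(obs, sim))
    r = cov / (std_obs * std_sim)   # linear correlation
    alpha = std_sim / std_obs       # variability ratio
    beta = mean_sim / mean_obs      # bias ratio
    return 1.0 - math.sqrt((r - 1) ** 2 + (alpha - 1) ** 2 + (beta - 1) ** 2)
```

Both NSE and KGE equal 1 for a perfect simulation. The sketch does not guard against constant series or a zero observed mean, which make the ratios undefined.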
Benchmark Ecosystems¶
Browse the aligned benchmark ecosystems by hazard family. Each card links to a detail page with the routed benchmark family, source links, and the models currently mapped to that ecosystem.
Ecosystem cards describe the external benchmark or data protocol surfaced on this page and show how it maps back to the shared PyHazards benchmark family.
Temporal wildfire spread benchmark coverage for the shared wildfire spread evaluator.
Wildfire · Spread · Synthetic-backed
View Details: WildfireSpreadTS
pick-benchmark-compatible waveform picking support routed through the shared earthquake evaluator.
Earthquake · Phase Picking · Synthetic-backed
View Details: pick-benchmark
Paper: pick-benchmark
SeisBench-shaped waveform picking support for the shared earthquake benchmark family.
Earthquake · Phase Picking · Synthetic-backed
View Details: SeisBench
Caravan-style streamflow benchmark coverage for the shared flood streamflow evaluator.
Flood · Streamflow · Synthetic-backed
View Details: Caravan
FloodCastBench-style inundation benchmark coverage for the shared flood inundation evaluator.
Flood · Inundation · Synthetic-backed
View Details: FloodCastBench
Paper: FloodCastBench
HydroBench-style streamflow diagnostics coverage for the shared flood streamflow evaluator.
Flood · Streamflow · Synthetic-backed
View Details: HydroBench
Paper: HydroBench
WaterBench-style streamflow benchmark coverage for the shared flood evaluator.
Flood · Streamflow · Synthetic-backed
View Details: WaterBench
TCBench Alpha-style storm benchmark coverage for the shared tropical cyclone evaluator.
Tropical Cyclone · Track + Intensity · Synthetic-backed
View Details: TCBench Alpha
Paper: TCBench Alpha
TropiCycloneNet-Dataset-backed storm benchmark coverage for the shared tropical cyclone evaluator.
Tropical Cyclone · Track + Intensity · Synthetic-backed
View Details: TropiCycloneNet-Dataset
Paper: TropiCycloneNet-Dataset
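The routing described by the ecosystem cards can be summarized as a small lookup table. The sketch below transcribes only the ecosystems, hazards, and tasks named in the cards on this page. It is a plain data structure for illustration, not a PyHazards API, and `ECOSYSTEMS` and `ecosystems_by_task` are hypothetical names.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

# (ecosystem, hazard, task) triples transcribed from the cards on this page.
ECOSYSTEMS: List[Tuple[str, str, str]] = [
    ("WildfireSpreadTS", "Wildfire", "Spread"),
    ("pick-benchmark", "Earthquake", "Phase Picking"),
    ("SeisBench", "Earthquake", "Phase Picking"),
    ("Caravan", "Flood", "Streamflow"),
    ("FloodCastBench", "Flood", "Inundation"),
    ("HydroBench", "Flood", "Streamflow"),
    ("WaterBench", "Flood", "Streamflow"),
    ("TCBench Alpha", "Tropical Cyclone", "Track + Intensity"),
    ("TropiCycloneNet-Dataset", "Tropical Cyclone", "Track + Intensity"),
]


def ecosystems_by_task(
    entries: Sequence[Tuple[str, str, str]],
) -> Dict[Tuple[str, str], List[str]]:
    """Group ecosystem names by the (hazard, task) pair they route to."""
    grouped: Dict[Tuple[str, str], List[str]] = defaultdict(list)
    for name, hazard, task in entries:
        grouped[(hazard, task)].append(name)
    return dict(grouped)
```

Grouping this way shows at a glance which ecosystems share an evaluator task, for example the three streamflow ecosystems under the flood family.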
Programmatic Use¶
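```python
from pyhazards.configs import load_experiment_config
from pyhazards.engine import BenchmarkRunner

# Load the earthquake PhaseNet smoke config, run it through the shared
# benchmark runner, and print the resulting metrics.
config = load_experiment_config("pyhazards/configs/earthquake/phasenet_smoke.yaml")
summary = BenchmarkRunner().run(config)
print(summary.metrics)
```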
Use `python scripts/run_benchmark.py --help` for the CLI entry point, then pair this page with Configs for experiment YAMLs and Reports for comparable benchmark exports.
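To run several smoke configs in one pass, the load-and-run pattern from this section can be wrapped in a small loop. The following is a minimal sketch rather than part of PyHazards: `run_configs` is a hypothetical helper, and the loader and runner are passed in as callables so the loop assumes nothing about the library beyond the calls already shown on this page.

```python
from typing import Any, Callable, Dict, Iterable


def run_configs(
    paths: Iterable[str],
    load: Callable[[str], Any],
    run: Callable[[Any], Any],
) -> Dict[str, Any]:
    """Run each config path and collect the results keyed by path.

    `load` and `run` are injected so the helper stays library-agnostic.
    A failing config is recorded as its exception instead of aborting the batch.
    """
    results: Dict[str, Any] = {}
    for path in paths:
        try:
            results[path] = run(load(path))
        except Exception as exc:  # keep going so one bad config does not hide the rest
            results[path] = exc
    return results
```

With PyHazards installed, this could be called as `run_configs(["pyhazards/configs/earthquake/phasenet_smoke.yaml"], load_experiment_config, BenchmarkRunner().run)`, after which each successful value would expose `.metrics` as in the example in this section.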