PyHazards

PyHazards is a comprehensive Python framework for AI-powered hazard prediction and risk assessment. Built on PyTorch with a hazard-first design, the library provides a modular and extensible architecture for building, training, and deploying machine learning models to predict and analyze natural hazards and environmental risks.

Key features:

  • Hazard-First Architecture: Unified dataset interface for tabular, temporal, and raster data

  • Simple, Extensible Models: Ready-to-use MLP/CNN/temporal encoders with task heads

  • Trainer API: Fit/evaluate/predict with optional mixed precision and multi-GPU (DDP) support

  • Metrics: Classification, regression, and segmentation metrics out of the box

  • Extensibility: Registries for datasets, models, transforms, and pipelines
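The Trainer's mixed-precision option corresponds to PyTorch's automatic mixed precision (AMP). As a rough sketch of what such a flag typically wraps, here is a plain PyTorch training step with autocast and a gradient scaler; this is illustrative only, not the pyhazards internals:

```python
import torch
from torch import nn

# Minimal AMP sketch: autocast forward pass plus GradScaler.
# With enabled=False (no GPU available) both become no-ops,
# so the loop also runs unchanged on CPU.
device = "cuda" if torch.cuda.is_available() else "cpu"
use_amp = device == "cuda"

model = nn.Linear(16, 2).to(device)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()
scaler = torch.cuda.amp.GradScaler(enabled=use_amp)

x = torch.randn(32, 16, device=device)
y = torch.randint(0, 2, (32,), device=device)

for _ in range(3):
    optimizer.zero_grad()
    with torch.autocast(device_type=device, enabled=use_amp):
        loss = loss_fn(model(x), y)
    scaler.scale(loss).backward()  # scaled loss to avoid fp16 underflow
    scaler.step(optimizer)
    scaler.update()
```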

Quick Start

Basic usage (toy dataset):

import torch
from pyhazards.datasets import DataBundle, DataSplit, Dataset, FeatureSpec, LabelSpec
from pyhazards.models import build_model
from pyhazards.engine import Trainer
from pyhazards.metrics import ClassificationMetrics

class ToyHazard(Dataset):
    def _load(self):
        x = torch.randn(500, 16)
        y = torch.randint(0, 2, (500,))
        splits = {
            "train": DataSplit(x[:350], y[:350]),
            "val": DataSplit(x[350:425], y[350:425]),
            "test": DataSplit(x[425:], y[425:]),
        }
        return DataBundle(
            splits=splits,
            feature_spec=FeatureSpec(input_dim=16, description="toy features"),
            label_spec=LabelSpec(num_targets=2, task_type="classification"),
        )

data = ToyHazard().load()
model = build_model(name="mlp", task="classification", in_dim=16, out_dim=2)
trainer = Trainer(model=model, metrics=[ClassificationMetrics()], mixed_precision=True)

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = torch.nn.CrossEntropyLoss()

trainer.fit(data, optimizer=optimizer, loss_fn=loss_fn, max_epochs=5)
results = trainer.evaluate(data, split="test")
print(results)
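The numbers trainer.evaluate reports come from the attached metrics objects; the exact result keys depend on ClassificationMetrics. For intuition, here is how a classification accuracy is computed from raw logits, using plain PyTorch:

```python
import torch

# Illustrative only: accuracy from logits, the kind of value
# a classification metrics object reports.
logits = torch.tensor([[2.0, -1.0], [0.5, 1.5], [-0.3, 0.2]])
targets = torch.tensor([0, 1, 0])

preds = logits.argmax(dim=1)  # predicted class per sample
accuracy = (preds == targets).float().mean().item()  # ~0.667 (2 of 3 correct)
```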

Wildfire Mamba (spatio-temporal toy)

A Mamba-style county/day wildfire model built on a graph-aware dataset and collate function:

import torch
from pyhazards.datasets import DataBundle, DataSplit, FeatureSpec, LabelSpec, GraphTemporalDataset, graph_collate
from pyhazards.engine import Trainer
from pyhazards.models import build_model

past_days = 8
num_counties = 4
num_features = 6
samples = 32

# Fake county/day ERA5-like tensor and binary fire labels
x = torch.randn(samples, past_days, num_counties, num_features)
y = torch.randint(0, 2, (samples, num_counties)).float()
adjacency = torch.eye(num_counties)  # replace with distance/correlation matrix

train_ds = GraphTemporalDataset(x[:24], y[:24], adjacency=adjacency)
val_ds = GraphTemporalDataset(x[24:], y[24:], adjacency=adjacency)

bundle = DataBundle(
    splits={
        "train": DataSplit(train_ds, None),
        "val": DataSplit(val_ds, None),
    },
    feature_spec=FeatureSpec(input_dim=num_features, extra={"past_days": past_days, "counties": num_counties}),
    label_spec=LabelSpec(num_targets=num_counties, task_type="classification"),
)

model = build_model(
    name="wildfire_mamba",
    task="classification",
    in_dim=num_features,
    num_counties=num_counties,
    past_days=past_days,
    adjacency=adjacency,
)

trainer = Trainer(model=model, mixed_precision=False)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = torch.nn.BCEWithLogitsLoss()

trainer.fit(bundle, optimizer=optimizer, loss_fn=loss_fn, max_epochs=2, batch_size=4, collate_fn=graph_collate)

# Next-day fire probabilities for one window
with torch.no_grad():
    logits = model(x[:1])
    probs = torch.sigmoid(logits)
    print(probs.shape)  # (1, num_counties)
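The identity adjacency above is a placeholder. One common way to build a real adjacency matrix is a Gaussian kernel over pairwise distances between county centroids; a sketch with made-up coordinates (the centroids below are hypothetical, not real counties):

```python
import torch

# Hypothetical county centroids (lon, lat).
coords = torch.tensor([
    [-120.5, 38.6],
    [-121.0, 39.1],
    [-119.8, 36.7],
    [-122.3, 40.4],
])

dists = torch.cdist(coords, coords)       # pairwise Euclidean distances
sigma = dists[dists > 0].median()         # bandwidth chosen from the data
adjacency = torch.exp(-(dists ** 2) / (2 * sigma ** 2))
# Symmetric, ones on the diagonal, values decay with distance.
```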

Core Components

Datasets

PyHazards provides a unified dataset interface for tabular, temporal, and raster data, returning a DataBundle with splits and specs.

Models

Extensible model architecture with MLP/CNN/temporal backbones and task heads for classification, regression, and segmentation. Custom models are easy to implement and register via the model registry.
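The exact registration call isn't shown here, but decorator-based registries like this one generally follow the same pattern; the names below (MODEL_REGISTRY, register_model, TinyMLP) are illustrative, not the pyhazards API:

```python
import torch
from torch import nn

# Generic decorator-based registry: names map to model classes,
# so build-by-name lookups (like build_model) stay one dict access.
MODEL_REGISTRY = {}

def register_model(name):
    def wrap(cls):
        MODEL_REGISTRY[name] = cls
        return cls
    return wrap

@register_model("tiny_mlp")
class TinyMLP(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 32), nn.ReLU(), nn.Linear(32, out_dim)
        )

    def forward(self, x):
        return self.net(x)

# Build by name, exactly as a build_model helper would look it up.
model = MODEL_REGISTRY["tiny_mlp"](in_dim=16, out_dim=2)
```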

Utilities

Helper functions for device management, random seeding, logging, and metrics computation.
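The pyhazards helper names aren't shown here, so the sketch below uses the underlying libraries directly; this is roughly what a seed-everything helper and device selection do:

```python
import random
import numpy as np
import torch

def seed_everything(seed: int) -> None:
    # Seed every RNG a typical training run touches.
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    if torch.cuda.is_available():
        torch.cuda.manual_seed_all(seed)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Same seed, same draws: reseeding reproduces the random stream.
seed_everything(42)
a = torch.randn(3)
seed_everything(42)
b = torch.randn(3)
```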

How to Cite

If you use PyHazards in your research, please cite:

@software{pyhazards2025,
  title={PyHazards: A Python Framework for AI-Powered Hazard Prediction},
  author={Cheng, Xueqi},
  year={2025},
  url={https://github.com/LabRAI/PyHazards}
}
