Skip to content

hurricane

Back to geoai datasets

Field Value
Description Data Format Description for Hurricane Evaluation on Prithvi WxC

Overview
To evaluate the performance of Prithvi WxC on hurricanes, the surface and pressure data from the MERRA-2 dataset, comprising 160 variables used in training, is required. The complete evaluation dataset includes 75 different initial conditions for hurricanes that formed in the Atlantic Ocean between 2017 and 2023.

The scientific objective is to assess the zero-shot performance of Prithvi WxC in predicting the track and intensity of hurricanes. This dataset includes surface and pressure files for Hurricane Ida and can be utilized to predict a 72 hour forecast for Hurricane Ida (2021), initialized on 2021-08-27 at 00:00 UTC.

Dataset Description
The dataset includes variables at model-native levels corresponding to nominal pressure surfaces, which are 985 hPa, 970 hPa, 925 hPa, 850 hPa, 700 hPa, 600 hPa, 525 hPa, 412 hPa, 288 hPa, 245 hPa, 208 hPa, 150 hPa, 109 hPa, and 48 hPa, with data available at 3-hour intervals. Variables at these levels include:
Wind components (U, V)
Vertical wind (ω)
Air temperature (T)
Specific humidity (QV)
Actual mid-level pressure (PL)
Mid-layer geopotential height (H)
Cloud fraction (CLOUD)
Cloud mass fraction (ice: QI, water: QL)

Additional single-level variables are available at 1-hour intervals, including:
Near-surface wind components (U10, V10)
Near-surface (2 meter) air temperature (T2M)
Skin temperature (TS)
Surface roughness (Z0M)
Specific humidity (QV2M)
Surface pressure (PS)
Sea level pressure (SLP)
Column-total ice, liquid water, and water vapor (TQI, TQL, TQV)
Longwave radiation emitted by the surface (LWGEM)
Longwave radiation absorbed by the surface (LWGAB)
Upward longwave radiation at the top of the atmosphere (LWTUP)
Net downward shortwave radiation at the surface (SWGNT)
Net shortwave radiation at the top of the atmosphere (SWTNT)

Static Variables
Static variables include:
Surface geopotential height (PHIS)
Land fraction (FRLAND)
Ocean fraction (FROCEAN)
Ice fraction (FRACI)
These variables provide essential static information and are spatially varying but remain constant over time.

Time-Averaged Variables
Time-averaged variables such as root zone soil wetness (GWETROOT), leaf area index (LAI), and surface fluxes (EFLUX, HFLUX) are aggregated from 1-hour intervals, as they are diagnostic variables not available at the analysis time. For aggregation, the means of adjacent hourly values are used to create data for 12:00 UTC (e.g., the mean of 11:30 and 12:30 values is computed to derive 12:00 UTC data).

Missing values (NaNs) in GWETROOT and LAI are replaced with 1 and 0, respectively, to maintain data continuity over ocean regions.
Folder /datasets/geoai/ibm-nasa-geospatial/hurricane
Discipline GeoAI / Atmospheric Science / Climate Science
DOI
Link Access Data
Public True
Publication Date 2024-09-24
Downloaded 2025-09-10
Data Type NetCDF
Dataset Size 2.9G
Number of Files 70
Usage
$ module avail
$ module load datasets
$ module load geoai/ibm-nasa-geospatial/hurricane/2024-09-24
Usage Policy Link https://choosealicense.com/licenses/mit/
Usage Policy This dataset is released under the MIT License
.
The MIT License permits use, modification, and distribution of the dataset for research and commercial purposes, provided that appropriate credit is given to the original authors. Users are encouraged to acknowledge the IBM–NASA Geospatial team and include a citation to this dataset in any derived work, publication, or model evaluation using Prithvi WxC hurricane data.
Citation IBM NASA Geospatial. (2024). Hurricane Dataset (v1.0) [Dataset]. Hugging Face. https://huggingface.co/datasets/ibm-nasa-geospatial/hurricane
BibTeX
📜 View BibTeX citation
@dataset{ibm_nasa_hurricane_2024,
title = {Hurricane Dataset (v1.0)},
author = {IBM NASA Geospatial},
year = {2024},
howpublished = {\url{https://huggingface.co/datasets/ibm-nasa-geospatial/hurricane}},
note = {Available on Hugging Face Datasets}
}