Cloud 3D - Tropical Cyclones Dataset
Tropical cyclone subset from the Global 3D Cloud Reconstruction Dataset. Contains colocated pairs of geostationary imagery (GOES-16 ABI and Himawari-8/9 AHI) with CloudSat radar profiles for tropical cyclones identified via the IBTrACS (International Best Track Archive for Climate Stewardship) database. Each sample includes: multispectral geostationary imagery (16 spectral channels + satellite/solar angles), CloudSat vertical profiles as ground truth, colocation mask, and cyclone metadata (name, category, basin, distance to center). GOES covers Atlantic and Eastern Pacific basins; Himawari covers Western Pacific (highest TC frequency globally). 256x256 pixel patches in Cloud-Optimized GeoTIFF format.
Version: 0.1.0
License: CC-BY-4.0
Keywords: tropical cyclones, hurricanes, typhoons, 3d reconstruction, cloud microphysics, geostationary satellites, GOES-16, Himawari-8, Himawari-9, CloudSat, IBTrACS, best track, remote sensing, deep learning, vertical structure, eyewall, rainbands
Tasks: regression, foundation-model
Dataset Overview
Partitions: 68 files
Spatial coverage: [-168.49, -26.43, 174.86, 40.60] (WGS84)
Temporal coverage: 2015-07-17 to 2020-08-25
Root: FOLDER (482 samples)
Hierarchy:
- Level 1: FILE → FILE (964 samples)
LEVEL0
LEVEL1
Usage
Python
R
Julia
Data Providers
NOAA — producer
https://www.noaa.gov
JMA — producer
https://www.jma.go.jp
NOAA NCEI — producer
https://www.ncei.noaa.gov/products/international-best-track-archive
Colorado State University — producer
https://www.cloudsat.cira.colostate.edu
European Space Agency (ESA) — licensor
https://www.esa.int
source.coop — host
https://source.coop
Dataset Curators
Publications & Citations
If you use this dataset in your research, please cite:
DOI: 10.48550/arXiv.2511.04773
Ermis, S., Aybar, C., Freischem, L., Girtsou, S., Bintsi, K.-M., Diaz Salas-Porras, E., Eisinger, M., Jones, W., Jungbluth, A., & Tremblay, B. (2025). Global 3D Reconstruction of Clouds & Tropical Cyclones. Tackling Climate Change with Machine Learning Workshop at NeurIPS 2025.
Primary publication describing the dataset and methodology
BibTeX
Generated with ❤️ using TacoToolbox v0.23.3