A unified multi-modal Earth Observation pre-training dataset combining Sentinel-2, Landsat 8/9, Copernicus DEM, and ESA WorldCover on a global 10 km grid. 250,000 tiles, TACO v3 format.
MajorTOM-Core is the first unified multi-modal Earth Observation dataset built on the MajorTOM global 10 km grid. It combines four modalities into a single collection where every tile contains co-registered optical, thermal, elevation, and land cover data.
This is version 1 of MajorTOM-Core. The dataset is designed to grow incrementally: we start with 250,000 tiles and plan to scale to full planetary coverage while adding new modalities over time. The dataset follows the TACO v3 specification, a format for organizing AI-ready Earth Observation datasets.
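To put the 250,000-tile starting point next to the goal of full planetary coverage, here is a rough back-of-the-envelope sketch. It assumes each tile covers exactly 10 km × 10 km with no overlap, and it uses approximate textbook values for Earth's total and land surface area. These constants and the helper names are illustrative assumptions, not part of the dataset or its tooling, and the real grid cells may differ in size.

```python
import math

# Approximate reference values (assumptions, not taken from the dataset).
EARTH_SURFACE_KM2 = 510.1e6  # total surface area of the Earth
LAND_SURFACE_KM2 = 148.9e6   # land surface area of the Earth

CELL_SIDE_KM = 10.0          # nominal grid cell size stated in the README
TILES_V1 = 250_000           # tile count stated for version 1


def cells_needed(total_area_km2: float, cell_side_km: float = CELL_SIDE_KM) -> int:
    """Number of non-overlapping square cells needed to cover an area."""
    return math.ceil(total_area_km2 / cell_side_km**2)


def coverage_fraction(
    n_tiles: int, total_area_km2: float, cell_side_km: float = CELL_SIDE_KM
) -> float:
    """Fraction of an area covered by n_tiles idealized, non-overlapping cells."""
    return n_tiles * cell_side_km**2 / total_area_km2


if __name__ == "__main__":
    print("cells for whole globe:", cells_needed(EARTH_SURFACE_KM2))
    print("cells for land only:  ", cells_needed(LAND_SURFACE_KM2))
    print(f"v1 share of globe: {coverage_fraction(TILES_V1, EARTH_SURFACE_KM2):.1%}")
    print(f"v1 share of land:  {coverage_fraction(TILES_V1, LAND_SURFACE_KM2):.1%}")
```

Under these idealized assumptions, version 1 corresponds to roughly a sixth of the land surface, which gives a sense of how much headroom remains before full coverage.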
Pick a tile index and visualize all four modalities:
A complete notebook with metadata queries, filtering, and a streaming PyTorch DataLoader with parallel fetching is available here:
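As a rough illustration of what parallel fetching means in a streaming loader, the sketch below overlaps tile reads with a thread pool and groups the results into batches. It is not the notebook's code: it uses only the Python standard library instead of PyTorch, and `fetch_tile` and `stream_batches` are hypothetical names, with `fetch_tile` standing in for whatever remote read the real loader performs.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, Iterable, Iterator, List


def fetch_tile(index: int) -> Dict[str, Any]:
    """Hypothetical stand-in for a remote tile read.

    A real implementation would download or range-read the four
    modalities for the given tile index; here we return a placeholder.
    """
    return {
        "index": index,
        "modalities": ["optical", "thermal", "elevation", "land_cover"],
    }


def stream_batches(
    indices: Iterable[int],
    fetch: Callable[[int], Dict[str, Any]],
    batch_size: int = 4,
    num_workers: int = 4,
) -> Iterator[List[Dict[str, Any]]]:
    """Yield batches of samples, fetching tiles concurrently.

    ThreadPoolExecutor.map submits all fetches up front and returns
    results in input order, so reads overlap while batch order stays
    deterministic.
    """
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        batch: List[Dict[str, Any]] = []
        for sample in pool.map(fetch, indices):
            batch.append(sample)
            if len(batch) == batch_size:
                yield batch
                batch = []
        if batch:  # emit the final, possibly smaller, batch
            yield batch


if __name__ == "__main__":
    for batch in stream_batches(range(10), fetch_tile, batch_size=4):
        print([sample["index"] for sample in batch])
```

Threads suit this kind of workload because remote reads are I/O-bound. A PyTorch-based loader would typically get similar overlap from worker processes rather than from an explicit thread pool.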
CC-BY-SA-4.0
MajorTOM-Core has been made possible thanks to Asterisk Labs, the ELLIOT project (European Commission, Horizon Europe, Grant 101214398), and the Image and Signal Processing Group (ISP) at Universitat de València.