Sponsored by Taylor Geospatial, the Global Fields of The World (FTW) dataset provides global-scale estimates of agricultural fields for 2024–2025. The dataset includes both model inputs (Sentinel-2–derived median composites in COG and Zarr v3 formats) and outputs (in Zarr, GeoParquet and PMTiles).
Global mosaics and agricultural field predictions for 2024 and 2025, from Taylor Geospatial.
Accompanying paper: https://aka.ms/ftw-global-paper
features (version alpha)
EPSG:4326 Zarr V3 mosaic.predictions (version alpha)
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/features/cogs/alpha/.
Features are defined by selecting DOY ranges as planting/harvest heuristics and computing the median of masked pixels across ~5–10 scenes. See Appendix for the heuristics and masking details.
s2med_harvest/*:
["B02", "B03", "B04", "B08", "N_VALID_PIXELS"]N_VALID_PIXELS is the number of valid scenes after quality-flag masking.s2med_planting/*:
["B02", "B03", "B04", "B08", "N_VALID_PIXELS"]N_VALID_PIXELS is the number of valid scenes after quality-flag masking.index.parquet:
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/features/zarr/alpha/global.zarr
All feature COGs are reprojected and resampled to EPSG:4326 at 8.983119e-5° (~10 m at the equator)
using GDAL cubic resampling, producing a single Zarr mosaic with dimensions (time, band, y, x).
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/predictions/zarr/alpha/global.zarr
The PRUE model is run over features/zarr/alpha/global.zarr to produce a Zarr dataset with bands
[non_field_background, field, field_boundaries]. Feature and prediction Zarrs share the same grid,
so they are stackable.
We can inspect inputs and outputs side-by-side since they are stackable. This enables researchers to validate inputs to better understand their influence in model outputs.
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/predictions/vectors/alpha/results/
A GeoParquet vector dataset is derived from the prediction Zarr by thresholding the softmax outputs
for [non_field_background, field, field_boundaries] at 0.5 and polygonizing.
Files follow the GeoParquet v1.1.0 spec: ~8.2B rows across 1,001 files, ~629 GB on S3, with the following schema:
Query with DuckDB and visualize with Lonboard:
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/predictions/vectors/alpha/global.pmtiles
PMTiles built from the GeoParquet above for scalable browser-side visualization.
alpha)Relevant Sentinel-2 scenes are selected by acquisition DOY using the following functions:
All Sentinel-2 scenes were sourced from s3://sentinel-cogs/sentinel-s2-l2a-cogs.
Pixels with the following SCL values are masked out before taking the median:
CC-BY-4.0
isaac.corley@taylorgeospatial.org
If you use this dataset in your research please cite the following papers: