Global Fields of The World (FTW) provides global-scale estimates of agricultural fields for 2024–2025. The dataset includes both model inputs (Sentinel-2–derived median composites in COG and Zarr v3 formats) and outputs (in Zarr, GeoParquet and PMTiles).
Taylor Geospatial Engine Labs presents mosaics and global predictions of agricultural fields for the years 2024 and 2025. These mosaics and field boundary predictions were built with Wherobots RasterFlow.
features
alphaEPSG:4326 in a single Zarr V3 format as a single
dataset.predictions:
alphaLocation: s3://us-west-2.opendata.source.coop/ftw/global-data/features/cogs/alpha/.
Features are defined by selecting DOY ranges as planting/harvest heuristics and computing the median of masked pixels across ~5–10 scenes. See Appendix for additional information.
s2med_harvest/*:
["B02", "B03", "B04", "B08", "N_VALID_PIXELS"]N_VALID_PIXELS is the number of valid scenes after quality-flag masking.s2med_planting/*:
["B02", "B03", "B04", "B08", "N_VALID_PIXELS"]N_VALID_PIXELS is the number of valid scenes after quality-flag masking.index.parquet:
Location: s3://data.source.coop/ftw/global-data/features/zarr/alpha/global.zarr
We reproject and resample all feature COGs to EPSG:4326 at 8.983119e-5° (~10 m at the equator)
using GDAL cubic resampling, producing a single Zarr mosaic with dimensions (time, band, y, x).
Location: s3://data.source.coop/ftw/global-data/predictions/zarr/alpha/global.zarr
We run the PRUE model over features/zarr/alpha/global.zarr to generate a stackable Zarr dataset
with bands [non_field_background, field, field_boundaries]. Note that the feature and
predictions zarr datasets share the same grid and are "stackable".
We can inspect inputs and outputs side-by-side since they are stackable. This enables researchers to validate inputs to better understand their influence in model outputs.
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/predictions/vectors/alpha/results/
A geoparquet vector dataset has been created from the prediction zarr dataset by simply thresholding
the softmax outputs for the three labels [non_field_background, field, field_boundaries] by 0.5.
The files were created using the v1.1.0 spec. There are over 8217195679 rows across 1001 files totalling about 629Gb on S3 with the following schema:
Get started by using DuckDB and Lonboard for visualization.
Location: s3://us-west-2.opendata.source.coop/ftw/global-data/predictions/vectors/alpha/global.pmtiles
PMTiles are created from the vector data above enabling scalable visualization of the vectorized results.
See this blog post for more information on the platform used to generate these results. And see related data products such as fields of the world results for a few countries.
alpha)We determine relevant Sentinel-2 scenes by acquisition dates according to the following functions:
All Sentinel-2 scenes were sourced from s3://sentinel-cogs/sentinel-s2-l2a-cogs.
We use the following quality flags to mark Sentinel-2 bands as invalid as provided in the SCL band:
Email: len@wherobots.com