Global Fields of The World (FTW)

Name: Global Fields of The World (FTW)
Creator: Fields of The World

Sponsored by Taylor Geospatial, the Global Fields of The World (FTW) dataset provides global-scale estimates of agricultural fields for 2024–2025. The dataset includes both model inputs (Sentinel-2–derived median composites in COG and Zarr v3 formats) and outputs (in Zarr, GeoParquet and PMTiles).

Product Details

Visibility: Public
Owner: Fields of The World
Created: 4 Feb 2026
Last Updated: 23 Apr 2026

Product Contents

root

predictions

vectors

README

FTW Global — Field Boundary Predictions (GeoParquet)

Global agricultural field-boundary polygons predicted by the PRUE model (ftw-baselines), published as cloud-native GeoParquet partitioned by country with per-country PMTiles for web visualization. Part of Fields of the World.

What's here

GeoParquet, partitioned by country under alpha/results-by-admin-conf/admin:country_code=<CC>/. Query the whole dataset with the Portolan glob asset: alpha/results-by-admin-conf/admin:country_code=*/*.parquet.
One STAC item per parquet partition (id = the parquet filename). Large countries are split by admin subdivision and grouped under a per-country sub-catalog.
Per-country PMTiles alongside each parquet (<name>.pmtiles).
Two collection-level PMTiles for the global web view:
- ftw-global-fields-2025.pmtiles — default display (2025).
- 2024_with_confidence.pmtiles — 2024.
MapLibre styles in styles/ that color fields by confidence (red→green, 0–100) with the recommended >= 69 reliability filter and a legend.

How this was made — the model & training data

These boundaries are predictions from the PRUE model (Muhawenayo et al. 2026, PRUE: A Practical Recipe for Field Boundary Segmentation at Scale) — a U-Net semantic-segmentation model with composite loss functions and targeted data augmentations. In a benchmark of 18 models it reached 76% IoU and 47% object-F1 on the FTW benchmark (a +6% / +9% gain over the prior baseline), outperforming instance-segmentation and geospatial-foundation-model approaches, and is more robust to changes in illumination, spatial scale, and geographic location.

PRUE was trained and evaluated on the Fields of the World (FTW) benchmark — get the data on Source Cooperative: https://source.coop/kerner-lab/fields-of-the-world (Kerner et al. 2025, AAAI, arXiv:2409.16252). The benchmark is 70,462 samples across 24 countries on four continents (Europe, Africa, Asia, South America), pairing multi-date multispectral Sentinel-2 imagery with instance- and semantic-segmentation field masks; models pre-trained on FTW generalize better (zero-shot and fine-tuned) to held-out countries.

To build this global layer, PRUE is run over global Sentinel-2 planting/harvest median composites (the features collection / prediction Zarr), and the softmax outputs for [non_field_background, field, field_boundaries] are thresholded at 0.5 and polygonized into these vectors; the confidence raster summarizes per-cell reliability. The broader FTW ecosystem and downstream recipes (crop-type mapping, forest-loss attribution) are in the FTW field guide (Corley et al. 2026), which reports median predicted field sizes of 0.06 ha (Rwanda) to 0.28 ha (Switzerland) across five countries / 4.76M km².

The `confidence` column

Each polygon carries a confidence value on a 0–100 scale. It is derived, not part of the raw model vector output:

Sampled at each field's point-on-surface from the 500 m PRUE confidence COG (the predictions/confidence collection).
Rescaled raw / 0.578178 * 100 and clamped to 100 — i.e. 0.578178 is treated as 100%, matching the FTW inference app legend (raw 0.404→70, 0.463→80, 0.521→90, 0.578→100). Cells with no data become null.
This centroid sample is the closest single value to the app's confidence_mean (for the common case of fields smaller than a 500 m cell they are identical).
Recommended default filter: confidence >= 69 (raw 0.4). Confidence reflects cell-level model reliability, not individual-polygon geometric accuracy.

It is computed by the scripts in this repo: add_confidence.py, process_partition.sh, run_rails.sh, make_pmtiles.py.

Schema

Field definitions use the wording of the fiboa core specification and the vecorel geometry-metrics and administrative-division extensions (also machine-readable in the collection's table:columns):

Field	Description
`id`	An identifier for the field. Must be unique per collection.
`geometry`	A geometry that reflects the footprint of the field, usually a Polygon. Default CRS is WGS84.
`bbox`	The bounding box of the field.
`metrics:area`	Area of the field, in square meters (m²). Must be > 0.
`metrics:perimeter`	Perimeter of the field, in meters (m). Must be > 0.
`determination:datetime`	The last timestamp at which the field did exist and was observed, in the UTC timezone.
`determination:method`	The boundary creation method (one of: manual, surveyed, driven, auto-operation, auto-imagery, unknown).
`admin:country_code`	ISO 3166-1 alpha-2 country code (aka admin0). Two-letter country code for the country that contains the field.
`admin:subdivision_code`	ISO 3166-2 code for the principal subdivision (e.g. province or state, aka admin1) of a country that contains the field. Only the subdivision part of the code is stored.
`confidence`

The per-country PMTiles split fields into two layers by prediction year — 2024 and 2025 — derived from determination:datetime.

Using the data

License

CC-BY-4.0. Produced by the Taylor Geospatial Institute and the Microsoft AI for Good Research Lab.

README

FTW Global — Field Boundary Predictions (GeoParquet)

What's here

GeoParquet, partitioned by country under alpha/results-by-admin-conf/admin:country_code=<CC>/. Query the whole dataset with the Portolan glob asset: alpha/results-by-admin-conf/admin:country_code=*/*.parquet.
One STAC item per parquet partition (id = the parquet filename). Large countries are split by admin subdivision and grouped under a per-country sub-catalog.
Per-country PMTiles alongside each parquet (<name>.pmtiles).
Two collection-level PMTiles for the global web view:
- ftw-global-fields-2025.pmtiles — default display (2025).
- 2024_with_confidence.pmtiles — 2024.
MapLibre styles in styles/ that color fields by confidence (red→green, 0–100) with the recommended >= 69 reliability filter and a legend.

How this was made — the model & training data

The `confidence` column

Each polygon carries a confidence value on a 0–100 scale. It is derived, not part of the raw model vector output:

Sampled at each field's point-on-surface from the 500 m PRUE confidence COG (the predictions/confidence collection).
Rescaled raw / 0.578178 * 100 and clamped to 100 — i.e. 0.578178 is treated as 100%, matching the FTW inference app legend (raw 0.404→70, 0.463→80, 0.521→90, 0.578→100). Cells with no data become null.
This centroid sample is the closest single value to the app's confidence_mean (for the common case of fields smaller than a 500 m cell they are identical).
Recommended default filter: confidence >= 69 (raw 0.4). Confidence reflects cell-level model reliability, not individual-polygon geometric accuracy.

It is computed by the scripts in this repo: add_confidence.py, process_partition.sh, run_rails.sh, make_pmtiles.py.

Schema

Field definitions use the wording of the fiboa core specification and the vecorel geometry-metrics and administrative-division extensions (also machine-readable in the collection's table:columns):

Field	Description
`id`	An identifier for the field. Must be unique per collection.
`geometry`	A geometry that reflects the footprint of the field, usually a Polygon. Default CRS is WGS84.
`bbox`	The bounding box of the field.
`metrics:area`	Area of the field, in square meters (m²). Must be > 0.
`metrics:perimeter`	Perimeter of the field, in meters (m). Must be > 0.
`determination:datetime`	The last timestamp at which the field did exist and was observed, in the UTC timezone.
`determination:method`	The boundary creation method (one of: manual, surveyed, driven, auto-operation, auto-imagery, unknown).
`admin:country_code`	ISO 3166-1 alpha-2 country code (aka admin0). Two-letter country code for the country that contains the field.
`admin:subdivision_code`	ISO 3166-2 code for the principal subdivision (e.g. province or state, aka admin1) of a country that contains the field. Only the subdivision part of the code is stored.
`confidence`

The per-country PMTiles split fields into two layers by prediction year — 2024 and 2025 — derived from determination:datetime.

Using the data

License

CC-BY-4.0. Produced by the Taylor Geospatial Institute and the Microsoft AI for Good Research Lab.

Global Fields of The World (FTW)

Global Fields of The World (FTW)

FTW Global — Field Boundary Predictions (GeoParquet)

What's here

How this was made — the model & training data

The confidence column

Schema

Using the data

License

FTW Global — Field Boundary Predictions (GeoParquet)

What's here

How this was made — the model & training data

The confidence column

Schema

Using the data

License

The `confidence` column

The `confidence` column