This repository contains Clay embedding datasets created and maintained by lgnd.ai. Licensed under CC BY 4.0.
A deduplicated time series of Clay v1.5 embeddings, selecting one representative embedding per Major TOM grid cell per month. This removes redundancy from overlapping scene coverage at MGRS tile borders, normalizes observation frequency across regions with varying revisit rates, and simplifies time series analysis by providing consistent monthly snapshots.
The data is hive-partitioned by model version, collection, chip size, embedding dimensions, geohash, year, and month:
geo and covering metadataLicensed under CC BY 4.0.
| struct |
Bounding box (xmin, ymin, xmax, ymax) for geoparquet spatial filtering |
geometry | binary (WKB) | Polygon geometry of the chip footprint in EPSG:4326 |