This repository contains the GlobalBuildingAtlas (GBA) Level of Detail 1 (LoD1) building footprints dataset, converted from the original 922 GeoJSON files into Parquet format for improved performance and accessibility.
The GlobalBuildingAtlas LoD1 dataset contains building footprints for approximately 2.75 billion buildings worldwide. This represents the most comprehensive global building dataset available, covering nearly all inhabited areas on Earth.
The dataset combines multiple existing building footprint datasets with new footprints generated using open source deep learning models applied to Planet Labs satellite imagery.
This Parquet version was converted from the original 922 GeoJSON files (1.1 TB) to improve performance and accessibility. For detailed information about the conversion process and technical analysis, see Mark Litwintschik's comprehensive blog post.
The dataset is organized as Parquet files, each containing building footprint data with the following key attributes:
Each Parquet file contains:
The dataset combines multiple building footprint sources:
The TUM deep learning component uses open source computer vision models applied to Planet Labs satellite imagery. Planet Labs operates several constellations totaling hundreds of satellites in low Earth orbit, capturing images of the entire Earth's landmasses daily.
The dataset combines multiple data sources as detailed above. The TUM deep learning component (ours2) represents new building footprints generated from Planet Labs satellite imagery, while the other sources provide existing building data from various organizations.
For areas with significant recent construction, consider using the latest OpenStreetMap data or Overture's building dataset, which prioritizes recent OSM updates.
If you use this Parquet version of the dataset hosted on Source Cooperative, please cite both the original dataset and this hosted version:
Original Dataset:
Source Cooperative Hosted Version: