In this project, we provide the level of detail 1 (LoD1) data of buildings across the globe.
An overview of the dataset is illustrated below:
⚠️ This dataset is provided in three parts: ODbL-licensed polygons (GBA.ODbLPolygon), CC BY-NC 4.0 polygons and LoD1 building models (GBA.Polygon and GBA.LoD1), and CC BY-NC 4.0 height maps (GBA.Height).
⚠️ Users may combine these datasets for analysis or downstream applications, but doing so may create license implications.
⚠️ It is the responsibility of each user to ensure that their use complies with the respective licenses.
⚠️ This repository does not provide legal advice; users should review the original licenses and, if necessary, consult legal counsel.
The dataset aims at global coverage, so all countries, territories, and cities should be included. However, due to data quality limitations, some areas may be absent or may not have height attributes. You can check availability using our web viewer.
This is a machine-learning–derived product. Errors may occur. Please refer to the publication for validation results and further details.
High traffic can occasionally affect the web viewer. We restart the server as needed to maintain access. Please check the bulletin board for maintenance updates.
We used some building footprints from ODbL-licensed sources (OSM and Microsoft Global ML Building Footprints). Under the ODbL, derivatives must also be ODbL-licensed, which conflicts with our PLANET-derived CC BY-NC data. The dataset is therefore split as follows:
- Part I – HuggingFace: GBA.ODbLPolygon contains only building polygons derived from ODbL-licensed sources.
- Part II – HuggingFace: GBA.LoD1 contains additional building footprints from other sources, and LoD1 JSON files linking all building polygon features from GBA.ODbLPolygon and GBA.Polygon.
- GBA.Height – mediaTUM.
You can follow the README instructions in this repository or on HuggingFace to derive the final LoD1 GeoJSON files.
All building polygons are recorded in EPSG:3857. Some files in GBA.ODbLPolygon on HuggingFace may appear to be in EPSG:4326; please treat them as EPSG:3857.
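Because the polygons are stored in EPSG:3857, a region of interest given in longitude/latitude has to be projected before it can be compared with them. The following is a minimal sketch of the standard spherical Web Mercator formula using only the Python standard library; the function names are illustrative and are not part of this repository.

```python
import math

# Radius of the sphere used by Web Mercator (EPSG:3857), in metres.
EARTH_RADIUS_M = 6378137.0


def lonlat_to_web_mercator(lon_deg, lat_deg):
    """Convert WGS 84 longitude/latitude (degrees) to EPSG:3857 (metres)."""
    x = EARTH_RADIUS_M * math.radians(lon_deg)
    y = EARTH_RADIUS_M * math.log(
        math.tan(math.pi / 4 + math.radians(lat_deg) / 2)
    )
    return x, y


def bbox_to_web_mercator(min_lon, min_lat, max_lon, max_lat):
    """Project a lon/lat bounding box to an EPSG:3857 bounding box."""
    min_x, min_y = lonlat_to_web_mercator(min_lon, min_lat)
    max_x, max_y = lonlat_to_web_mercator(max_lon, max_lat)
    return min_x, min_y, max_x, max_y
```

For example, `bbox_to_web_mercator(11.4, 48.0, 11.7, 48.3)` returns a bounding box in metres that can be compared directly with polygon coordinates. For production use, a projection library such as pyproj is likely the more robust choice.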
A WFS (Web Feature Service) is provided so that the data can be accessed from other websites or GIS software such as QGIS and ArcGIS.
URL: https://tubvsig-so2sat-vm1.srv.mwn.de/geoserver/ows?
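Outside a GIS client, the WFS endpoint can also be queried over plain HTTP. The sketch below only builds request URLs with standard OGC WFS 2.0 key-value parameters. It assumes the server accepts WFS version 2.0.0 and GeoJSON output, which is not confirmed here, and the layer name passed to `get_feature_url` is a placeholder: the actual layer names should be read from the GetCapabilities response.

```python
from urllib.parse import urlencode

# Endpoint from this README, without the trailing "?".
WFS_ENDPOINT = "https://tubvsig-so2sat-vm1.srv.mwn.de/geoserver/ows"


def wfs_url(request, **params):
    """Build a WFS key-value-pair request URL (version 2.0.0 is assumed)."""
    query = {"service": "WFS", "version": "2.0.0", "request": request}
    query.update(params)
    return WFS_ENDPOINT + "?" + urlencode(query)


def get_feature_url(type_name, bbox_3857, count=100):
    """Build a GetFeature URL for a bounding box given in EPSG:3857.

    `type_name` is a hypothetical layer name; look up the real one
    in the GetCapabilities document.
    """
    min_x, min_y, max_x, max_y = bbox_3857
    bbox = f"{min_x},{min_y},{max_x},{max_y},EPSG:3857"
    return wfs_url(
        "GetFeature",
        typeNames=type_name,
        bbox=bbox,
        count=count,
        outputFormat="application/json",  # assumed to be enabled on the server
    )


# Lists the available layers and supported operations.
capabilities_url = wfs_url("GetCapabilities")
```

Keeping `count` small limits the number of features per request, which may help while the server is under heavy load.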
A web interface for viewing the data is available at: website.
Note: Over the past few days, our web viewer has received nearly 280,000 access requests. Due to this unusually high traffic, some data may not load completely, which may result in a significant portion of buildings not being displayed.
The full data can be downloaded as follows:
- Part I - HuggingFace, GBA.ODbLPolygon contains ONLY building polygons derived from ODbL-licensed data sources.
- Part II - HuggingFace, GBA.LoD1 contains additional building footprints from other data sources, and LoD1 JSON files linking all building polygon features from GBA.ODbLPolygon and GBA.Polygon.
- GBA.Height - mediaTUM.
- Access the representative dataset
  - Either in the representative/ folder from the HuggingFace folder of this repository, or via mediaTUM.
- Identify tiles overlapping your region of interest (RoI)
  - For polygons and LoD1 models: use lod1.geojson to find the intersecting tiles in GBA.Polygon or GBA.LoD1.
  - For heights: use height_zip.geojson and height_tif.geojson to find intersecting tiles in GBA.Height.
- Download required tiles
  - ODbL polygons: download from HuggingFace and save under:
      ./ODbLPolygon
  - LoD1 polygons and other data: download from this repository and save under:
      ./Polygon
      ./LoD1
- Run the enrichment script provided at HuggingFace
      python produce_lod1.py
  - You may also specify folder paths of your choice.
      python produce_lod1.py \
        --odbl_root /path/to/odbl \
        --polygon_root /path/to/polygon \
        --json_root /path/to/json \
        --output_root /path/to/output
  - The script will read the two GeoJSON folders and the JSON folder, merge the properties, and add height and var fields.
- Output
  - LoD1 GeoJSON files will be written under ./LoD1_GeoJSON or the folder you specified.
  - If you are only interested in GBA.Polygon, you can ignore the height and var fields.
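The tile lookup in the "Identify tiles overlapping your region of interest" step can be sketched with the standard library alone. The sketch assumes that each feature in an index file such as lod1.geojson is a tile footprint with a polygon geometry, and that the RoI bounding box is given in the same coordinate reference system as that file; neither is verified here. It compares bounding boxes only, so it may return a few tiles that touch the RoI's bounding box without truly intersecting the RoI. Which property of a feature holds the tile name is not assumed, so the full property dictionaries are returned.

```python
import json


def iter_positions(coords):
    """Yield (x, y) pairs from arbitrarily nested GeoJSON coordinate arrays."""
    if coords and isinstance(coords[0], (int, float)):
        yield coords[0], coords[1]
    else:
        for part in coords:
            yield from iter_positions(part)


def feature_bbox(feature):
    """Return (min_x, min_y, max_x, max_y) of a GeoJSON feature's geometry."""
    xs, ys = zip(*iter_positions(feature["geometry"]["coordinates"]))
    return min(xs), min(ys), max(xs), max(ys)


def bboxes_intersect(a, b):
    """True if two (min_x, min_y, max_x, max_y) boxes overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def find_tiles(index, roi_bbox):
    """Return the properties of every index feature whose bbox meets the RoI."""
    return [
        feature["properties"]
        for feature in index["features"]
        if bboxes_intersect(feature_bbox(feature), roi_bbox)
    ]


def load_index(path):
    """Load a GeoJSON index file such as lod1.geojson."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)
```

A typical call would be `find_tiles(load_index("lod1.geojson"), roi_bbox)`, after which the listed tiles can be downloaded as described in the "Download required tiles" step.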
- For code related to building map extraction, regularization, polygonization, and simplification, i.e., generating building polygons from satellite images (Sec. 4.3.2, Sec. 4.3.3, and Sec. 4.3.4), please refer to ./im2bf.
- For code related to monocular height estimation using HTC-DC Net (Sec. 4.4.2), please refer to ./im2bh.
- For code related to the global inference and uncertainty quantification (Sec. 4.4.3), please refer to ./infer_height.
- For code related to quality-guided building polygon fusion (Sec. 4.5.1), please refer to ./fuse_bf.
- For code related to LoD1 building model generation (Sec. 4.5.2), please refer to ./make_lod1.

For code to reproduce the plots in the manuscript, please refer to ./make_plots.
MIT with Commons Clause (no commercial use allowed). See LICENSE.
If you find this dataset helpful in your work, please cite the following paper.
@Article{essd-17-6647-2025,
AUTHOR = {Zhu, X. X. and Chen, S. and Zhang, F. and Shi, Y. and Wang, Y.},
TITLE = {GlobalBuildingAtlas: an open global and complete dataset of building polygons, heights and LoD1 3D models},
JOURNAL = {Earth System Science Data},
VOLUME = {17},
YEAR = {2025},
NUMBER = {12},
PAGES = {6647--6668},
URL = {https://essd.copernicus.org/articles/17/6647/2025/},
DOI = {10.5194/essd-17-6647-2025}
}
