Skip to content

Research into application of random forests in snow coverage mapping.

Notifications You must be signed in to change notification settings

DSHydro/HLS-SCA

Repository files navigation

HLS-SCA(Snow Mapping Project)

Machine learning (ML) techniques are becoming increasingly vital for land cover classification, particularly in mapping snow-covered areas. However, it is important that the land cover maps produced by these methods be validated against ground truth and compared with similar datasets to understand the uncertainties between models, products derived from remote sensing, and specific land surfaces. To this end, in this project, we are interested in understanding how well we can map snow-covered areas using the remote sensing Harmonized Landsat Sentinel-2 (HLS, Claverie et al., 2018) product at 10-m spatial resolution. We hypothesize that the ML methods are likely to perform better than the standard index-based snow mapping method. This repository focuses on evaluating the effectiveness of ML methods compared to traditional index-based methods for snow mapping using the Harmonized Landsat Sentinel-2(HLS) product at 10-m spatial resolution.

Objectives

  1. Access and visualize airborne and satellite imagery (HLS).
  2. Derive preliminary snow maps using index-based methods.
  3. Create a random forest model tailored to HLS datasets and evaluate its performance.
  4. Explore feature importance and model transferability, including tests on independent datasets.
  5. Derive snow-covered areas using existing models, including Google Dynamic World and NASA-IBM geospatial model.

Tools used in this Project/References

Environment

  • Cryocloud: Built-in environment to access and manipulate data.

Tutorials

HLS Data Access

Lidar Data

  • Earthdata Search: We used 50m resolution lidar data found via Earthdata Search as our truth values due to it being very precise and accurate. See here: for a tutorial on how to use the tool.
  • ASO Alternatively, we can look in here for specific regions of lidar data.

Accessing the working environement

To match the development environment, access cyrocloud.

Set up a local environment

If you are not able to access cryocloud, then look at the provided [[environment.yml]] file extracted from cryocloud. From the same directory, run conda env create --name envname --file=environment.yml (replacing envname) to generate a conda environment. This will have all libraries at the appropriate versions that we used in this project.

Model 0

We've generated model 0 based on "ground truth" data, derived from the HLS NDSI product using a threshold of 0.4. To get model 0, please follow the steps on notebooks:

  1. DataDiscovery.ipynb
  2. EarthdataLoginSetup.ipynb
  3. SnowIndexing.ipynb

Random Forest Model

Validation framework on comparing Ml-derived datasets with ground truth measurements. To run Random forest model: 1.RF_data_preparation.ipynb 2.RF_Model.ipynb

About

Research into application of random forests in snow coverage mapping.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages