How to build a STAC out of ERA5-land data?

Hello everybody,

i’m trying to build a STAC catalog based on some data variables from the ERA5 land collection. The downloaded data (via the Climate Data Store) stores the time variable in the different bands of the NetCDF file, e.g. one file contains all the hourly time steps of a month for said variable.

Now, my understanding of the STAC nomenclature is that every item refers to a specific location and time. Therefore, i would like to reference the different bands to different items, with every item containing the data variables of that specific time step.

So my main question is: How can i formulate a “href” to point to a specific band within one file?

I found a pystac tutorial for multi-spectral data sets (Create a STAC Item that Implements the EO Extension Using PySTAC | STAC Tutorials) which however would break the logic of one item referencing one specific location and time.

Another work-around i could think of is to export all the bands via gdal_translate to single files and than build my STAC based on those files.

Any ideas and advise is very much appreciated!

Cheers,
Martina

Hi Martina,

are you aware that B-Open, within the Destination Earth project, produced already a Zarr store with the ERA5 Land data, available as a STAC Item here https://earthdatahub.destine.eu/api/stac/v1/collections/era5/items/reanalysis-era5-land ? There’s an example on how to use it here: https://earthdatahub.destine.eu/tutorials/tutorial-era5-land-floods-precipitation-anomaly-in-greece

Let me know if you would still want to proceed in building your own STAC Collection, I’ll help you if needed.

1 Like

Hi,

i did not know about the project and the Zarr store, that is very cool. I will test the STAC item but it seems like it is exactly what i was looking for!

Thanks a lot for sharing the information!

Best,
Martina

This thread highlights something quite interesting and important about how we think about STAC and Zarr.

The OP imagined she was going to download thousands of NetCDF files and treat each one as a STAC item.

@clausmichele pointed her at a single STAC item representing the entire ERA5 dataset.

These are extremely different ways of representing ERA5!

3 Likes