Test cloud-optimized data formats for swath (L2) satellite data
Description
As NASA and NOAA data move to the cloud, what is the best cloud-optimized format for swath data? NASA has explored different formats and written a report that presents some viable options for swath data. For this project, we will stage a sample of the MODIS L2P data (from PO.DAAC) on Pangeo, transform it to a couple of different formats (e.g., Zarr, cloud-optimized HDF), and test access and analysis times for a few likely analysis patterns, such as collocation with globally distributed random points or finding all data within a bounding box.
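To make the two example access patterns concrete, here is a minimal sketch of how they might be timed. It is not the project's benchmark: the swath is synthetic and held in memory with NumPy, and the function names (`make_synthetic_swath`, `select_bounding_box`, `collocate_nearest`, `time_call`) are hypothetical. In the actual tests the arrays would be read from each candidate format, so that read time is included in the measurement.

```python
import time

import numpy as np


def make_synthetic_swath(n_along=200, n_across=100, seed=0):
    """Build a fake swath: 2D lat/lon arrays plus an SST-like field (all synthetic)."""
    rng = np.random.default_rng(seed)
    along = np.linspace(-60.0, 60.0, n_along)[:, None]
    across = np.linspace(-10.0, 10.0, n_across)[None, :]
    lat = along + 0.05 * across  # latitude varies mostly along track
    lon = 0.3 * along + across   # tilted track, so longitude depends on both indices
    sst = 290.0 + 10.0 * np.cos(np.deg2rad(lat)) + rng.normal(0.0, 0.1, lat.shape)
    return lat, lon, sst


def select_bounding_box(lat, lon, values, lat_min, lat_max, lon_min, lon_max):
    """Return the values of all pixels whose coordinates fall inside the box."""
    mask = (lat >= lat_min) & (lat <= lat_max) & (lon >= lon_min) & (lon <= lon_max)
    return values[mask]


def collocate_nearest(lat, lon, values, point_lats, point_lons):
    """Brute-force nearest-pixel match for each query point (fine for small samples)."""
    flat_lat, flat_lon, flat_val = lat.ravel(), lon.ravel(), values.ravel()
    matched = np.empty(len(point_lats))
    for i, (plat, plon) in enumerate(zip(point_lats, point_lons)):
        # Squared distance in degrees, with longitude scaled by cos(lat).
        # This is a rough stand-in for great-circle distance.
        dlon = (flat_lon - plon) * np.cos(np.deg2rad(plat))
        d2 = (flat_lat - plat) ** 2 + dlon ** 2
        matched[i] = flat_val[np.argmin(d2)]
    return matched


def time_call(func, *args):
    """Run func(*args) once and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = func(*args)
    return result, time.perf_counter() - start


lat, lon, sst = make_synthetic_swath()

# Pattern 1: all data within a bounding box.
box_values, box_seconds = time_call(
    select_bounding_box, lat, lon, sst, -10.0, 10.0, -5.0, 5.0
)

# Pattern 2: collocation with random points (here limited to the swath's extent).
rng = np.random.default_rng(1)
point_lats = rng.uniform(-60.0, 60.0, 50)
point_lons = rng.uniform(-20.0, 20.0, 50)
matches, match_seconds = time_call(
    collocate_nearest, lat, lon, sst, point_lats, point_lons
)

print(f"bounding box: {box_values.size} pixels in {box_seconds:.4f} s")
print(f"collocation: {matches.size} points in {match_seconds:.4f} s")
```

Because swath latitude and longitude are 2D arrays rather than regular 1D axes, both patterns need a mask or a search over the coordinate arrays instead of simple index slicing. That is one reason chunking and format choices could affect these timings.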
Required Skills
What technical skills are needed in order to contribute? For example
- Basic to intermediate Python programming (Xarray, Matplotlib)
Mentors
- Chelle Gentemann, cgentemann@faralloninstitute.org