Problems downloading CMIP6 historical simulations from PANGEO gallery

Hi good folks. So I am using this library: GitHub - dr-ko/python_pangeo_cmip6: Python scripts to download CMIP6 data from Pangeo Gallery · GitHub, to download historical simulations, particularly large ensembles. For some models, not all members will get downloaded. And in some cases, for ocean variable, let’s say ‘tos’ and an atmospheric variable (e.g., zg), not the same members will get downloaded for both. Is this an issue withing the PANGEO gallery, or could it be a problem of the said library? Thanks in advance.

I don’t know much about that package but I believe @jbusecke worked on a Zarr version of CMIP6 so perhaps he has some insights to share?

Hey @nrchow I am not at all familiar with that library, but you can check out the latest version of the Pangeo Zarr CMIP6 data on GCS and see if there are more members available.

Feel free to hit me up if you have more questions. Cheers

Hey @jbusecke, thank you very much. So I can download data with this or has to be accessed through the cloud? Thanks.

The instructions will give you a lazy representation of the data in xarray pointing to data in the cloud, and you should be able to work with the data on your machine (data will stream to your local computer, so performance depends on your internet connection). If you find yourself repeating the loading process often you cpuld set up a local cache or download the data (the catalog has the urls for the data).