I want tips on setting up a scalable Pangeo cloud environment

Hey everyone,

I am new to Pangeo & trying to set up a cloud environment that can handle large geoscience datasets efficiently. I have been exploring some tools & workflows but still feel a bit lost on where to start with scaling things smoothly on the cloud.

Has anyone set up Pangeo on AWS, GCP or Azure? What is the best way to manage resources without breaking the bank? Also, how do you handle data storage and access to keep workflows fast and reliable?

By the way, while learning some automation for my projects, I took a Selenium Course which helped me understand managing cloud resources better—so any advice on integrating automation with Pangeo would be awesome too!

I have check Scratch Bucket is not working on new Pangeo Cloud cluster I want to hear your setups, challenges & any tips or tutorials for beginners working with cloud-based Pangeo workflows.

Thank you.:slight_smile:

1 Like

Selenium is a browser automation engine, so I’m not sure how that would help here. Maybe share a bit more about your background and what you’re more interested in (the infrastructure / cloud devops side? The geoscience side?) and we can point you in some directions.