Processing Terabyte-Scale NASA Cloud Datasets with Coiled

jrbourbeau · November 7, 2023, 6:12pm

Hi All,

I’ve been working with groups that process NASA data (folks like @asteiker @betolink @andypbarrett @jinbow) to help move their workloads to the cloud. It turns out a common pattern for lots of workloads is a big parallel for-loop.

Here’s a blogpost where we wrote about this analysis pattern and discussed cloud performance and cost Processing Terabyte-Scale NASA Cloud Data | Coiled (Spoiler: it’s surprisingly cheap to churn through a bunch of cloud data when using spot instances, the right region, etc.)

Hopefully this can serve as a useful template for others with similar use cases

betolink · November 27, 2023, 9:37pm

Just to mention that we are having a community call tomorrow to talk about Coiled-powered workflows for cloud data Openscapes Community Call: NASA Earthdata Cloud with Coiled
This call is open and it would be great if Pangeo folks can attend and give their 2c on this approach.

Topic		Replies	Views
Blog post: Processing a 250 TB dataset with Xarray, Dask, and Coiled Cloud	0	452	September 5, 2023
Cloud Optimized Geotiffs + Pangeo best practices Data	4	2081	January 21, 2021
Wednesday February 22nd 2023: D’explorer Explore cloud datasets from your notebooks Pangeo Showcase	13	546	March 7, 2023
Exploring Pangeo's Data Processing Capabilities for Large-Scale Climate Modeling! Data	0	69	January 31, 2025
Large Scale Geospatial Benchmarks News & Announcements	2	220	October 22, 2024

Processing Terabyte-Scale NASA Cloud Datasets with Coiled

Related topics