Pangeo Showcase: "Cloud Native Data Loaders for Machine Learning Using Zarr and Xarray"

Title: “Cloud Native Data Loaders for Machine Learning Using Zarr and Xarray”
Invited Speaker: Joe Hamman (ORCID:0000-0001-7479-8439)
When: Wednesday March 20, 12PM EDT
Where: Launch Meeting - Zoom
Abstract: We lack well established patterns for streaming scientific data from cloud object storage into machine learning frameworks. This presentation will review a recent blog post we wrote (Cloud native data loaders for machine learning using Zarr and Xarray | Earthmover) describing one such pattern that uses Xarray, Zarr, Xbatcher, and PyTorch to build a cloud-native dataloader for scientific data. I will explain how the dataloader works, outline the benchmark results, and discuss where we could go from here.

  • 20 minutes - Community Showcase
  • 40 minutes - Showcase Discussion/Community Check-ins
3 Likes