Any suggestions for efficiently operating over windows of data?

maxrjones · January 31, 2023, 11:41pm

You may find some helpful information in the topic "Efficiently slicing random windows for reduced xarray dataset", which also covers how to efficiently select only “valid” examples.

We’re working on the xbatcher library for this type of use case. Our roadmap for the next few months focuses on improving the efficiency of the data loaders, and we’re considering adding support for filtering examples.
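In the meantime, here is a rough sketch of the general idea of enumerating windows by index rather than copying data up front. It is not xbatcher's implementation; the function name `window_slices` and the dimension names and sizes are made up for illustration.

```python
from itertools import product


def window_slices(dim_sizes, window_sizes, strides=None):
    """Yield one dict of {dim name: slice} per full window.

    Hypothetical helper for illustration only. Windows that would run
    past the end of a dimension are skipped.
    """
    strides = strides or {}
    starts_per_dim = []
    for dim, win in window_sizes.items():
        step = strides.get(dim, win)  # default: non-overlapping windows
        starts_per_dim.append(range(0, dim_sizes[dim] - win + 1, step))
    # Cartesian product of start positions across all windowed dimensions
    for starts in product(*starts_per_dim):
        yield {
            dim: slice(start, start + window_sizes[dim])
            for dim, start in zip(window_sizes, starts)
        }


# Example with assumed sizes: a 10 x 6 grid, 4 x 3 windows, overlapping in x
windows = list(
    window_slices({"x": 10, "y": 6}, {"x": 4, "y": 3}, strides={"x": 2, "y": 3})
)
```

Each dict can be passed to `ds.isel(...)` on an xarray Dataset to pull out one window lazily, and a "valid" check can be applied to each window before it is loaded so that unwanted examples are skipped.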
