I am currently downloading a dataset from Zenodo (tens of GB) and it is taking a long time (an hour or so). Is this common? Any ideas on how to speed it up?
Hi, I have been facing the same problem for the last couple of weeks… Sometimes it even happens that 25 GB out of a 26 GB dataset has downloaded and then the Zenodo server goes down; I once waited more than 30 hours for a download to complete.
Pretty pathetic, but it’s cool that at least we are getting the data from there…
Just jumping in to say I’ve been having the same trouble. I don’t know that I’ve seen it this bad before.
It would be cool if they had a status page, in case this kind of thing becomes more frequent.
I note that they do sometimes share status on the Zenodo blog, e.g. Zenodo upgrade issues, 2023-10-19
I’m seeing slow speeds myself now, e.g. 200 kB/s, despite speedtest.net results of 80 Mbps and more…
If anyone else struggles with slow download speeds on Zenodo, try using this command-line tool:
GitHub - dvolgyes/zenodo_get: Downloader for Zenodo records. I was able to download a ~2 GB dataset within seconds.
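In case it helps, a minimal sketch of driving it from Python (it’s really a command-line tool, so this just shells out); the record ID is a placeholder, and I’m assuming it accepts a record ID or DOI as the positional argument:

import subprocess

# Placeholder record ID; a DOI like 10.5281/zenodo.XXXXXXX should also work (assumption).
subprocess.run(["zenodo_get", "XXXXXXX"], check=True)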
Hi,
The xcube plugin xcube-zenodo may help.
- It offers lazy access to chunked datasets published as tif or netcdf on Zenodo; see the example notebook, and the rough sketch after this list.
- If the dataset is published in a compressed format (zip, tar, tar.gz), then the full dataset needs to be downloaded first. This can be done via xcube’s preload API; see the other example notebooks in the same repository.
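For orientation, a rough sketch of what the lazy-access route could look like. I’m assuming the plugin registers a "zenodo" data store and takes the record ID as "root"; the data ID below is made up, so check the example notebook for the real parameter names:

from xcube.core.store import new_data_store

# Assumptions: the plugin registers a "zenodo" store and takes the record ID as "root";
# the data ID below is a placeholder.
store = new_data_store("zenodo", root="XXXXXXX")
print(list(store.get_data_ids()))          # list the datasets available in the record
ds = store.open_data("some_dataset.tif")   # lazy, chunked access via xarray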
I’ve also noticed that on small machines in AWS it’s common to get <1 MB/s transfer speeds for a big 10 GB+ Zenodo .zip.
In Germany on home Wi-Fi it’s a bit better, ~6 MB/s, which I assume is because the Zenodo data centers are physically in Switzerland (Infrastructure | Zenodo).
Oh, this was interesting for me: everything is now a store, including a website with a directory containing a tif?? Isn’t that seen as pretty fragile, since a simple string is obscured by required code in only one language?
import rioxarray
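# /vsicurl/ makes GDAL read the remote file through HTTP range requests, so only the blocks actually accessed get downloaded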
rioxarray.open_rasterio("/vsicurl/https://zenodo.org/records/8154445/files/planet_canopy_cover_30m_v0.1.tif?download=1")
<xarray.DataArray (band: 1, y: 149363, x: 170397)> Size: 25GB
[25451007111 values with dtype=uint8]
Coordinates:
* band (band) int64 8B 1
* x (x) float64 1MB 2.555e+06 2.555e+06 ... 7.667e+06 7.667e+06
* y (y) float64 1MB 5.82e+06 5.82e+06 ... 1.339e+06 1.339e+06
spatial_ref int64 8B 0
Attributes:
AREA_OR_POINT: Area
_FillValue: 255
scale_factor: 1.0
add_offset: 0.0
I’m not a particularly heavy user of rioxarray, but this is lazy access to a chunked dataset, and it would work just as well inside a zip, tar, or tar.gz (prepend the /vsizip/, /vsitar/, or /vsigzip/ prefixes as appropriate). With osgeo.gdal.OpenEx and OF_MULTIDIM_RASTER, NetCDF and Zarr can also be accessed in multidimensional mode. You really don’t have to download compressed files.
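For example, a minimal sketch of both patterns; the record number, archive name, and file names are placeholders, not a real record:

import rioxarray
from osgeo import gdal

# Hypothetical archive: chain /vsizip/ in front of /vsicurl/ to read a GeoTIFF inside
# a remote zip without downloading the whole archive (query string left off so GDAL
# can split the archive path from the inner file).
zip_url = "https://zenodo.org/records/XXXXXXX/files/data.zip"
da = rioxarray.open_rasterio(f"/vsizip//vsicurl/{zip_url}/example.tif")

# Hypothetical NetCDF: multidimensional access via GDAL's multidim raster API.
nc_url = "https://zenodo.org/records/XXXXXXX/files/example.nc"
ds = gdal.OpenEx(f"/vsicurl/{nc_url}", gdal.OF_MULTIDIM_RASTER)
print(ds.GetRootGroup().GetMDArrayNames())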
And without using another library (no offence intended, but I try to keep my stack as minimal as possible).
what library are you using to download?
I used zenodo_get, but the call you presented seems way more straightforward.
The only thing that can be difficult (perhaps) is downloading from Zenodo with authentication for closed repositories.
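For what it’s worth, a rough sketch of an authenticated download with a personal access token; the record ID, file name, and exact API path are assumptions and may need adjusting for a given record:

import requests

TOKEN = "YOUR_ZENODO_TOKEN"  # personal access token (placeholder)
url = "https://zenodo.org/api/records/XXXXXXX/files/data.zip/content"  # hypothetical record/file

with requests.get(url, params={"access_token": TOKEN}, stream=True, timeout=60) as r:
    r.raise_for_status()
    with open("data.zip", "wb") as f:
        for chunk in r.iter_content(chunk_size=1 << 20):
            f.write(chunk)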