Fix typo in data download instructions. #193

Closed

Contributor

@mkcor mkcor commented Apr 23, 2024

Dear community,

Last year (Jul 19, 2023), I was able to download image data from the IDR after I reached out to the authors of the corresponding publication and they pointed me to this page: https://idr.openmicroscopy.org/about/download.html

Many thanks to my former Outreachy intern @ana42742, who greatly assisted in the process!

Today, I would like to download more data, using a cluster rather than my laptop. So I went back to the 'Data download' instructions and noticed that the page had changed (indeed, the bottom of the page says 'IDR logo: prod120. Last updated: 2024-03-06').

I can browse the Git history to find the old instructions with the Aspera CLI client, but I'd be curious to know why you decided to 'migrate from Aspera to FileZilla' (since I can't access the document referenced in #189). Usually, when working with large datasets, we use HPC clusters and, hence, prefer CLI over GUI tools.

Cc'ing @joshmoore here.

Best,
Marianne

Member

@sbesson sbesson left a comment

Thanks for the typo fix.

> I can browse the Git history to find the old instructions with the Aspera CLI client, but I'd be curious to know why you decided to 'migrate from Aspera to FileZilla' (since I can't access the document referenced in #189). Usually, when working with large datasets, we use HPC clusters and, hence, prefer CLI over GUI tools.

For a bit of background, the public IDR data has been migrated internally within EMBL-EBI. The old download workflow broke as part of this migration, so the former instructions were replaced. The EMBL-EBI public data storage effectively supports a few transfer protocols:

  • FTP
  • Aspera
  • Globus

I'll leave @pwalczysko and @francesw to comment on the rationale for focusing on the FTP/FileZilla workflow.
I concur that for large datasets a command-line / headless application is preferred. If the FTP protocol is an option, you could try using a command-line FTP client instead.
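
As an illustration of the headless FTP route mentioned above, here is a minimal sketch using Python's standard `ftplib`. It is not taken from the IDR documentation: the function names `download_tree` and `fetch_study` are hypothetical, anonymous login is assumed, and the host and directory must be supplied by the user (none are given in this thread). It also assumes the server returns plain entry names from `NLST` and rejects `CWD` on regular files, which is common but not guaranteed.

```python
import os
import posixpath
from ftplib import FTP, error_perm


def download_tree(ftp, remote_dir, local_dir):
    """Recursively copy remote_dir (an absolute path) into local_dir.

    `ftp` is an already connected and logged-in ftplib.FTP object.
    Returns the list of local file paths that were written.
    """
    os.makedirs(local_dir, exist_ok=True)
    ftp.cwd(remote_dir)
    try:
        names = ftp.nlst()
    except error_perm:
        # Some servers answer 550 when a directory is empty.
        names = []

    written = []
    for entry in names:
        name = posixpath.basename(entry.rstrip("/"))
        if name in ("", ".", ".."):
            continue
        remote_path = posixpath.join(remote_dir, name)
        local_path = os.path.join(local_dir, name)
        try:
            # Assumption: CWD succeeds only for directories.
            ftp.cwd(remote_path)
        except error_perm:
            # Not a directory, so fetch it as a binary file.
            with open(local_path, "wb") as fh:
                ftp.retrbinary("RETR " + remote_path, fh.write)
            written.append(local_path)
        else:
            written.extend(download_tree(ftp, remote_path, local_path))
    return written


def fetch_study(host, remote_dir, local_dir):
    """Open an anonymous FTP session and mirror remote_dir locally."""
    with FTP(host) as ftp:
        ftp.login()  # anonymous login; an assumption about the server
        return download_tree(ftp, remote_dir, local_dir)
```

On a cluster node this could be called as, for example, `fetch_study("ftp.example.org", "/pub/some-study", "downloads")`, where both the host and the path are placeholders. For very large studies, a dedicated client with resume support is likely a better fit than this sketch, which restarts every file from scratch.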

@dominikl
Member

See also https://forum.image.sc/t/idr-zipping-an-entire-study/95123 and issue #190, i.e. we should add instructions for Globus to the IDR download page. I can open a PR later.

@dominikl
Member

Thanks for fixing the typo, @mkcor. If you don't mind, I cherry-picked your commit into my PR #194, so this one can be closed and we don't get any merge conflicts.

@dominikl dominikl closed this Apr 25, 2024
@mkcor
Contributor Author

mkcor commented Apr 25, 2024

All good, @dominikl! Thank you very much. I look forward to following the updated instructions... I should be able to get back to this by the end of the week.
