xtreme / pan-x cannot be downloaded #2691

Closed
severo opened this issue Jul 21, 2021 · 5 comments · Fixed by #2710
Labels
bug Something isn't working

Comments

@severo
Collaborator

severo commented Jul 21, 2021

Describe the bug

Dataset xtreme / pan-x cannot be loaded

Seems related to #2326

Steps to reproduce the bug

from datasets import load_dataset
dataset = load_dataset("xtreme", "PAN-X.fr")

Expected results

Load the dataset

Actual results

FileNotFoundError: Couldn't find file at https://www.dropbox.com/s/12h3qqog6q4bjve/panx_dataset.tar?dl=1

Environment info

  • datasets version: 1.9.0
  • Platform: macOS-11.4-x86_64-i386-64bit
  • Python version: 3.8.11
  • PyArrow version: 4.0.1

severo added the bug label Jul 21, 2021
severo mentioned this issue Jul 21, 2021

@albertvillanova
Member

Hi @severo, thanks for reporting.

However, I have not been able to reproduce this issue. Could you please confirm whether the problem persists for you?

Maybe Dropbox (where the data source is hosted) was temporarily unavailable when you tried.

@severo
Collaborator Author

severo commented Jul 22, 2021

Hmmm, the file (https://www.dropbox.com/s/dl/12h3qqog6q4bjve/panx_dataset.tar) really seems to be unavailable... I tried from various connections and machines and got the same 404 error. Maybe the dataset was loaded from the cache in your case?
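
For anyone who wants to repeat this check without going through `datasets` (and therefore without any local cache getting in the way), here is a minimal sketch using only the Python standard library. It is not how `datasets` itself downloads files. The helper name `http_status` and its `opener` parameter are hypothetical, introduced only for illustration, and the URL is the one copied from the error message above.

```python
import urllib.error
import urllib.request

# URL copied from the FileNotFoundError message in this issue.
PANX_URL = "https://www.dropbox.com/s/12h3qqog6q4bjve/panx_dataset.tar?dl=1"


def http_status(url, opener=urllib.request.urlopen, timeout=10):
    """Return the HTTP status code for `url`, or None if no response arrived.

    `opener` is a hypothetical injection point (not part of any library API)
    so the check can be exercised without network access.
    """
    # A HEAD request asks for the headers only, so the tar file is not downloaded.
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # 4xx/5xx responses are raised as HTTPError; the code is on the exception.
        # This clause must come first because HTTPError subclasses URLError.
        return err.code
    except urllib.error.URLError:
        # No HTTP response at all (DNS failure, refused connection, and similar).
        return None
```

Calling `http_status(PANX_URL)` bypasses the `datasets` cache entirely, so a result of 404 would point at the remote file rather than at the local environment.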

@albertvillanova
Member

Yes @severo, weird... I could access the file when I answered you, but now I can no longer access it either... Maybe it was from the cache, as you point out.

Anyway, I have opened an issue in the GitHub repository responsible for the original dataset: afshinrahimi/mmner#4
I have also contacted the maintainer by email.

I'll keep you informed of their answer.

@albertvillanova
Member

albertvillanova commented Jul 23, 2021

Reply from the author/maintainer:

Will fix the issue and let you know during the weekend.

@albertvillanova
Member

albertvillanova commented Jul 23, 2021

The author told me that Dropbox has apparently changed its policy and no longer allows downloading the file without signing in first. The author asked Hugging Face to host the dataset.
