Dataset can't be loaded. #1

Open
guillaume-chevalier opened this issue Sep 20, 2018 · 1 comment

@guillaume-chevalier

I've tried to download the dataset, but I couldn't get it from any of the links.
I went from your recent article: https://ahmedbesbes.com/overview-and-benchmark-of-traditional-and-deep-learning-models-in-text-classification.html
To this: http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/
To then this: http://www.sananalytics.com/lab/twitter-sentiment/
However, the last link (sananalytics.com) doesn't load at all.

Alternatively, I tried to get the data from your previous blog post:
https://ahmedbesbes.com/sentiment-analysis-on-twitter-using-word2vec-and-keras.html
I tried to download the dataset from the Google Drive link, but the file seems broken. I copied your def ingest(): function and ran it. At first the file didn't load at all, so I had to change the encoding to latin-1. After that I got an error and realized the dataset has no column names: ValueError: labels ['ItemID' 'SentimentSource'] not contained in axis. It was raised on this line: data.drop(['ItemID', 'SentimentSource'], axis=1, inplace=True).

I wonder how I could reproduce your experiments, or at least use the same data for a quick comparison. I didn't try anything beyond what I've described above. I guess adding names to the columns manually might fix it, but I suspect other things would also break further down the road. It'd be very cool if you could provide an easy data loading pipeline.
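For reference, the "add names to the columns manually" idea could look roughly like the sketch below. It is not the ingest() function from the blog post. The function name load_tweets is hypothetical, and the column order is an assumption: only ItemID and SentimentSource appear in the error above, while Sentiment and SentimentText are guessed names. The sample rows at the bottom are made up purely to show the call.

```python
import io

import pandas as pd

# Assumed column order. Only 'ItemID' and 'SentimentSource' come from the
# error message above; 'Sentiment' and 'SentimentText' are guessed names.
COLUMNS = ["ItemID", "Sentiment", "SentimentSource", "SentimentText"]


def load_tweets(source, columns=COLUMNS):
    """Load the CSV whether or not it ships with a header row."""
    # header=None treats every row as data; names= supplies the column names.
    # latin-1 is the encoding that made the file readable in the report above.
    df = pd.read_csv(source, encoding="latin-1", header=None, names=columns)

    # If the file did contain a header, it is now the first data row: drop it.
    if str(df.iloc[0, 0]).strip() == columns[0]:
        df = df.iloc[1:].reset_index(drop=True)

    # errors="ignore" skips columns that are absent instead of raising.
    return df.drop(columns=["ItemID", "SentimentSource"], errors="ignore")


# Made-up sample rows, without a header, encoded as latin-1.
sample = "1,0,SomeSource,caf\u00e9 was closed\n2,1,SomeSource,great day\n"
tweets = load_tweets(io.BytesIO(sample.encode("latin-1")))
print(tweets)
```

When a header row is present and dropped this way, the remaining values in that frame stay as strings, so the label column may need an explicit cast afterwards. Rows whose text contains unquoted commas would also need separate handling that this sketch does not attempt.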

Thanks!

@ahmedbesbes
Owner

Hello Guillaume,

Here's the link to download the dataset: http://thinknook.com/wp-content/uploads/2012/09/Sentiment-Analysis-Dataset.zip

Ahmed,
