Social Media Sentiment Analysis

📃Table of Content

Description
Our Progress
Results
Screenshots
How to run

🚀Description

Social media has become a powerful platform for public opinion and brand perception. This notebook explores the use of Artificial Intelligence (AI) techniques, specifically sentiment analysis, to understand the sentiment expressed in social media data. We will delve into the process of data collection, pre-processing, model building, and evaluation to build a system that can automatically classify social media posts as positive, negative, or neutral.

⏳Our Progress

We divided the project into five stages:

Data Exploration

Data import: We loaded our data into pandas dataframe (736 rows and 14)
Column exploration: Explored every column to know what does it contain and represent in our data and to identify whether it will help our analysis or it is just an identifier (e.g. ID).
Redundancy check: We noticed in this stage that there are 20 redundant columns.
String columns space: We noticed different values for the same string so we deduced it is extra space.

Data Cleaning

Redundancy removal: We removed the redundant to prevent any insignificance and striped the spaces from the strings columns.
Hashtag column extraction: Our columns contained multiple hashtags in the same string so we extracted them to different columns.

Exploratory Data Analysis

Visualization techniques: We used different visual techniques: pie charts, wordcloud, line plots, scatter plots and bar charts to identify different trends on every social media platform.
Trends and patterns: For every social media platform existed in our data (Facebook, Instagram, and Twitter) we identified it's users interests and usage patterns.

Text Preprocessing

clean_text function: Utilized the power of RegEx and NLTK libraries to apply deep cleaning and preprocessing for text data by stemming, removing stop words, punctuation, and numbers.
Vectorization: Vectorized our text using Bag of Words and TF-IDF methods.

Modeling

Models: We used the following models:
- Logistic Regression.
- K-Nearest Neighbors.
- Support Vector Machine.
- Naive Bayes.
- Decision Tree.
- Random Forest.
Experimentation: We trained the model on two splits: one with bag of words vectorized text and the other using tf-idf.
Evaluation metrics: We evaluated our model using accuracy score, precision, recall, and F1-score and averaging the result with macro.

Deployment

Models: After the evaluation we agreed that our best model is Support Vector Machine but we gave the user in the deployment the ability to use all the models we trained.

🔬Results

Using Bow Training Model

Model Name	Train Accuracy	Test Accuracy
Logistic Regression	100%	85%
K-Nearest Neighbors	87%	78%
Naive Bayes (Multinomial)	98%	83%
Support Vector Machine (SVM)	100%	80%
Decision Tree	100%	70%
Random Forest	100%	78%

Using TF-IDF Training Model

Model Name	Train Accuracy	Test Accuracy
Logistic Regression	94%	79%
K-Nearest Neighbors	91%	86%
Naive Bayes (Multinomial)	91%	85%
Support Vector Machine (SVM)	98%	85%
Decision Tree	100%	61%
Random Forest	100%	76%

📸 Screenshots

Title Page
Positive Prediction
Neutral Prediction
Negative Prediction

🛠️How to run

Clone the project

git clone https://github.com/Social-Media-Sentiment-Analysis/NLP-Project

install dependencies

cd NLP-Project 
pip install -r requirements.txt

Next, you run this code and the deployment will work locally with you.

streamlit run deployment.py

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
README.md		README.md
Sentiment_Analysis.joblib		Sentiment_Analysis.joblib
Social_Media_Sentiment_Analysis_Project.ipynb		Social_Media_Sentiment_Analysis_Project.ipynb
deployment.py		deployment.py
dt.joblib		dt.joblib
intro_image.png		intro_image.png
knn.joblib		knn.joblib
lr.joblib		lr.joblib
mnb.joblib		mnb.joblib
negative_sentiment.png		negative_sentiment.png
neutral_sentiment.png		neutral_sentiment.png
positive_sentiment.png		positive_sentiment.png
requirements.txt		requirements.txt
rf.joblib		rf.joblib
sentimentdataset.csv		sentimentdataset.csv
svm.joblib		svm.joblib
tempCodeRunnerFile.py		tempCodeRunnerFile.py
tf_idf_vectorizer.joblib		tf_idf_vectorizer.joblib
title_page.png		title_page.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Social Media Sentiment Analysis

📃Table of Content

🚀Description

⏳Our Progress

Data Exploration

Data Cleaning

Exploratory Data Analysis

Text Preprocessing

Modeling

Deployment

🔬Results

📸 Screenshots

🛠️How to run

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 5

Uh oh!

Languages

Social-Media-Sentiment-Analysis/NLP-Project

Folders and files

Latest commit

History

Repository files navigation

Social Media Sentiment Analysis

📃Table of Content

🚀Description

⏳Our Progress

Data Exploration

Data Cleaning

Exploratory Data Analysis

Text Preprocessing

Modeling

Deployment

🔬Results

📸 Screenshots

🛠️How to run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 5

Uh oh!

Languages

Packages