datawrangling

Here are 233 public repositories matching this topic...

harishrongala / Food_Choices

Data wrangling in Python

exploratory-data-analysis ipython-notebook pandas seaborn statistical-inference matplotlib data-wrangling datawrangling data-cleaning

Updated Apr 21, 2017
Jupyter Notebook

patmendoza330 / annotationwrangling

Star

Converting and integrating data from multiple sources is often tricky business. Luckily there are some great tools available that make this a breeze. I use a genetic annotation file (Brachypodium) and incorporate gene ontology definitions. This Uses dplyr and tidyr to do the data wrangling.

r datawrangling

Updated Oct 4, 2021

NAVEENDATAANALYST / Medical-Appointment-No-Show

Star

You can find the dataset in kaggle

python eda datawrangling datavisualization datacleaning

Updated Jul 8, 2023
Jupyter Notebook

mhashemihsmw / MLMOI

Star

The package reaches out to scientists that seek to estimate MOI and lineage frequencies at molecular markers using the maximum-likelihood framework described in https://doi.org/10.1371/journal.pone.0261889. Users can import data from Excel files in various formats, and perform maximum-likeli

data data-visualization datawrangling statistical-models dataanalysis datapreprocessing

Updated Nov 27, 2023
R

nzsaurabh / aggregatingdata

Star

Aggregate data in R using simple SQL commands

r sql time-series datawrangling sqldf

Updated Dec 2, 2018
R

suprematis / US-Newborn-Name-Analysis

Star

This is an exercise on the use of python for data wrangling based on the book "Python for Data Analysis" by Wes McKinney

data-visualization names plotting datawrangling

Updated Jul 21, 2018
Jupyter Notebook

hallan6749 / WozUDataScienceStudentCodeTurnedIn

Star

I finished the Woz U's Data Science program in March 2022. This is the code and the projects that I turned in during my student experience.

visualization python data-science r statistics sql database agile nosql scrum machinelearning datawrangling tableau

Updated Apr 11, 2022
HTML

neerajmech57 / PYTHON-PROJECT

Star

THIS repo contains projects done under Udemy Boot Camp on Data Science

python numpy pandas-dataframe datawrangling datacleaning

Updated Sep 27, 2022
Jupyter Notebook

SindiAI / WeRateDogs

Star

This repository provide an overview of the data wrangling process used for the WeRateDogs Twitter account dataset. The data wrangling process included data gathering, assessment, and cleaning to ensure the dataset was free of quality and tidiness issues.

python data jupyter-notebook datawrangling

Updated Feb 22, 2023
Jupyter Notebook

WMF07 / WalmartSalesAnalysis---MySQL

Star

Data analysis to gain insight into the sales data of Walmart to understand the different factors that affect sales of the different branches.

mysql exploratory-data-analysis feature-extraction datawrangling

Updated Jul 13, 2023

hassanmujtaba7 / DataAnalytics_PortfolioProject

Star

SQL & Tableau - Portfolio Project

data sql businessintelligence datawrangling tableau datavisualization dataexploration

Updated Jul 9, 2023

alasdairgm / gender_pay_gap_analysis

Star

A analysis of the gender pay data across Scottish companies

r datawrangling hypothesis-testing dataanalysis datavisualisation

Updated Aug 9, 2023
HTML

yongpuitung / Spotify-Data-Preprocessing

Star

This project utilizes R to preprocess Spotify's "Unpopular Songs" and "Genre of Artists" datasets from Kaggle. Following tidy data principles, it handles duplicates, transforms variables, scans for outliers, and normalizes data. The resulting clean dataset is ready for statistical analysis, ensuring accurate and ethical data practices.

spotify data-science r statistics rstudio statistical-analysis datawrangling kaggle-dataset datacleaning datapreprocessing tidydata

Updated Jan 8, 2024
HTML

sturaro-ds / eda_data_wrangling_e-commerce

Star

Data Wrangling com Python para e-Commerce

python data-science commerce jupyter-notebook e-commerce datawrangling

Updated Mar 17, 2024
Jupyter Notebook

KoketsoMangwale / We-Rate-Dogs

Star

Gather data from various sources(csv, web scrape, json) and wrangle the data. Analysis and visualization of the twitter dog rating

json csv text datawrangling webscraping datacleaning vizualisation

Updated Sep 6, 2022
Jupyter Notebook

EmanueleCannizzaro / udacity_data_wrangling_mongodb

Star

Data Wrangling with MongoDB class code

udacity mongodb datawrangling

Updated Feb 1, 2019
Jupyter Notebook

ariesra92 / capstone_project_mef

Star

json machine-learning random-forest azure datawrangling tweepy decision-trees

Updated Mar 6, 2019
HTML

imRishabhGupta / Data-Wrangling

Star

This repo contains the code to download data and then extract it, if needed, and store it in a pickle file.

python data datawrangling pickle

Updated Mar 24, 2017
Python

abhijitkulkarni25 / Data_Wrangling_JSON

Star

json data json-data pandas data-analysis datawrangling

Updated Jul 16, 2017
Jupyter Notebook

Aisha-Ojey / TMDB-Movie-Dataset-Analysis

Star

This analysis examines a dataset of 10,000 movies from a movie database, revealing insights and trends in the industry. Notably, drama is the most popular genre, and factors like budget and popularity impact revenue. However, limited data, replaced null values, outliers, and correlation-causation considerations call for cautious interpretation.

exploratory-data-analysis datawrangling

Updated Aug 7, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the datawrangling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datawrangling topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

datawrangling

Here are 233 public repositories matching this topic...

harishrongala / Food_Choices

patmendoza330 / annotationwrangling

NAVEENDATAANALYST / Medical-Appointment-No-Show

mhashemihsmw / MLMOI

nzsaurabh / aggregatingdata

suprematis / US-Newborn-Name-Analysis

hallan6749 / WozUDataScienceStudentCodeTurnedIn

neerajmech57 / PYTHON-PROJECT

SindiAI / WeRateDogs

WMF07 / WalmartSalesAnalysis---MySQL

hassanmujtaba7 / DataAnalytics_PortfolioProject

alasdairgm / gender_pay_gap_analysis

yongpuitung / Spotify-Data-Preprocessing

sturaro-ds / eda_data_wrangling_e-commerce

KoketsoMangwale / We-Rate-Dogs

EmanueleCannizzaro / udacity_data_wrangling_mongodb

ariesra92 / capstone_project_mef

imRishabhGupta / Data-Wrangling

abhijitkulkarni25 / Data_Wrangling_JSON

Aisha-Ojey / TMDB-Movie-Dataset-Analysis

Improve this page

Add this topic to your repo