
A collection of practice projects on big data topics, including Hadoop, Spark, Spark Streaming, and Kafka.


druheendas/Big-Data-Analysis-Practice-Projects

 
 


Big Data Analysis Practice Projects

This repository collects the practice projects completed in a Big Data course. A description of each project is available in its folder.

Source

  • Hadoop
    • Common Friends
    • Top-10 Common Friends pairs
    • Yelp Dataset: information on the top 10 businesses by rating
    • Yelp Dataset Palo Alto businesses Rating
  • Spark
    • Common Friends
    • Top-10 Common Friends pairs
    • Yelp Dataset: information on the top 10 businesses by rating
    • Yelp Dataset Palo Alto businesses Rating
  • Spark Stream
    • Movie Clustering (Spark MLlib, K-means)
    • Collaborative filtering: measuring the accuracy (MSE) of an ALS model
    • App Data correction and App Prediction (KMM)
    • Twitter Sentiment Analyzer (Spark Streaming and Kafka)
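
As a rough illustration of the idea behind the Common Friends and Top-10 Common Friends pairs projects, here is a minimal sketch in plain Python of the usual map/reduce formulation. It is not the code from this repository: the function names `common_friends` and `top_pairs`, the dictionary input format, and the tie-breaking rule are all assumptions made for the example. See each project folder for the actual Hadoop and Spark implementations.

```python
from collections import defaultdict


def common_friends(adjacency):
    """Return {(a, b): mutual friends} for every pair of users who are friends.

    `adjacency` is a hypothetical input format: {user: [friend, ...]}.
    """
    # "Map" step: each user emits its own friend set under the key
    # (user, friend), with the key sorted so both directions collide.
    grouped = defaultdict(list)
    for user, friends in adjacency.items():
        for friend in friends:
            pair = tuple(sorted((user, friend)))
            grouped[pair].append(set(friends))

    # "Reduce" step: a pair that received two friend sets (one from each
    # side) has its mutual friends in the intersection of those sets.
    result = {}
    for pair, friend_sets in grouped.items():
        if len(friend_sets) == 2:
            result[pair] = friend_sets[0] & friend_sets[1]
    return result


def top_pairs(common, n=10):
    """Return the n pairs with the most mutual friends (ties broken by pair name)."""
    return sorted(common.items(), key=lambda kv: (-len(kv[1]), kv[0]))[:n]


# Small made-up friendship graph, for demonstration only.
graph = {
    "A": ["B", "C", "D"],
    "B": ["A", "C", "D"],
    "C": ["A", "B"],
    "D": ["A", "B"],
}
mutual = common_friends(graph)
best = top_pairs(mutual, n=1)
```

In a Hadoop or Spark job the two steps would typically correspond to a mapper (or `flatMap`) that emits the sorted pair as the key and a reducer (or `reduceByKey`) that intersects the friend sets.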


Languages

  • Python 51.9%
  • Java 46.0%
  • Shell 2.1%