Movie recommendation system based on Collaborative filtering using Apache Spark
Uses ALS algorithm for Collaborative filtering. Dataset used: MovieLens Database
complete_dataset_url = 'http://files.grouplens.org/datasets/movielens/ml-latest.zip'
small_dataset_url = 'http://files.grouplens.org/datasets/movielens/ml-latest-small.zip'
Pre-requisite:
-
Python (3.X)
-
Apache Spark (Requires installation of Scala, JVM)
-
PySpark (Wrapper for Spark)