Skip to content

Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.

Notifications You must be signed in to change notification settings

jacobceles/Movie-Recommendation-Rating-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Movie Recommendation & Rating Prediction

In this project, we have used the MovieLens 100k dataset to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it. Surprise is used to develop the models, and the dataset itself is open for public use since 1998, and has 100,000 ratings from 943 users on 1682 movies. Each user has rated at least 20 movies, and simple demographic info for the users (age, gender, occupation, zip) are given.

Problem Statement

  1. Data preprocessing and split, create a training dataset and a testing dataset for experiment
  2. Rating prediction, develop an algorithm to predict the ratings in the testing set based on the information (ratings and others) in the training dataset, and evaluate the predictions based on MAE and RMSE.
  3. Item Recommendation, construct a recommendation list for each user, and then evaluate the recommendation quality based on precision, recall, and F-measure.

Implementation

High Level Design

Repository Structure

├── data                                 # Data used for the project
├── Documents                            # Holds documents realted to the project
├── README.md                            # Read this first
└── code.ipynb                           # The code used for building the application

How to Use

  1. Open code.ipynb in you jupyter installation.
  2. Run all the cells. The code inside will handle installing the required packages.
  3. If needed, you can easily modify the last cell to get recommendations for specific users.

Results

RMSE and MAE scores Precision, Recall, and F-score
error_scores.png precision_recall.png

Recommendations

Movie recommendation

Contributors

References

About

Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published