Skip to content

CppChan/Movie_Topic_Clustering_and_Recommendation

Repository files navigation

Movie Topic Modeling based on NLP and Recommendation System Design

MovieCluster_nlp.ipynb

  • Used tokenizing, stemming and stop-words removing to preprocess user review text, and extracted features via TF-IDF
  • Trained unsupervised learning models including K-Means Clustering and Latent Dirichlet Allocation to implement Latent Topic Clustering
  • Visualized clustering results by dimensionality reduction using Principal Component Analysis.

Recommendation_Pyspark

Spark OLAP analysis
  • Analysis of the movie rating dataset and information extraction
  • Collaborative Filtering on Spark implementation( ALS model )
  • Personal Movie Recommendation
  • MSE Curves analysis and parameter tuning
Local Collaborative Filtering
  • ALS Collaborative Filtering Algorithm
  • MSE Curves analysis and parameter tuning
  • ROC Curves Analysis

Recommendation_Spark_Scala

  • encoding or raw rating data
  • itembased collaborative filtering algorithm
  • tune parameter for lower RMSE

Recommendation based on AutoEncoder-decoder Model

  • Details in README

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors