A content-based recommendation system that suggests similar movies using cosine similarity and NLP techniques. The system analyzes movie features like genres, director, cast, and keywords to provide personalized suggestions.
- Data Processing: Combines multiple features (genres, director, cast, keywords) into a single text vector
- Vectorization: Uses
CountVectorizerto convert text into a numerical matrix - Similarity Scoring: Computes cosine similarity between movies to find the closest matches
- Recommendation: Returns the most similar movies to any given input title
- Built with Python's scikit-learn ecosystem (
pandas,numpy,sklearn) - Implements cosine similarity for measuring movie similarity
- Processes the Movie Dataset containing comprehensive movie metadata
- Clone the repository
- Install dependencies:
pandas,numpy,scikit-learn - Run the script and input your favorite movie
- Get 10 most similar movie recommendations
python movie_recommender.py- Handles missing data gracefully
- Customizable feature combination
- Efficient similarity scoring
- Easy-to-understand recommendation output