Movie Industry Analysis
Exploring four decades of movie data to identify key factors driving box-office success.
Project Snapshot
Explored over four decades of movie data to identify key factors influencing film success, including budget, director, genre, and ratings. Built multiple linear regression models to quantify the impact of each variable on gross revenue.
Problem
The film industry involves massive financial investments with uncertain returns. Identifying which measurable factors — budget, genre, director track record, audience ratings — are most predictive of commercial success can inform production and investment decisions.
Approach
- Collected and cleaned a large dataset spanning 40+ years of movie releases
- Conducted exploratory data analysis and correlation analysis
- Built multiple linear regression models to assess variable impact on gross revenue
- Compared feature importance across budget, vote count, genre, and runtime
Tech Stack
Python, Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, Jupyter Notebook
Results
- Higher budget and vote count identified as the strongest predictors of commercial success
- Genre and runtime showed weaker associations with gross revenue
- Regression models provided actionable insights into film industry economics