This website provides data-driven insights into audience sentiment at different stages of a movie's release. It also uses machine learning models to predict box office performance for upcoming films. The platform offers a unique look at public expectations both before and after release, presented through visualisations and a practical implementation of a model that combines sentiment analysis, historical financial data, and production information to forecast revenue outcomes.
Data sources
Reddit: Post-release audience sentiment collected from r/movies discussions.
YouTube: Pre-release audience sentiment collected from official movie trailer comments.
The Movie Database (TMDb): Movie metadata, including budget, runtime, release date, and actual revenue figures.
Company Revenue Data: Built using historical revenue information from The Movies Dataset (Kaggle).