ABOUT

This website provides data-driven insights into audience sentiment at different stages of a movie's release, and uses machine learning models to predict box office performance for upcoming films. It shows public expectations before release and audience reactions after it, presented through visualisations and a working implementation of a model that combines sentiment analysis, historical financial data, and production information to forecast revenue.

Data sources

Reddit: Post-release audience sentiment collected from r/movies discussions.

YouTube: Pre-release audience sentiment collected from official movie trailer comments.

The Movie Database (TMDb): Movie metadata, including budget, runtime, release date, and actual revenue figures.

Company Revenue Data: Historical company revenue information compiled from The Movies Dataset (Kaggle).
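
As a rough illustration of how the four sources listed above might be joined into a single input row for a revenue model, here is a minimal Python sketch. It is not the site's actual pipeline: the MovieFeatures class, the build_features function, every field name, and the assumption that records are keyed by movie title are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class MovieFeatures:
    """One model input row. All field names are hypothetical."""
    title: str
    trailer_sentiment: Optional[float]    # pre-release, from YouTube trailer comments
    reddit_sentiment: Optional[float]     # post-release, from r/movies discussions
    budget: Optional[float]               # from TMDb metadata
    runtime: Optional[float]              # from TMDb metadata
    company_avg_revenue: Optional[float]  # from the company revenue data


def build_features(
    title: str,
    youtube: Dict[str, float],
    reddit: Dict[str, float],
    tmdb: Dict[str, dict],
    company_revenue: Dict[str, float],
) -> MovieFeatures:
    """Join the per-source lookups on movie title (an assumed key).

    Any value missing from a source is left as None so that a later
    step can decide how to impute or drop it.
    """
    meta = tmdb.get(title, {})
    company = meta.get("company")
    return MovieFeatures(
        title=title,
        trailer_sentiment=youtube.get(title),
        reddit_sentiment=reddit.get(title),
        budget=meta.get("budget"),
        runtime=meta.get("runtime"),
        company_avg_revenue=company_revenue.get(company),
    )
```

Keeping missing values as None rather than defaulting them to zero is a deliberate choice in this sketch, since a source with no data for a title would otherwise be indistinguishable from a genuinely neutral or zero value.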