Sunday, 25 February 2018

Data Update 2 - Domestic Box Office

Lead

Star Wars: The Force Awakens was the highest grossing film from the years 1997-2017, and had the highest opening weekend. Godzilla (1998) is the lowest in terms of both total gross and opening weekend.

Excel Workbook Link and Explanation

Original workbook.

The "Slice" tab of my spreadsheet is organized by "Total Gross", from largest to smallest. This shows that Star Wars: The Force Awakens had the highest total gross, and Godzilla had the lowest. If you were to organize the spreadsheet by "Opening Weekend", both films are in the exact same spot.

Original Dataset Link

Original dataset.

News Story/Study Link and Brief Summary

'Star Wars: Force Awakens' First Ever to Cross $900 Million Domestically

This news story shows that Star Wars: The Force Awakens is the first film ever to cross $900 million domestically at the box office. It then goes on to say that the film has passed both Avatar and Titanic. The article is capped off with two top five record holders lists, one being domestic box office, and the other worldwide.

Monday, 12 February 2018

Data Update 1


  1. What dataset will you use for your final report? (describe your dataset and include a link to it)
    • The dataset that I will be using for my final report is the Domestic Opening Weekend Box Office From 1986-2018, which lists the #1 film of each consecutive year and how much money said film made at the box office. I'll most likely omit 2018, however, because that year literally just started.
  2. Describe the dataset. What kind of data does it contain?
    • The dataset contains the #1 film of each year, the amount of money a film made upon its opening and that percentage of the film's total gross. The dataset also contains the amount of theatres each film opened at and what I assume is the average amount of money theatres made. Finally, the dataset lists the total gross of each film.
  3. Is there anything about your data that you don't understand? (i.e. what a column heading means). How will you find out?
    • I understand most of the data due to the fact that I am obsessed with movies, but some of the headings are a bit vague.
    • The heading that reads, "Opening / % of Total," doesn't make it clear whether or not the table lists the revenue for opening day or opening weekend. To find this out, I could simply click on the link of one of the films on the website.
    • The heading that reads, "Theaters / Average," doesn't specify what the average is of. I could find this out by dividing the amount of theatres from the value of each revenue in the table.
    • One final thing that I'm curious about is if the older films are adjusted for inflation. I could find this out by doing a simple google search of the oldest film in the table, with the keywords, "adjusted for inflation," and comparing the values in the table.
  4. What are some questions you hope to answer with your data? List at least three.
    • Which #1 film had the highest total gross?
    • Which #1 film had the highest opening?
    • Which #1 film opened in the most theatres?