Monday, 12 February 2018

Data Update 1


  1. What dataset will you use for your final report? (describe your dataset and include a link to it)
    • The dataset that I will be using for my final report is the Domestic Opening Weekend Box Office From 1986-2018, which lists the #1 film of each consecutive year and how much money said film made at the box office. I'll most likely omit 2018, however, because that year literally just started.
  2. Describe the dataset. What kind of data does it contain?
    • The dataset contains the #1 film of each year, the amount of money each film made upon its opening, and the percentage of the film's total gross that the opening represents. The dataset also contains the number of theatres each film opened in and what I assume is the average amount of money each theatre made. Finally, the dataset lists the total gross of each film.
  3. Is there anything about your data that you don't understand? (i.e. what a column heading means). How will you find out?
    • I understand most of the data because I am obsessed with movies, but some of the headings are a bit vague.
    • The heading that reads, "Opening / % of Total," doesn't make it clear whether the table lists the revenue for opening day or opening weekend. To find this out, I could simply click on the link for one of the films on the website and check.
    • The heading that reads, "Theaters / Average," doesn't specify what the average is of. I could find this out by dividing each film's opening revenue by its number of theatres and comparing the result with the average listed in the table.
    • One final thing that I'm curious about is whether the figures for the older films are adjusted for inflation. I could find this out by doing a simple Google search for the oldest film in the table, with the keywords, "adjusted for inflation," and comparing the results with the values in the table.
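The "Theaters / Average" check described above could be sketched in Python roughly as follows. This is only a minimal sketch, assuming the average really is opening revenue divided by the number of theatres; the function names, the one-dollar tolerance, and the example numbers are all made up for illustration and are not taken from the dataset.

```python
def per_theatre_average(opening_gross: float, theatres: int) -> float:
    """Opening gross divided by the number of theatres the film opened in."""
    if theatres <= 0:
        raise ValueError("theatres must be positive")
    return opening_gross / theatres


def matches_listed_average(opening_gross, theatres, listed_average, tolerance=1.0):
    """True if the computed average is within `tolerance` dollars of the listed one."""
    computed = per_theatre_average(opening_gross, theatres)
    return abs(computed - listed_average) <= tolerance


# Placeholder numbers for illustration only; they are NOT from the dataset.
example_opening = 100_000_000
example_theatres = 4_000
example_listed_average = 25_000

print(matches_listed_average(example_opening, example_theatres, example_listed_average))
```

If the check passes for several rows, the "Average" column is most likely the per-theatre opening average; if it fails, the column probably means something else.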
  4. What are some questions you hope to answer with your data? List at least three.
    • Which #1 film had the highest total gross?
    • Which #1 film had the highest opening?
    • Which #1 film opened in the most theatres?
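Once the table is loaded, all three questions come down to finding the row with the largest value in one column. Here is a rough sketch of that idea in plain Python. The column names (`opening`, `theatres`, `total_gross`), the helper `top_by`, and every row below are hypothetical placeholders, not real box office figures.

```python
# Hypothetical rows; titles and numbers are placeholders, not real data.
films = [
    {"year": 2001, "title": "Film A", "opening": 90_000_000, "theatres": 3_600, "total_gross": 300_000_000},
    {"year": 2002, "title": "Film B", "opening": 110_000_000, "theatres": 3_500, "total_gross": 400_000_000},
    {"year": 2003, "title": "Film C", "opening": 70_000_000, "theatres": 4_100, "total_gross": 250_000_000},
]


def top_by(rows, column):
    """Return the row with the largest value in the given column."""
    return max(rows, key=lambda row: row[column])


# One lookup per question: highest total gross, highest opening, most theatres.
for column in ("total_gross", "opening", "theatres"):
    winner = top_by(films, column)
    print(f"Highest {column}: {winner['title']} ({winner['year']})")
```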

4 comments:

  1. Hi Taylor,

    This is an interesting dataset. Just by looking at the names of the films, one can get an idea of the type of movies the audience likes to watch. I wonder which country is considered domestic. Is this dataset for the Canadian or the US audience? I too am curious about the point you raised on inflation. A comparison would be interesting to see.

  2. Being an avid moviegoer myself, I am curious to see what answers you find in your data. In particular, the answer to the highest-grossing movie piques my interest: which filmmaker made the best movie for the lowest cost? This dataset would be fun to work with. Good find!

  3. I think one problem you will face is the lack of data to work with. However, it'll be interesting to see what you can come up with. I'm surprised that neither Titanic nor Avatar is on this list, as I know they are the two highest-grossing films of all time.

  4. This seems like an interesting topic and an interesting dataset to work with. Some things you may want to consider when doing research are the overall trends in Hollywood and some of the most popular movie genres.
