Starting from:

$30

Assignment 1 Dino Fun World

You, in your role as a burgeoning data explorer and visualizer, have been asked by the
administrators of a small amusement park in your hometown to answer a couple questions about
their park operations. In order to perform the requested analysis, they have provided you with a
database containing information about one day of the park's operations.
Provided Database
The database provided by the park administration is formatted to be readable by any SQL database
library. The course staff recommends the sqlite3 library. The database contains three tables, named
'checkins', 'attractions', and 'sequences'. The information contained in each of these tables is listed
below:
checkins :
­ Description: check­in data for all visitors for the day in the park. The data includes two types of
check­ins, inferred and actual checkins.
­ Fields: visitorID, timestamp, attraction, duration, type
attraction :
­ The attractions in the park by their corresponding AttractionID, Name, Region, Category, and type.
Regions are from the VAST Challenge map such as Coaster Alley, Tundra Land, etc. Categories
include Thrill rides, Kiddie Rides, etc. Type is broken into Outdoor Coaster, Other Ride, Carousel,
etc.
­ Fields: AttractionID, Name, Region, Category, type
sequences :
­ The check­in sequences of visitors. These sequences list the position of each visitor to the park
every five minutes. If the visitor has not entered the part yet, the sequence has a value of 0 for that
time interval. If the visitor is in the park, the sequence lists the attraction they have most recently
checked in to until they check in to a new one or leave the park.
­ Fields: visitorID, sequence
1
The database file is named 'dinofunworld.db' and is available in the read only directory of the Jupyter
Notebook environment (i.e. readonly/dinofunworld.db).
Assignment
The administrators would like you to answer four relatively simple questions about the park activities
on the day in question. These questions all deal with park operations and can be answered using the
data provided.
Question 1 : What is the most popular attraction to visit in the park?
Question 2 : What ride (note that not all attractions are rides) has the longest visit time?
Question 3 : Which Fast Food offering has the fewest visitors?
Question 4 : Compute the Skyline of number of visits and visit time for the park's ride and report the
rides that appear in the Skyline
Administrative Notes
This assignment will be graded by Coursera's grading system. In order for your answers to be
correctly registered in the system, you must place the code for your answers in the cell indicated for
each question. In addition, you should submit the assignment with the output of the code in the cell's
display area. The display area should contain only your answer to the question with no extraneous
information, or else the answer may not be picked up correctly. Each cell that is going to be graded
has a set of comment lines at the beginning of the cell. These lines are extremely important and
must not be modified or removed.(Graded Cell and PartID comments must be in the same line for
proper execution of code)
A correct submission would result in feedback as:”Correct!”
An incorrect submission would  look like: “ Incorrect Response! ”
2

More products