Web Application: selecting a dataset

You will work with your web application team for this assignment.

The web application

For the remainder of the term, you'll be working on various aspects of a database-driven web application. Think about a site like Worldometer or the World Population Clock or Baseball Reference or IMDb or this ridiculous little toy for movie lovers. All of these concern themselves with various ways of searching and reporting on aspects of complex (or not-so-complex) datasets.

Our application will use a pretty typical setup:

There are performance, usability, and maintainability/extensibility tradeoffs in this structure that we'll discuss as we go along.

First step: pick a dataset

For the purposes of this project, you're going to start by selecting a dataset suitable for the pedagogical goals of the project. Normally, you would enter into a project knowing what data is involved, since there wouldn't be a project at all unless you or somebody else had an idea for what you wanted to build. But class projects are a little weird. Let's roll with it.

As we discussed last week in our discussion groups, we want data that has the following properties:

Where can you find interesting data?

You might be tempted to restrict your focus to Kaggle. You can do that, but Kaggle datasets are extremely variable in quality and interest. So don't stop there.

Here are some other places to look:

Your tasks

Have fun!