Why Start a Data Science Project?

A common popular technique for learning data science is starting a project. Here are the 3 E’s for why building a data science project is a good idea.

1. Experience

A data science project will expose you to all the stages of the data science process. You will need to start with the identification of an interesting question or problem. Then you will have to find and collect the necessary data. After that, cleaning and modeling the data is important. Finally, the result needs to be presented (this is called deployment).

Employers always want experience, and a project can provide that.

2. Education

Once a project is started, there is always something new to learn. For example, the project will have data. Where do you store that data? It might need a database. Should you use a cloud database or install one locally. You will have to learn how to do that. Another example, after you have collected your data, you might realize some rows are incomplete. Then you will have to look into methods for dealing with missing data. This will require more learning.

A project provides a better learning environment than a list of courses because each new thing you learn has a reason and a purpose. To follow the examples from above: You know why you are creating a database and you know why missing data is important.

3. Enjoyment

Make sure to pick a project that is interesting to you. This decision will make the project more enjoyable. If you enjoy sports, maybe build a project around fantasy sports. Sports are filled with data. If you enjoy reading, build something around books, authors, or magazines. If exercise is your thing, build an app to predict your progress. Plus, if you find an answer to your question, it is always fun to solve something.

This post goes along with the video here, . Got a question you want answered. Please ask in the comments.



7 responses to “Why Start a Data Science Project?”


    Please inform me which very popular and challenging data sets we can get better experiences, education and enjoyment (with R/Python/SQL/NoSQL and Machine Learning). Thanking you

      1. Joseph Woolf Avatar

        Hi Ryan,

        Thank you for the list of potential projects. After working on a few projects, what would be the best way to present them in a portfolio?


      2. Ryan Swanstrom Avatar

        Great job working on some projects. I suggest putting any code on github. Also, if you can demo anything in a web browser do that and share links with people. Also, it is always a good idea to write a blog post about your projects (even if they cannot be demoed).


  2. John Giorgio Avatar
    John Giorgio

    Good timing on your post – I was just pondering how to spend some time between terms in a constructive manner. May have to find someone local who has a challenge that could become a Data Science project

  3. shanjames Avatar

    Very useful information & guiding blog on an esoteric topic like Data Science. Thanks for such an interesting and wonderful blog. The list of Data Science Blogs you shared with us.

Leave a Reply