Blog

  • Confused on Hadoop? This link will help.

    Are you confused on what hadoop is? What about Hbase, Pig Hive? Well, this link will help you out. Hadoop Toolbox: When to Use What | SmartData Collective. It provides a nice short explaination for the following terms: Hadoop Hbase Hive Pig Sqoop Oozie Flume/Chukwa Avro Read more

  • Another Data Science Program in NYC (also online)

    Recently, both NYU and Columbia launched academic programs in data science. Well, another school in New York City is entering the mix. The City University of New York (CUNY) is now offering an online masters degree in data analytics. If you would like more information, there will be an online information session on May 22. Read more

  • Webinar on Data Science, May 14

    This looks to be a great webinar! It is today. Webinar on Data Science, May 14. Read more

  • Free Book on Big Data

    Jason and Jeremy Kolb of Applied Data Labs recently released a new book, Secrets of the Big Data Revolution. As of today, it is free on Amazon. I have just started it, and it is good so far. It is only free for a limited time. Read more

  • Harvard Data Science

    This Spring, Harvard University ran a data science course. Technically, the name of the course was Stat 221 Statistical Computing and Visualization. The course recently finished, and all the course lecture slides are available. The slides contain a bunch of useful information, plus they show one possible layout for a data science course. Read more

  • Enigma Launches for Open Public Data

    If you are looking for public data, Enigma.io is a new startup just for you. Enigma searches, finds, and connects a variety of formats of public data. The data is then linked and made accessible. Watch the video below for more details. Read more

  • Plot.ly a new online Graph Tool

    Plot.ly is a new site that allows for web-based plotting of graphs. The site allows a user to upload data, create a number of plots, and even write python code to generate custom graphs. Then the site has numerous export options for the graphs as well as options for sharing the graph via socia networks.… Read more

  • Columbia Data Science Certificate Program

    The Institute for Data Science and Engineering at Columbia University has released their first academic offering. It is a certificate program titled, Certification of Professional Achievement in Data Sciences. The certificate program consists of 4 courses: Algorithms for Data Science Probability & Statistics Machine Learning for Data Science Exploratory Data Analysis and Visualization Columbia is… Read more

  • Open Data Festival

    Launching in the autumn of 2013, Open Data Festival will be hosting a global data festival. The details are quite vague at this point, but they are looking for volunteers, cities, and speakers. Feel free to sign up. The festival is being organized by the same team that organizes Big Data Week. Read more

  • Gamification Data Science Video

    I thought this was a fun little video about gamification and data science, plus my 2 year-old was mesmerized by the video. It is worth 3 minutes to watch. Read more

  • What is Maching Learning

    Machine Learning is a term that can mean different things to different people. Andrew Ng, cofounder of Coursera and Professor at Stanford, provides two definitions in his popular Machine Learning Course. The first definition comes from Arthur Samuel around 1959. Field of study that gives computers the ability to learn without being explicitly programmed. The… Read more

  • Coursera Data Science Begins

    The highly anticipated Coursera class, Introduction to Data Science, started yesterday. It looks good so far. Why not join 72,000 other students interested in learning data science? Read more

data science 101 logo

Data Science 101

One of the oldest blogs on data science, started in 2012.

Threads Dev Interviews

Interviews with Developers on Threads