Blog

  • U of Washington Data Science Information Session Today

    A virtual information session about the Data Science Certificate Program at the University of Washington is today. If you are interested in the program, it is probably worth your time to attend. Read more

  • Two Relevant Coursera Courses Starting Today

    Web Intelligence and Big Data – Artificial Intelligence, NoSQL, map-reduce!! Oh My! I think this course will have some really good information. Gamification – At first glance, this course may not seem directly related to data science. However, gamification can lead to better user engagement, which can lead to tons more data for prediction. I… Read more

  • Free Social Network Analysis Textbook

    David Easley and Jon Kleinberg, both of Cornell University, have placed the contents of their social networking textbook online. All 24 chapters of Networks, Crowds, and Markets: Reasoning About A Highly Connected World are available for download. This could serve as a wonderful learning resource or an excellent reference tool. The material covered is quite… Read more

  • Nice Intro to Data Science Slides

    Paco Nathan put together a nice slide deck about Data Science for Enterprise Big Data. Slide 9 contains a great list of valuable skills for a data scientist. Also, it is worth going through the entire set of slides, since slides 48 and 49 contain a valuable list of tools and algorithms. Enjoy! Read more

  • Data Collective: Big Data Investment Fund

    Data Collective is a new investment fund. It is strictly for early-stage Big Data startups. See the TechCrunch announcement for more detailed information. Read more

  • UC Berkeley Big Data Bootcamp

    The University of California at Berkeley is hosting AMP Camp, the Big Data Bootcamp, starting today. The conference is sold out for in-person attendance, but registration is free and live streaming is available. The agenda looks good (including machine learning, parallel programming, Mesos, and hands-on exercises), so this might be a good opportunity for some… Read more

  • Scalable Machine Learning Course

    During Spring 2012, Alex Smola taught a course at Berkeley on Scalable Machine Learning. Alex is an Adjunct Professor at the University of California at Berkeley and a Visiting Scientist at Google. Alex was kind enough to put all the course materials on the internet. That includes papers, slides, links, and video lectures. Like… Read more

  • Strata Data in Motion Conference

    A while back, Strata hosted a web conference titled Data in Motion. The slides and audio are now available online. The conference focused on unique applications of data to things that move. Examples include trains, aerospace, and even car racing. The first talk, on Formula One car racing, was fascinating. I had never thought about… Read more

  • “Why Online Education Won’t Replace College — Yet” I Disagree And Here Is Why

    Recently, I read an article titled Why Online Education Won’t Replace College–Yet. The article is most likely a response to the recent success of Massive Open Online Courses (MOOCs) such as those offered by Udacity and Coursera. The author, David Youngberg, an Assistant Professor of Economics, presents 5 reasons why online education won’t replace college. I… Read more

  • Map of Kaggle Submissions

    See this interactive map of Kaggle Submissions. The map is a nice example of data visualization. The data is much easier to see on a map than in a data table. Nice work by Ramzi Ramey of Kaggle. Read more

  • Big Data for Organization (Infographic)

    Here is a nice infographic about some challenges of big data. It covers the problems that organizations face when dealing with the “three Vs” of big data: volume, variety, and velocity. Read more

  • Data Science in the NFL

    This is an excellent blog post about a unique use of data science. Data science can help football teams select better teammates. Read more

Data Science 101

One of the oldest blogs on data science, started in 2012.
