Blog

  • Top 5 Data Science Gals

    Hilary Mason Hilary is the Chief Scientist at Bitly. She is a frequent speaker at conferences. She is commonly cited, interviewed and referenced in data science news/blogs/articles. Cathy O’Neil Cathy is better known to the internet world as mathbabe. She is a blogger (although not strictly about just data science), conference speaker, and soon to… Read more

  • Top 5 MOOCs for Data Science

    Course Organization Notes Machine Learning Coursera (Standford) One of the first MOOCs Intro to Data Science Coursera (U of Washington) Starts in April 2013 Intro to Statistics Making Decisions Based on Data Udacity Enroll anytime Introduction to Infographics and Data Visualization Knight Center @ U of Texas Starts January 12, 2013 Learning From Data CalTech… Read more

  • Good List! Read more

  • Top 5 Data Science Blogs

    p-value.info – This blog is only about 1 month old, but it is filled with great stuff.  I just hope Carl , a data scientist at One Kings Lane, can keep up the good posts. Metamarkets Blog – Metamarkets is a startup focusing on data analytics for business users.  The blog contains lots of data science information.  During the… Read more

  • Top 5 Places to Get a Data Scientist job

    LinkedIn They turn data into products better than anyone else. Facebook If you are the type of person that loves to analyze people’s lives, there is no better place. Twitter Duh, It’s Twitter. lots of data and lots of possibilities Cloudera Cloudera is a successful Hadoop-based startup. Build tools and explore huge datasets for a… Read more

  • Top 5 Data Startups

    Kaggle They make data science a sport, enough said. DataKind DataKind may not technically be a startup because it is a nonprofit, but they are doing cool stuff.  They match nonprofit organizations with people that love to analyze data and create visualizations. Cloudera They call themselves “The Platform for Big Data”.  They are working hard… Read more

  • In 2013, Learn Data Science via Coursera (a curriculum)

    Coursera has some excellent courses coming up in 2013. Here are some potential curriculum paths for someone looking to learn data science. Prerequisites Either sequence requires/recommends some basic programming experience. If you are unfamiliar with programming, you still have a couple weeks to get familiar with some basic programming concepts. Some good places to start… Read more

  • 2 Recently Released Open Source Graph-Related Projects

    GraphBuilder Intel Labs built a tool for constructing mathematical graphs out of large datasets. It is Java based and works with Hadoop and MapReduce. Intel has release a whitepaper explaining more about GraphBuilder. The code is available on Github. A big thanks to Mark Nickel for pointing out this project. ArangoDB ArangoDB is a flexible… Read more

  • What Makes a Good Data Scientist?

    This is a very quick and informative video about data science. What is data science? What makes a good data scientist? DJ Patil does an excellent job answering both those questions. Here are his answers for what makes a good data scientist: Story Telling Curiousity Read more

  • I think the information is interesting, but I also think the charts do a good job of telling the story. Read more

  • Explain Data Science to Anyone

    When telling friends and family that I blog about data science, I am frequently asked to explain more. I usually respond with an answer similar to this: You know the world is generating huge amounts of data everyday due to financial transactions, medical records, social networks, and other internet uses. Data Science aims to make… Read more

  • This is a nice graphic showing where data science is being taught. It appears that data science is being taught all over the country. Read more

data science 101 logo

Data Science 101

One of the oldest blogs on data science, started in 2012.

Threads Dev Interviews

Interviews with Developers on Threads