-
Visualization Of Data Science Twitter Users
This is a fun and interactive visualization of 659 twitter accounts linked to data science. http://www.greenplum.com/datasciencesummit/community/ Read more
-
Data Science Summit Videos
The videos for the 2012 Data Science Summit organized by Greenplum are now available. See the videos Read more
-
Bitmarks: Bitly’s Data Science One URL At A Time
Earlier this week, Bitly launched a new bookmarking service. They call links/URLs bitmarks instead of bookmarks. It has a nice Chrome Extension and Bitmarklet. So far, I very much like the service. So, Why Should You Care? Well, at its core, Bitly is a data science company. This is just another way for Bitly to… Read more
-
Increase Your Kaggle Score With a Random Forest
Previously, I blogged about submitting your first solution to Kaggle for the Biological Response Competition. Well, that technique used Logistic Regression and the resulting score was not very good. Now, let’s try to improve upon that score. In this example, we will use what is called a Random Forest. Kaggle claims that random forests have… Read more
-
This helps to solidify MongoDB as the leading NoSQL database. Having used MongoDB myself, I can see why they were able to raise so much funding. MongoDB is fast, simple, and flexible. Read more
-
An Infographic for Taming Big Data
The gorilla may be a bit too much, but the rest of the infographic contains valuable information. What do companies do with bigdata? Also, spending on bigdata is set to increase rapidly in the next few years. Via: Wikibon Big Data Read more
-
Religion and the Number of Babies: A Hans Rosling TedTalk
Hans Rosling does it again. Hans may be the best storyteller of data on earth. He has a real gift of turning data into an exciting story. He also asks great questions and has some good wit. The whole goal of this TedTalk is to answer the following question. Do women of certain religions have… Read more
-
Your First Kaggle Submission
Yesterday, I wrote a post explaining the Kaggle Biological Response competition. If you don’t know, Kaggle is a website for data science competitions. Now it is time to submit a solution. After this post, you should have a spot on the Leaderboard. Granted, it will not be first place but it won’t be last place… Read more
-
Get Started With Kaggle – Description
Yesterday, I posted about the popularity of data hackathons. Well, today let’s get started with Kaggle. This is the first of a few simple posts about making your first submission to a Kaggle competition. I also promise you won’t be last place. You won’t be first either. This is an excellent way to start developing… Read more
-
Hackathons with Data are Everywhere
It seems that competitions and meetups for hacking data are all over the place. Coding challenges have been around for a long time. Recently, it appears that data is being thrown into the mix. I think the idea is great. Instead of just hacking some app, why not hack with some data that might help… Read more
-
Data Science Training Program in New York
If you are in New York City or the surrounding area and you want to learn data science, this post is for you. General Assembly; a technology, design, and entrepreneurship campus in New York City; is running a 12-week Intensive Program in Data Science. The course consists of lectures (twice a week), labs, homework, and… Read more
-
Challenge To Future Developers: Start Storing More Data
Dear Future Developers Please store as much data as possible. Do not worry about the cost of the extra storage disks. The value in the data will far outweigh the cost of the hardware. Here are some examples of data that could be stored but is typically not. Start storing data about the order in… Read more

