-
Data Science at Engine Yard | Engine Yard Blog
This is very good read about data science at Engine Yard. It covers the following topics: What is a data scientist? What does a data scientist do? What are the technologies? Realities of Being a data scientist Data Science at Engine Yard | Engine Yard Blog Unfortunately, as of 2019, the blog post has been… Read more
-
Free Bayesian and Machine Learning Textbook
David Barber, Computer Science Professor at University College London, is still offering his textbook, Bayesian Reasoning and Machine Learning, for free. This text looks quite extensive. The website also includes matlab code for many of the algorithms in the book. Read more
-
100 Machine Learning Videos
Here is a list of 100 machine learning videos from VideoLectures.net. Read more
-
Startup Showcase – How did I do?
Yesterday, I made some predictions about the startups I thought would win at the Strata Startup Showcase. Here are the winners. Fred Wilson selected Placed Tim O’Reilly selected Privacy Analytics The audience selected Hadapt So how did I do? Well, I got one of the winners correct. I selected Placed. Hopefully videos of the demos… Read more
-
Data Startup Showcase
As part of New York City Big Data Week, a startup showcase is being offered. It will consist of 14 startups. Each startup will get to give a quick demo/presentation. Then Tim O’Reilly and Fred Wilson will select 3 winners. Also, numerous investors and journalists will be present. A complete list of the startups presenting… Read more
-
Hadoop World/Strata Conference
The 2012 edition of Hadoop World and Strata Conference is underway. The conference is in New York City and if you are not lucky enough to attend, then at least you can watch the live video feed. Read more
-
3 Secrets for Aspiring Data Scientists | Software Advice
Michael Koploy wrote 3 Secrets for Aspiring Data Scientists about what it takes to enter a career as a data scientist. He lays out 3 steps: Sharpen Your Scientific Saw – Hone your math and science skills Learn the Language of Business – Data Scientists need to explain the data in business terms Keep Adding… Read more
-
Free Bayesian Statistics Textbook
Think Bayes by Allen B. Downey is another free book available from Green Tree Press. Allen B. Downey is a computer science professor at Olin College. The book is currently available in PDF or HTML. The book is not yet complete, so it may contain some errors. Read more
-
What is a Data Scientist?
A nice short and sweet video about what a data scientist is. Josh Wills of Cloudera defines a data scientist as follows: Person who is better at statistics than any software engineer and better at software engineering than any statistician. I would say that definition is pretty good. Read more
-
Real-Time Machine Learning for Industry
Michael Cutler, cofounder of TUMRA, gave a nice talk to the University of Oxford Computer Science Department. The following quote from his talk sums up his idea. Given a choice between a “best guess” now, and a “marginally better” answer later, I’d take the best guess every time. Many times, academic people focus a lot… Read more
-
How Good Is Your Medicine?
This is a great talk about how much clinical trial data is never published. It is a bit scary but definitely something people should be knowledgeable about. Read more

