-
Big Data Startup Investments [Infographic]
This infographic is packed with good data. I especially enjoyed the section about big data startups that were acquired in 2013. Courtesy of: BigData-Startups Read more
-
Python And Graph Databases Slidedeck
Odessapy2013 – Graph databases and Python from Maksym Klymyshyn Read more
-
Online Data Science Book Club
An online data science book club is being formed. The group will be discussing a collection of books from O’Reilly. Read more
-
R vs Python, The Great Debate
Recently I have seen blogs/articles claiming Python is the best choice for data science and R is the new language for business. Honestly, both articles are truthful and good. Both Python and R are good. Why do we have to choose? Let’s use both. Here is my opinion. I prefer R to Python when performing… Read more
-
Neural Network and Deep Learning Book
Chapter 1 of Michael Nielsen’s Deep Learning Book is available online. The chapter provides an introduction to neural networks. When completed, the book will be completely free and open-source. You are welcome to contribute to the fundraising efforts for the book. Read more
-
Data Size Matters Infographic
Here is a great infographic from Data Science @ Berkeley. Just how big is a Gigabyte(GB)? Be sure to look all the way to the bottom. It mentions/explains a few of the latest innovations in hard drives, for example: helium, SMR, HAMR. You will have to scroll to the bottom to see what those acronyms… Read more
-
Security And Data Science
The topic of internet security has been around for many years, but recently the topics of data science and security have joined forces. Many security applications collect vast amounts of data. Also, many security application operate based upon activity. Data Science can help collect all the past activity and machine learning can be used to… Read more
-
Open Data Action Plan for France
This information goes along with the post last week, Open Data Could Be Worth $5.4 Trillion Annually. Just last week France released an action plan for open data. Honestly, I have not read the full report, but it is great to see a government create such a plan. See the full report below. Read more
-
Open Data Could Be Worth $5.4 Trillion Annually
Michael Chui of McKinsey Global Institute provided some clear insights about the benefits of opendata. Here are the 4 characteristics of open data provided by Chui: Access by Everyone Formatted for Easy Reading by a Computer Free(no cost) Unlimited Rights to redistribute and reuse Also, Chui describes how an organization can get the most from… Read more
-
Dryad: Scientific Data Repository
Earlier, I posted about Scientific Data, but unfortunately the site does not host any of the data. Enter Dryad, data hosting is exactly what the site does. The site hosts opendata and any other digital artifacts associated with a research project. Plus the site provides a DOI (Digital Object Identifier) for citing the the artifacts… Read more
-
Scientific Data: A new publisher of Data
Nature.com is starting a new publication titled, Scientific Data. The goal is to help researchers publish and discover data. The publication content is called a Data Descriptor. It describes the data, explains the data collection methods, lists the columns, and states other essential information about the dataset. Unfortunately, the site does not host any of… Read more
-
Strata + Hadoop World 2013 Live Stream
The 2013 Strata + Hadoop World Conference is currently going on in New York City. Many of the keynotes will have video streaming live on Tuesday and Wednesday (Oct 29 and 30). Watch and enjoy. If for some reason you cannot watch via the livestream, most of the keynotes are usually placed on Youtube shortly… Read more

