-
Spark Summit 2015 Livestream
Apache Spark is currently one of the hottest technologies in data science. That popularity makes Spark Summit 2015 one of the top conferences. Luckily, the conference organizers were kind enough to set up a free Spark Summit 2015 Livestream of the event. Here is a small glimpse of what will be covered:… Read more
-
Tools For Writing a Data Science Dissertation
It can be a long and difficult task. It takes dedication, a good topic, a helpful advisor, some meetings, and a bit of paperwork. However, it is not impossible, and here are some tools to make it easier (hopefully). This is not intended to be a guide for selecting a topic. I am not qualified… Read more
-
2015 Summer of Data Science Learning
The Twitter hashtag #SoDS is being used in 2015 to help people track and share what they are learning. The hashtag originated on the Becoming Data Scientist blog. I recently wrote a post for Sense about a number of freely available learning opportunities this summer, Start Learning with the Summer of Data Science. The post… Read more
-
Data Science, Startups, and Sex Trafficking
What is Sex Trafficking? It is a form of human trafficking. According to Wikipedia, human trafficking is the trade in humans, most commonly for the purpose of sexual slavery, forced labor or commercial sexual exploitation for the trafficker or others; or for the extraction of organs or tissues, including surrogacy and ova removal; or… Read more
-
Learn Apache Spark this Summer with edX
edX has just announced a new series of Big Data courses. The series consists of 2 courses focused on Apache Spark. If you are not familiar with Spark, it is a very fast engine for large-scale data processing. It claims to perform up to 100 times faster than Hadoop. Here are the 2 courses: Introduction… Read more
-
Scoring A Software Development Organization With A Single Number
I just finished my PhD in the Computational Science and Statistics program at South Dakota State University. My dissertation focused on the area of software analytics, sometimes called Data-Driven Software Engineering. Specifically, how does a Software Development Organization evaluate itself? Students have a G.P.A. (Grade Point Average), but organizations do not have a similar evaluation… Read more
-
Free Book, Mining Massive Datasets, 2nd Edition
A new edition of Mining Massive Datasets is now available. It is used for a number of data mining courses at colleges across the US (and globe). Here are just a few of the topics from the book: Map-reduce, Clustering, Recommendation Systems, Dimensionality Reduction, and Social Network Analysis. Read more
-
The New Open Data Handbook
Originally published in 2012, the Open Data Handbook has released a second edition. The handbook serves as a guide for organizations or individuals interested in publishing and/or utilizing open data. The goal is to ensure that data is open and that it is applied as often as possible. The second edition now includes 3… Read more
-
Data Science Wars: R vs. Python
The great team over at DataCamp, an online site for learning R, has put together another wonderful infographic. This time, the topic is Data Science Wars (R versus Python). This has been a rather hot topic for quite some time. I even wrote about the debate back in 2013, R vs Python, The Great… Read more
-
Deep Learning in 2015 at Oxford
Nando de Freitas taught a deep learning course at the University of Oxford. All of the videos are freely available. The playlist is a bit out of order, but starting with Lecture 1 is probably the best approach. Read more
-
Data Science Tech Institute Visiting Faculty
The Data ScienceTech Institute (DSTI) in France is starting 2 new master’s degree programs in data science. Both programs are highly innovative and offer a strong industry focus. Classes begin in October 2015, and each program is limited to 30 students. Therefore, if you are interested, it is important to apply as soon as possible.… Read more
-
Free Deep Learning Book
Yoshua Bengio, Ian Goodfellow and Aaron Courville are writing a deep learning book for MIT Press. The book is not yet complete, but the drafts of the chapters are all available online. The authors are also collecting comments about the chapters before the book goes to press. The book is broken into 3 sections: Math… Read more

