Site icon Ryan Swanstrom

Data Scientist Job Analysis

A few weeks ago, I posted 16 Companies Hiring Data Scientists Right Now. I decided to do a bit of analysis on the job posts, so I took all the job posting and compiled them into one file.

The Problem

I wanted to determine 2 things:

  1. What words occurred most often in the job posts?
  2. What words occurred in the most jobs posts?

The questions are similar, but if you read closely, they are different.   I wrote some Java code to answer those questions. The raw results are posted here.

Results

Honestly, nothing too surprising showed up. Not counting the common English words (and, to), the word data was the most popular. It occurred 167 times and it occurred at least once in all 16 job postings. That makes sense; a data scientist should know about data. I thought hadoop would occur in all job descriptions but it only appeared in 11 of the 16 job descriptions. Here are some other words I found interesting:

On an interesting note, Python and R occurred in more job postings than Java (2 more to be exact).

Does anything in the results strike you as interesting?

Exit mobile version