Data Scientists, Data Engineers, Software Engineers: The Difference According to LinkedIn

The differences between Data Scientists, Data Engineers, and Software engineers can get a little confusing at times. Thus, here is a guest post provided by Jake Stein, CEO at Stitch formerly RJ Metrics, which aims to clear up some of that confusion based upon LinkedIn data.

As data grows, so does the expertise needed to manage it. The past few years have seen an increasing distinction between the key roles tasked with managing data: software engineers, data engineers, and data scientists.

More and more we’re seeing data engineers emerge as a subset within the software engineering discipline, but this is still a relatively new trend. Plenty of software engineers are still tasked with moving and managing data.

Our team has released two reports over the past year, one focused on understanding the data science role, one on data engineering. Both of these reports are based on self-reported LinkedIn data. In this post, I’ll lay out the distinctions between these roles and software engineers, but first, here’s a diagram to show you (in very broad strokes) what we saw in the skills breakdown between these three roles:

Data Roles and Skill Sets
A comparison of software engineers vs data engineers vs data scientists

Software Engineer

A software engineer builds applications and systems. Developers will be involved through all stages of this process from design, to writing code, to testing and review. They are creating the products that create the data. Software engineering is the oldest of these three roles, and has established methodologies and tool sets.

Work includes:

  • Frontend and backend development
  • Web apps
  • Mobile apps
  • Operating system development
  • Software design

Data Engineer

A data engineer builds systems that consolidate, store, and retrieve data from the various applications and systems created by software engineers. Data engineering emerged as a niche skill set within software engineering. 40% of all data engineers were previously working as a software engineer, making this the most common career path for data engineers by far.

Work includes:

  • Advanced data structures
  • Distributed computing
  • Concurrent programming
  • Knowledge of new & emerging tools: Hadoop, Spark, Kafka, Hive, etc.
  • Building ETL/data pipelines

Data Scientist

A data scientist builds analysis on top of data. This may come in the form of a one-off analysis for a team trying to better understand customer behavior, or a machine learning algorithm that is then implemented into the code base by software engineers and data engineers.

Work includes:

  • Data modeling
  • Machine learning
  • Algorithms
  • Business Intelligence dashboards

Evolving Data Teams

These roles are still evolving. The process of ETL is getting much easier overall as new tools (like Stitch) enter the market, making it easy for software developers to set up and maintain data pipelines. Larger companies are pulling data engineers off the software engineering team entirely in lieu of forming a centralized data team where infrastructure and analysis sit together. In some scenarios data scientists are responsible for both data consolidation and analysis.

At this point, there is no single dominant path. But we expect this rapid evolution to continue, after all, data certainly isn’t getting any smaller.


Originally Posted

in

, ,

by

Last Modified:

Comments

32 responses to “Data Scientists, Data Engineers, Software Engineers: The Difference According to LinkedIn”

  1. S K Rath Avatar

    Software design made by Software Engineer is based on the requirements identified by Data engineerr or Data scientist. Data engineering and Data science both emerged as niche skill set within software engineering. All data engineers or data scientists were previously working as a software engineer, the architecture of software in Software design phase, can not be assessed without pre analysis of data, done by Data engineers or data scientists.

    1. Ryan Swanstrom Avatar

      “All data engineers or data scientists were previously working as a software engineer”… I do not completely agree with that statement. I know of some data scientists who have psychology backgrounds or stats backgrounds. However, the disciplines you mention do need a good understanding of data.

  2. Oded Dror Avatar
    Oded Dror

    Hi there,
    You can add T-SQL and PL-SQL to the database engineer

    Thanks

    1. Ryan Swanstrom Avatar

      Thanks for the suggestions.

  3. hamid Avatar
    hamid

    Well, I am a software engineer, and I do all three. That seems like a very limited definition of software engineering.

  4. Daniel Wollmann Avatar
    Daniel Wollmann

    The skill set between Data Engineers and Data Scientist is very different. Although both roles need to hack data, the Scientists need that strong background on Math and Statistics to be able to best use Machine Learning resources, define the benchmark models to use and overall lead the Analytics discussion with BDMs

    1. Ryan Swanstrom Avatar

      Thanks Daniel, I would agree in theory, but in practice the lines are more blurred.

      Ryan

  5. Bernard Clyde Avatar

    I think it is astounding how much data we can collect with the technology we have nowadays. It’s important that that data is understood correctly and in the right context by data scientists. I agree that more training is required as we gather more data in our technology saturated era.

    1. Ryan Swanstrom Avatar

      I agree, thank you for sharing.

      Ryan

  6. […] there, today I’m sharing a post from Data Science 101 guest author Ryan Swanstrom entitled Data Scientists, Data Engineers, […]

  7. Jay Avatar

    Great article, would you consider posting on https://imadata.ninja/blog/

  8. […] to Ryan Swanstrom a Data Engineer “builds systems that consolidate, store, and retrieve data from the various […]

  9. Chika Avatar
    Chika

    Great insight Ryan. I love the clarification.

  10. Chika Avatar
    Chika

    Good insight Ryan!

  11. Bruce Campbell Avatar
    Bruce Campbell

    This is a great article. Where would you consider adding Data Visualization? It’s such a huge part of the data engineer and data scientist roles.

    1. Ryan Swanstrom Avatar

      Well, the data here comes from LinkedIn, so it displays where the actual skills appear. Data Visualization was either not entered by enough people or not tracked.

      If I were to add it somewhere, I think it would be a skill within data scientist but outside the other two.

      Thanks,
      Ryan

  12. Balbir Avatar

    Thank you so much for sharing valuable information

  13. otobong james Avatar
    otobong james

    this is sensible and nice

  14. Lee Schlesinger Avatar
    Lee Schlesinger

    The Stitch blog now has a blog post on data scientist demographics called What does a data scientist REALLY look like? https://www.stitchdata.com/blog/what-does-a-data-scientist-really-look-like/

    1. Ryan Swanstrom Avatar

      Thanks, I will check it out.

  15. […] to the point and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  16. […] the point and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  17. […] the point and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  18. […] complicated particular person areas of your experience whenever you listing them. A report by Ryan Swanstrom demonstrates the comparability of abilities included on LinkedIn profiles for information […]

  19. […] and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  20. […] and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  21. […] and avoid confusing individual areas of your expertise when you list them. A report by Ryan Swanstrom demonstrates the comparison of skills included on LinkedIn profiles for data scientists, software […]

  22. […]                                         Source:  Ryan Swanstrom […]

Leave a Reply

Discover more from Ryan Swanstrom

Subscribe now to keep reading and get access to the full archive.

Continue reading