Tools

The top four tools among European respondents were Excel, SQL, R, and Python, each used by over half of all respondents. These four tools have held the top positions in every Data Salary Survey we have conducted, and we see no sign of this changing. Almost every respondent reported using at least one of them, and about half the sample used three or all four.

Commonly used tools with above-average salaries include Scikit-learn (whose users have a median salary of €52K), Spark (€55K), Hive (€57K), and Scala (€70K). Readers may notice that most tools have a higher median salary than the sample-wide median salary of €48K. This is because respondents who use many tools tend to earn more, and each of them is counted in the salary median of every tool they use. The 43% of respondents who used no more than 10 tools had a median salary of €43K, while those who used more than 10 tools had a median salary of €53K.
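The overlap effect described above can be illustrated with a small sketch. The five respondents, their salaries, and their tool lists below are invented toy values, not survey data; they are chosen only to show how a few well-paid, many-tool respondents can pull every per-tool median above the sample-wide median.

```python
from statistics import median

# Hypothetical toy data (NOT from the survey): salary in €K and tools used.
respondents = [
    {"salary": 40, "tools": {"Excel"}},
    {"salary": 42, "tools": {"SQL"}},
    {"salary": 44, "tools": {"R"}},
    {"salary": 60, "tools": {"Excel", "SQL", "R", "Python"}},
    {"salary": 65, "tools": {"Excel", "SQL", "R", "Python"}},
]

# Sample-wide median: every respondent is counted exactly once.
overall = median(r["salary"] for r in respondents)

# Per-tool medians: a respondent is counted once for each tool they use,
# so many-tool (here, higher-paid) respondents appear in every tool's median.
all_tools = sorted(set().union(*(r["tools"] for r in respondents)))
per_tool = {
    tool: median(r["salary"] for r in respondents if tool in r["tools"])
    for tool in all_tools
}

print("sample-wide median:", overall)
for tool, value in per_tool.items():
    print(f"{tool}: {value}")
```

In this toy sample every per-tool median exceeds the sample-wide median, even though no single tool is responsible for the difference.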

Since there is significant overlap between users of individual tools, it is useful to consider mutually exclusive groups of respondents based on tool usage. The groups we will define here are based on a simple set of rules, but using a clustering algorithm would produce very similar results. The rules are:

  1. If someone uses Spark or Hadoop, we call them “Hadoop”

  2. If someone (not in the Hadoop group) uses R and/or Python, they are labeled “R+Python,” “R-only,” or “Python-only,” as appropriate

  3. Everyone else who uses SQL and/or Excel (usually both), we call “SQL/Excel”
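The three rules above can be sketched as a small function. This is a minimal illustration, not the code used for the survey: the function name `tool_group`, the lowercase tool spellings, and the `None` result for respondents who match no rule are all assumptions made for the example.

```python
def tool_group(tools):
    """Assign a respondent to one of the five mutually exclusive groups.

    `tools` is an iterable of tool names. Matching is case-insensitive.
    """
    used = {t.lower() for t in tools}

    # Rule 1: Spark or Hadoop takes precedence over everything else.
    if used & {"spark", "hadoop"}:
        return "Hadoop"

    # Rule 2: among the rest, split by R and/or Python usage.
    uses_r = "r" in used
    uses_python = "python" in used
    if uses_r and uses_python:
        return "R+Python"
    if uses_r:
        return "R-only"
    if uses_python:
        return "Python-only"

    # Rule 3: the remaining respondents who use SQL and/or Excel.
    if used & {"sql", "excel"}:
        return "SQL/Excel"

    # Assumption: the text does not say how respondents using none of
    # these tools are treated, so they are left unassigned here.
    return None


print(tool_group(["Spark", "R", "SQL"]))   # Hadoop
print(tool_group(["Python", "Excel"]))     # Python-only
```

Because the rules are checked in order, a respondent who uses both Spark and R lands in the Hadoop group, which is what keeps the groups mutually exclusive.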

The five resulting groups each contain between 13% and 26% of the sample. The Hadoop group reported the highest salaries (median: €56K), while the R-only group had the lowest (€42K). However, this doesn’t mean that knowing R means less pay: respondents using Python and R earned slightly more than those using Python and not R. Aside from salary, one important difference between the groups is experience. The SQL/Excel group (in other words, those who don’t use Python, R, Spark, or Hadoop) was the most experienced (8.3 years on average), followed by the R-only (7.3 years), Hadoop (6.3 years), Python-only (6 years), and R+Python (5.2 years) groups. Since we expect more-experienced data professionals to earn higher salaries, the median salary of €46K for the SQL/Excel group is actually quite low, while the €48K of the R+Python group is high.
