There are different views about what Data Science is. The following links give you a flavour of the views:
- Interview with Hadley Wickham, Chief Scientist of RStudio (2013).
- Data science: how is it different to statistics? by Hadley Wickham (2014)
- RSS President’s address 2015 - “Statistics: a Data Science for the 21st Century” Video
- RSS “Data Science and Statistics: different worlds?“” Video
I could argue that “Data Science” is “Applied Statistics” but instead I prefer to define it in three parts:
- Statistics and Data Programming
- Data Context
Nothing controversial or unexpected so far. Using this definition, an “Applied Statistician” and “Data Scientist” combine “Statistics” and “Statistical and Data Programming” and might be knowledgable about the “Data Context” area. However, the latter is usally the role another party or parties that the “Applied Statistician” / “Data Scientist” collaborates with. The collaboration is an essential part of the definition.
Another way to look at it is, when there are different people working in each of the three areas defined above then they collectively work in “Data Science”.
What’s in a name? that which we call a rose by any other word would smell as sweet; Romeo and Juliet William Shakespeare
I really don’t care if you call me a Data Scientist, Applied Statistician, Statistician, Statistician and Statistical Programmer… as I will continue to enjoy analysing and processing data however messy it is 😃