Employers’ Requirements for Data Scientists - an Analysis of Job Posts


  • Monica Mihaela Maer Matei National Scientific Research Institute for Labour and Social Protection
  • Anamaria Beatrice Aldea Researcher, National Scientific Research Institute for Labour and Social Protection, Bucharest




labor market, data scientist, text mining, job posts


Technological development and innovation are the main drivers of jobs transformations leading to skill mismatch. One very dynamic domain, dealing with these issues is data science. Generally, a data scientist has to work with big data in a scientific and creative manner. To reduce the drawbacks of a sparse matching between educational offer and the new requirements of the labour market is essential to understand real time job market requirements. The most relevant data source for such an investigation is represented by online job market portals. Nowadays, with the increasing digitalisation of society, these portals are considered to improve transparency and signalling in labour markets. Moreover, the potential of the textual vacancy data from Romanian online recruiting platforms has not been exploited up to now. Following these arguments, in order to understand employers’ requirements for data science jobs in Romania, we develop an analysis of textual data extracted from job advertisements dedicated to data scientists. Mainly the data analysis will involve the investigation of term frequencies and associations combined with relevant visualization tools. The research will reveal the employers’ needs and will support training providers like universities to adapt curricula and training programmes so that they provide what the labour market requires. Moreover, the findings of this research could support young people in making better training choices, signal important trends related to occupations and skills.

Author Biography

Anamaria Beatrice Aldea, Researcher, National Scientific Research Institute for Labour and Social Protection, Bucharest

Researcher, National Scientific Research Institute for Labour and Social Protection, Bucharest


Almaleh, A., Aslam, M. A., Saeedi, K., & Aljohani, N. R. (2019). Align My Curriculum: A Framework to Bridge the Gap between Acquired University Curriculum and Required Market Skills. Sustainability, 11(9), 2607.

Cao, L. (2017a). Data science: a comprehensive overview. ACM Computing Surveys (CSUR), 50(3), 43.

Cao, L. (2017b). Data science: challenges and directions. Communications of the ACM, 60(8), 59-68.

Cao, L. (2016). Data science: Nature and pitfalls. IEEE Intelligent Systems, 31(5), 66- 75.

CEDEFOP (2019a). Online job vacancies and skills analysis, A Cedefop panEuropean approach, ISBN: 978-92-896-2850-1 doi:10.2801/097022

CEDEFOP (2019b). The online job vacancy market in the EU: driving forces and emerging trends. Luxembourg: Publications Office. Cedefop research paper; No 72. http://data.europa.eu/doi/10.2801/16675

CEDEFOP (2018a). Real-time labour market information on skill requirements: setting up the EU system for online vacancy analysis.

CEDEFOP. (2018b). Mapping the landscape of online job vacancies. Background country report: Romania,http://www.cedefop.europa.eu/en/events-andprojects/projects/big-data-analysis-onlinevacancies/publications

Davenport, T. H., & Patil, D. J. (2012). Data Scientist: The Sexiest Job of the 21st Century-A new breed of professional holds the key to capitalizing on big data opportunities. But these specialists aren't easy to find—And the competition for them is fierce. Harvard Business Review, 70.

De Mauro, A., Greco, M., Grimaldi, M., & Nobili, G. (2016). Beyond data scientists: a review of big data skills and job families. Proceedings of IFKAD, 1844-1857.

Debortoli, S., Müller, O., & vom Brocke, J. (2014). Comparing business intelligence and big data skills. Business & Information Systems Engineering, 6(5), 289-300.

Feinerer, I., & Hornik, K. (2013). tm: Text Mining Package. R package version 0.5- 10. 2014-04-10]. http://CRAN. R-project, org/package= tm.

Feinerer, I. (2008). An introduction to text mining in R. The Newsletter of the R Project Volume 8/2, October 2008, 8, 19.

Föll, P., & Thiesse, F. (2017). Aligning IS Curriculum with Industry Skill Expectations: A Text Mining Approach.

Gao, L., & Eldin, N. (2014). Employers’ expectations: A probabilistic text mining model. Procedia Engineering, 85, 175-182.

Hershbein, B., & Kahn, L. B. (2018). Do recessions accelerate routine-biased technological change? Evidence from vacancy postings. American Economic Review, 108(7), 1737-72.

Ketamo, H., Moisio, A., Passi-Rauste, A., & Alamäki, A. (2019). Mapping the Future Curriculum: Adopting Artificial Intelligence and Analytics in Forecasting Competence Needs. In Proceedings of the 10th European Conference on Intangibles and Intellectual Capital ECIIC 2019. Academic Conference Publishing International.

Karakatsanis, I., AlKhader, W., MacCrory, F., Alibasic, A., Omar, M. A., Aung, Z., & Woon, W. L. (2017). Data mining approach to monitoring the requirements of the job market: A case study. Information Systems, 65, 1-6.

Lavrinenko, A., & Shmatko, N. (2019). Twenty-First Century Skills in Finance: Prospects for a Profound Job Transformation. Форсайт, 13(2 (eng)).

Maer-Matei, M. M., Mocanu, C., Zamfir, A. M., & Georgescu, T. M. (2019). Skill Needs for Early Career Researchers—A Text Mining Approach. Sustainability, 11(10), 2789.

Meyer, D., Hornik, K., & Feinerer, I. (2008). Text mining infrastructure in R. Journal of statistical software, 25(5), 1-54.

Modestino, A. S., Shoag, D., & Ballance, J. (2016). Downskilling: changes in employer skill requirements over the business cycle. Labour Economics, 41, 333-347.

Pater, R., Szkola, J., & Kozak, M. (2019). A method for measuring detailed demand for workers' competences. Economics: The Open-Access, Open-Assessment EJournal, 13(2019-27), 1-30.

Srivastava, A. N., & Sahami, M. (2009). Text mining: Classification, clustering, and applications. Chapman and Hall/CRC.

Voulgaris, Z. (2014). Data scientist: the definitive guide to becoming a data scientist. Technics Publications.

Wowczko, I. (2015, December). Skills and vacancy analysis with data mining techniques. In Informatics (Vol. 2, No. 4, pp. 31-49). Multidisciplinary Digital Publishing Institute.

Woolridge, R. W., & Parks, R. (2016). What's In and What's Out: Defining an Industry-Aligned IS Curriculum Using Job Advertisements. Journal of Higher Education Theory & Practice, 16(2).

Weihs, C., & Ickstadt, K. (2018). Data science: the impact of statistics. International Journal of Data Science and Analytics, 6(3), 189-194.