Employers’ Requirements for Data Scientists - an Analysis of Job Posts
Keywords:labor market, data scientist, text mining, job posts
AbstractTechnological development and innovation are the main drivers of jobs transformations leading to skill mismatch. One very dynamic domain, dealing with these issues is data science. Generally, a data scientist has to work with big data in a scientific and creative manner. To reduce the drawbacks of a sparse matching between educational offer and the new requirements of the labour market is essential to understand real time job market requirements. The most relevant data source for such an investigation is represented by online job market portals. Nowadays, with the increasing digitalisation of society, these portals are considered to improve transparency and signalling in labour markets. Moreover, the potential of the textual vacancy data from Romanian online recruiting platforms has not been exploited up to now. Following these arguments, in order to understand employers’ requirements for data science jobs in Romania, we develop an analysis of textual data extracted from job advertisements dedicated to data scientists. Mainly the data analysis will involve the investigation of term frequencies and associations combined with relevant visualization tools. The research will reveal the employers’ needs and will support training providers like universities to adapt curricula and training programmes so that they provide what the labour market requires. Moreover, the findings of this research could support young people in making better training choices, signal important trends related to occupations and skills.
Almaleh, A., Aslam, M. A., Saeedi, K., & Aljohani, N. R. (2019). Align My Curriculum: A Framework to Bridge the Gap between Acquired University Curriculum and Required Market Skills. Sustainability, 11(9), 2607.
Cao, L. (2017a). Data science: a comprehensive overview. ACM Computing Surveys (CSUR), 50(3), 43.
Cao, L. (2017b). Data science: challenges and directions. Communications of the ACM, 60(8), 59-68.
Cao, L. (2016). Data science: Nature and pitfalls. IEEE Intelligent Systems, 31(5), 66- 75.
CEDEFOP (2019a). Online job vacancies and skills analysis, A Cedefop panEuropean approach, ISBN: 978-92-896-2850-1 doi:10.2801/097022
CEDEFOP (2019b). The online job vacancy market in the EU: driving forces and emerging trends. Luxembourg: Publications Office. Cedefop research paper; No 72. http://data.europa.eu/doi/10.2801/16675
CEDEFOP (2018a). Real-time labour market information on skill requirements: setting up the EU system for online vacancy analysis.
CEDEFOP. (2018b). Mapping the landscape of online job vacancies. Background country report: Romania,http://www.cedefop.europa.eu/en/events-andprojects/projects/big-data-analysis-onlinevacancies/publications
Davenport, T. H., & Patil, D. J. (2012). Data Scientist: The Sexiest Job of the 21st Century-A new breed of professional holds the key to capitalizing on big data opportunities. But these specialists aren't easy to find—And the competition for them is fierce. Harvard Business Review, 70.
De Mauro, A., Greco, M., Grimaldi, M., & Nobili, G. (2016). Beyond data scientists: a review of big data skills and job families. Proceedings of IFKAD, 1844-1857.
Debortoli, S., Müller, O., & vom Brocke, J. (2014). Comparing business intelligence and big data skills. Business & Information Systems Engineering, 6(5), 289-300.
Feinerer, I., & Hornik, K. (2013). tm: Text Mining Package. R package version 0.5- 10. 2014-04-10]. http://CRAN. R-project, org/package= tm.
Feinerer, I. (2008). An introduction to text mining in R. The Newsletter of the R Project Volume 8/2, October 2008, 8, 19.
Föll, P., & Thiesse, F. (2017). Aligning IS Curriculum with Industry Skill Expectations: A Text Mining Approach.
Gao, L., & Eldin, N. (2014). Employers’ expectations: A probabilistic text mining model. Procedia Engineering, 85, 175-182.
Hershbein, B., & Kahn, L. B. (2018). Do recessions accelerate routine-biased technological change? Evidence from vacancy postings. American Economic Review, 108(7), 1737-72.
Ketamo, H., Moisio, A., Passi-Rauste, A., & Alamäki, A. (2019). Mapping the Future Curriculum: Adopting Artificial Intelligence and Analytics in Forecasting Competence Needs. In Proceedings of the 10th European Conference on Intangibles and Intellectual Capital ECIIC 2019. Academic Conference Publishing International.
Karakatsanis, I., AlKhader, W., MacCrory, F., Alibasic, A., Omar, M. A., Aung, Z., & Woon, W. L. (2017). Data mining approach to monitoring the requirements of the job market: A case study. Information Systems, 65, 1-6.
Lavrinenko, A., & Shmatko, N. (2019). Twenty-First Century Skills in Finance: Prospects for a Profound Job Transformation. Форсайт, 13(2 (eng)).
Maer-Matei, M. M., Mocanu, C., Zamfir, A. M., & Georgescu, T. M. (2019). Skill Needs for Early Career Researchers—A Text Mining Approach. Sustainability, 11(10), 2789.
Meyer, D., Hornik, K., & Feinerer, I. (2008). Text mining infrastructure in R. Journal of statistical software, 25(5), 1-54.
Modestino, A. S., Shoag, D., & Ballance, J. (2016). Downskilling: changes in employer skill requirements over the business cycle. Labour Economics, 41, 333-347.
Pater, R., Szkola, J., & Kozak, M. (2019). A method for measuring detailed demand for workers' competences. Economics: The Open-Access, Open-Assessment EJournal, 13(2019-27), 1-30.
Srivastava, A. N., & Sahami, M. (2009). Text mining: Classification, clustering, and applications. Chapman and Hall/CRC.
Voulgaris, Z. (2014). Data scientist: the definitive guide to becoming a data scientist. Technics Publications.
Wowczko, I. (2015, December). Skills and vacancy analysis with data mining techniques. In Informatics (Vol. 2, No. 4, pp. 31-49). Multidisciplinary Digital Publishing Institute.
Woolridge, R. W., & Parks, R. (2016). What's In and What's Out: Defining an Industry-Aligned IS Curriculum Using Job Advertisements. Journal of Higher Education Theory & Practice, 16(2).
Weihs, C., & Ickstadt, K. (2018). Data science: the impact of statistics. International Journal of Data Science and Analytics, 6(3), 189-194.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant this journal right of first publication, with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work, with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g. post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g. in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as an earlier and greater citation of published work (See The Effect of Open Access).
LUMEN EAS Journal has an Attribution-NonCommercial-NoDerivs