IZA DP No. 15506: Using Online Vacancy and Job Applicants' Data to Study Skills Dynamics
We assess whether online data on vacancies and applications to a job board are a suitable source for studying skills dynamics outside of Europe and the United States, where a rich literature has examined skills dynamics using online vacancy data. Yet, the knowledge on skills dynamics is scarce for other countries, irrespective of their level of development. We first propose a taxonomy that systematically aggregates three broad categories of skills – cognitive, socioemotional and manual – and fourteen commonly observed and recognizable skills sub-categories, which we define based on unique skills identified through keywords and expressions. Our aim is to develop a taxonomy that is comprehensive but succinct, suitable for the labour market realities of developing and emerging economies and adapted to online vacancies and applicants' data. Using machine-learning techniques, we then develop a methodology that allows implementing the skills taxonomy in online vacancy and applicants' data, thus capturing both the supply and the demand side. Implementing the methodology with Uruguayan data from the job board BuscoJobs, we assign skills to 64 per cent of applicants' employment spells and 94 per cent of vacancies. We consider this a successful implementation since the exploited text information often does not follow a standardized format. The advantage of our approach is its reliance on data that is currently available in many countries across the world, thereby allowing for country-specific analysis that does not need to assume that occupational skills bundles are the same across countries. To the best of our knowledge, we are the first to explore this approach in the context of emerging economies.