arXiv Open Access 2025

JobHop: A Large-Scale Dataset of Career Trajectories

Iman Johary Raphael Romero Alexandru C. Mara Tijl De Bie
Lihat Sumber

Abstrak

Understanding labor market dynamics is essential for policymakers, employers, and job seekers. However, comprehensive datasets that capture real-world career trajectories are scarce. In this paper, we introduce JobHop, a large-scale public dataset derived from anonymized resumes provided by VDAB, the public employment service in Flanders, Belgium. Utilizing Large Language Models (LLMs), we process unstructured resume data to extract structured career information, which is then normalized to standardized ESCO occupation codes using a multi-label classification model. This results in a rich dataset of over 1.67 million work experiences, extracted from and grouped into more than 361,000 user resumes and mapped to standardized ESCO occupation codes, offering valuable insights into real-world occupational transitions. This dataset enables diverse applications, such as analyzing labor market mobility, job stability, and the effects of career breaks on occupational transitions. It also supports career path prediction and other data-driven decision-making processes. To illustrate its potential, we explore key dataset characteristics, including job distributions, career breaks, and job transitions, demonstrating its value for advancing labor market research.

Topik & Kata Kunci

Penulis (4)

I

Iman Johary

R

Raphael Romero

A

Alexandru C. Mara

T

Tijl De Bie

Format Sitasi

Johary, I., Romero, R., Mara, A.C., Bie, T.D. (2025). JobHop: A Large-Scale Dataset of Career Trajectories. https://arxiv.org/abs/2505.07653

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓