Semantic Scholar Open Access 2019 243 sitasi

Process Mining for Python (PM4Py): Bridging the Gap Between Process- and Data Science

A. Berti S. V. Zelst Wil M.P. van der Aalst

Abstrak

Process mining, i.e., a sub-field of data science focusing on the analysis of event data generated during the execution of (business) processes, has seen a tremendous change over the past two decades. Starting off in the early 2000's, with limited to no tool support, nowadays, several software tools, i.e., both open-source, e.g., ProM and Apromore, and commercial, e.g., Disco, Celonis, ProcessGold, etc., exist. The commercial process mining tools provide limited support for implementing custom algorithms. Moreover, both commercial and open-source process mining tools are often only accessible through a graphical user interface, which hampers their usage in large-scale experimental settings. Initiatives such as RapidProM provide process mining support in the scientific workflow-based data science suite RapidMiner. However, these offer limited to no support for algorithmic customization. In the light of the aforementioned, in this paper, we present a novel process mining library, i.e. Process Mining for Python (PM4Py) that aims to bridge this gap, providing integration with state-of-the-art data science libraries, e.g., pandas, numpy, scipy and scikit-learn. We provide a global overview of the architecture and functionality of PM4Py, accompanied by some representative examples of its usage.

Topik & Kata Kunci

Penulis (3)

A

A. Berti

S

S. V. Zelst

W

Wil M.P. van der Aalst

Format Sitasi

Berti, A., Zelst, S.V., Aalst, W.M.v.d. (2019). Process Mining for Python (PM4Py): Bridging the Gap Between Process- and Data Science. https://www.semanticscholar.org/paper/a11e157cb828b800426223f0a3d79e8fb122c8cc

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2019
Bahasa
en
Total Sitasi
243×
Sumber Database
Semantic Scholar
Akses
Open Access ✓