Semantic Scholar Open Access 2023 5 sitasi

A framework for generating large-scale microphone array data for machine learning

Adam Kujawski Art J. R. Pelling S. Jekosch E. Sarradj

Lihat Sumber DOI

Abstrak

The use of machine learning for localization of sound sources from microphone array data has increased rapidly in recent years. Newly developed methods are of great value for hearing aids, speech technologies, smart home systems or engineering acoustics. The existence of openly available data is crucial for the comparability and development of new data-driven methods. However, the literature review reveals a lack of openly available datasets, especially for large microphone arrays. This contribution introduces a framework for generation of acoustic data for machine learning. It implements tools for the reproducible random sampling of virtual measurement scenarios. The framework allows computations on multiple machines, which significantly speeds up the process of data generation. Using the framework, an example of a development dataset for sound source characterization with a 64-channel array is given. A containerized environment running the simulation source code is openly available. The presented approach enables the user to calculate large datasets, to store only the features necessary for training, and to share the source code which is needed to reproduce datasets instead of sharing the data itself. This avoids the problem of distributing large datasets and enables reproducible research.

Topik & Kata Kunci

Computer Science

Penulis (4)

Adam Kujawski

Art J. R. Pelling

S. Jekosch

E. Sarradj

Format Sitasi

APA MLA BibTeX

Kujawski, A., Pelling, A.J.R., Jekosch, S., Sarradj, E. (2023). A framework for generating large-scale microphone array data for machine learning. https://doi.org/10.1007/s11042-023-16947-w

Akses Cepat

Lihat di Sumber doi.org/10.1007/s11042-023-16947-w

Informasi Jurnal

Tahun Terbit: 2023
Bahasa: en
Total Sitasi: 5×
Sumber Database: Semantic Scholar
DOI: 10.1007/s11042-023-16947-w
Akses: Open Access ✓