Semantic Scholar Open Access 2023 15 sitasi

MiraBest: A Dataset of Morphologically Classified Radio Galaxies for Machine Learning

Fiona A. M. Porter A. Scaife

Abstrak

The volume of data from current and future observatories has motivated the increased development and application of automated machine learning methodologies for astronomy. However, less attention has been given to the production of standardized data sets for assessing the performance of different machine learning algorithms within astronomy and astrophysics. Here we describe in detail the MiraBest data set, a publicly available batched data set of 1256 radio-loud AGN from NVSS and FIRST, filtered to 0.03 < z < 0.1, manually labelled by Miraghaei and Best according to the Fanaroff–Riley morphological classification, created for machine learning applications and compatible for use with standard deep learning libraries. We outline the principles underlying the construction of the data set, the sample selection and pre-processing methodology, data set structure and composition, as well as a comparison of MiraBest to other data sets used in the literature. Existing applications that utilize the MiraBest data set are reviewed, and an extended data set of 2100 sources is created by cross-matching MiraBest with other catalogues of radio-loud AGN that have been used more widely in the literature for machine learning applications.

Topik & Kata Kunci

Penulis (2)

F

Fiona A. M. Porter

A

A. Scaife

Format Sitasi

Porter, F.A.M., Scaife, A. (2023). MiraBest: A Dataset of Morphologically Classified Radio Galaxies for Machine Learning. https://doi.org/10.48550/arXiv.2305.11108

Akses Cepat

Lihat di Sumber doi.org/10.48550/arXiv.2305.11108
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
15×
Sumber Database
Semantic Scholar
DOI
10.48550/arXiv.2305.11108
Akses
Open Access ✓