arXiv Open Access 2024

Spatial Transformer Network YOLO Model for Agricultural Object Detection

Yash Zambre Ekdev Rajkitkul Akshatha Mohan Joshua Peeples
Lihat Sumber

Abstrak

Object detection plays a crucial role in the field of computer vision by autonomously locating and identifying objects of interest. The You Only Look Once (YOLO) model is an effective single-shot detector. However, YOLO faces challenges in cluttered or partially occluded scenes and can struggle with small, low-contrast objects. We propose a new method that integrates spatial transformer networks (STNs) into YOLO to improve performance. The proposed STN-YOLO aims to enhance the model's effectiveness by focusing on important areas of the image and improving the spatial invariance of the model before the detection process. Our proposed method improved object detection performance both qualitatively and quantitatively. We explore the impact of different localization networks within the STN module as well as the robustness of the model across different spatial transformations. We apply the STN-YOLO on benchmark datasets for Agricultural object detection as well as a new dataset from a state-of-the-art plant phenotyping greenhouse facility. Our code and dataset are publicly available.

Topik & Kata Kunci

Penulis (4)

Y

Yash Zambre

E

Ekdev Rajkitkul

A

Akshatha Mohan

J

Joshua Peeples

Format Sitasi

Zambre, Y., Rajkitkul, E., Mohan, A., Peeples, J. (2024). Spatial Transformer Network YOLO Model for Agricultural Object Detection. https://arxiv.org/abs/2407.21652

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓