Semantic Scholar Open Access 2015 28100 sitasi

Fast R-CNN

Ross B. Girshick

Abstrak

This paper proposes Fast R-CNN, a clean and fast framework for object detection. Compared to traditional R-CNN, and its accelerated version SPPnet, Fast R-CNN trains networks using a multi-task loss in a single training stage. The multi-task loss simplifies learning and improves detection accuracy. Unlike SPPnet, all network layers can be updated during fine-tuning. We show that this difference has practical ramifications for very deep networks, such as VGG16, where mAP suffers when only the fully-connected layers are updated. Compared to"slow"R-CNN, Fast R-CNN is 9x faster at training VGG16 for detection, 213x faster at test-time, and achieves a significantly higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ and is available under the open-source MIT License at https://github.com/rbgirshick/fast-rcnn

Topik & Kata Kunci

Penulis (1)

R

Ross B. Girshick

Format Sitasi

Girshick, R.B. (2015). Fast R-CNN. https://www.semanticscholar.org/paper/7ffdbc358b63378f07311e883dddacc9faeeaf4b

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2015
Bahasa
en
Total Sitasi
28100×
Sumber Database
Semantic Scholar
Akses
Open Access ✓