Fast R-CNN

Girshick, Ross

Cornell University

We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate

arXiv logo

Computer Science> Computer Vision and Pattern Recognition

arXiv:1504.08083 (cs)

[Submitted on 30 Apr 2015 (v1), last revised 27 Sep 2015 (this version, v2)]

Title:Fast R-CNN

Authors:Ross Girshick

View PDF

Abstract:This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection. Fast R-CNN builds on previous work to efficiently classify object proposals using deep convolutional networks. Compared to previous work, Fast R-CNN employs several innovations to improve training and testing speed while also increasing detection accuracy. Fast R-CNN trains the very deep VGG16 network 9x faster than R-CNN, is 213x faster at test-time, and achieves a higher mAP on PASCAL VOC 2012. Compared to SPPnet, Fast R-CNN trains VGG16 3x faster, tests 10x faster, and is more accurate. Fast R-CNN is implemented in Python and C++ (using Caffe) and is available under the open-source MIT License at this https URL.

Comments:	To appear in ICCV 2015
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1504.08083 [cs.CV]
(or arXiv:1504.08083v2 [cs.CV] for this version)
https://doi.org/10.48550/arXiv.1504.08083