Misc,

Light-Head R-CNN: In Defense of Two-Stage Object Detector

xxx.
(Nov 23, 2017)

Abstract

In this paper, we first investigate why typical two-stage methods are not as fast as single-stage, fast detectors like YOLO and SSD. We find that Faster R-CNN and R-FCN perform an intensive computation after or before RoI warping. Faster R-CNN involves two fully connected layers for RoI recognition, while R-FCN produces a large score maps. Thus, the speed of these networks is slow due to the heavy-head design in the architecture. Even if we significantly reduce the base model, the computation cost cannot be largely decreased accordingly. We propose a new two-stage detector, Light-Head R-CNN, to address the shortcoming in current two-stage approaches. In our design, we make the head of network as light as possible, by using a thin feature map and a cheap R-CNN subnet (pooling and single fully-connected layer). Our ResNet-101 based light-head R-CNN outperforms state-of-art object detectors on COCO while keeping time efficiency. More importantly, simply replacing the backbone with a tiny network (e.g, Xception), our Light-Head R-CNN gets 30.7 mmAP at 102 FPS on COCO, significantly outperforming the single-stage, fast detectors like YOLO and SSD on both speed and accuracy. Code will be made publicly available.

BibTeX key: citeulike:14508715
entry type: misc
year: 2017
month: nov
day: 23
citeulike-article-id: 14508715
citeulike-linkout-1: http://arxiv.org/pdf/1711.07264
priority: 0
posted-at: 2017-12-27 16:35:39
eprint: 1711.07264
citeulike-linkout-0: http://arxiv.org/abs/1711.07264
archiveprefix: arXiv
url: http://arxiv.org/abs/1711.07264

BibSonomy

Light-Head R-CNN: In Defense of Two-Stage Object Detector

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on