Object Detection(目標檢測神文)

浩瀚之水_csdn發表於2018-11-02

目標檢測神文，非常全而且持續在更新。轉發自：https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html，如有侵權聯絡刪除。
更新時間：
20181026

我會跟進原作者部落格持續更新，加入自己對目標檢測領域的一些新研究及論文解讀。部落格根據需求直接進行關鍵字搜尋，例如2018，可找到最新論文。

Method	backbone	test size	VOC2007	VOC2010	VOC2012	ILSVRC 2013	MSCOCO 2015	Speed
OverFeat						24.3%
R-CNN	AlexNet		58.5%	53.7%	53.3%	31.4%
R-CNN	VGG17		66.0%
SPP_net	ZF-5		54.2%			31.84%
DeepID-Net			64.1%			50.3%
NoC			73.3%		68.8%
Fast-RCNN	VGG16		70.0%	68.8%	68.4%		19.7%(@[0.5-0.95]), 35.9%(@0.5)
MR-CNN			78.2%		73.9%
Faster-RCNN	VGG16		78.8%		75.9%		21.9%(@[0.5-0.95]), 42.7%(@0.5)	198ms
Faster-RCNN	ResNet101		85.6%		83.8%		37.4%(@[0.5-0.95]), 59.0%(@0.5)
YOLO			63.4%		57.9%			45 fps
YOLO	VGG-16		66.4%					21 fps
YOLOv2		448x448	78.6%		73.4%		21.6%(@[0.5-0.95]), 44.0%(@0.5)	40 fps
SSD	VGG16	300x300	77.2%		75.8%		25.1%(@[0.5-0.95]), 43.1%(@0.5)	46 fps
SSD	VGG16	512x512	79.8%		78.5%		28.8%(@[0.5-0.95]), 48.5%(@0.5)	19 fps
SSD	ResNet101	300x300					28.0%(@[0.5-0.95])	16 fps
SSD	ResNet101	512x512					31.2%(@[0.5-0.95])	8 fps
DSSD	ResNet101	300x300					28.0%(@[0.5-0.95])	8 fps
DSSD	ResNet101	500x500					33.2%(@[0.5-0.95])	6 fps
ION			79.2%		76.4%
CRAFT			75.7%		71.3%	48.5%
OHEM			78.9%		76.3%		25.5%(@[0.5-0.95]), 45.9%(@0.5)
R-FCN	ResNet50		77.4%					0.12sec(K40), 0.09sec(TitianX)
R-FCN	ResNet101		79.5%					0.17sec(K40), 0.12sec(TitianX)
R-FCN(ms train)	ResNet101		83.6%		82.0%		31.5%(@[0.5-0.95]), 53.2%(@0.5)
PVANet 9.0			84.9%		84.2%			750ms(CPU), 46ms(TitianX)
RetinaNet	ResNet101-FPN
Light-Head R-CNN	Xception*	800/1200					31.5%@[0.5:0.95]	95 fps
Light-Head R-CNN	Xception*	700/1100					30.7%@[0.5:0.95]	102 fps

Papers

R-CNN

Fast R-CNN

Faster R-CNN

intro: North Carolina State University & Alibaba
keywords: AND-OR Graph (AOG)
arxiv: https://arxiv.org/abs/1711.05226

Light-Head R-CNN

##Cascade R-CNN

MultiBox

SPP-Net

intro: PAMI 2016
intro: an extension of R-CNN. box pre-training, cascade on region proposals, deformation layers and context representations
project page: http://www.ee.cuhk.edu.hk/˜wlouyang/projects/imagenetDeepId/index.html
arxiv: http://arxiv.org/abs/1412.5661

intro: TPAMI 2015
keywords: NoC
arxiv: http://arxiv.org/abs/1504.06066

MR-CNN

YOLO

intro: train with customized data and class numbers/labels. Linux / Windows version for darknet.
blog: http://guanghan.info/blog/en/my-works/train-yolo/
github: https://github.com/Guanghan/darknet

intro: Real-time object detection on Android using the YOLO network with TensorFlow
github: https://github.com/natanielruiz/android-yolo

YOLOv2

intro: Auxilary scripts to work with (YOLO) darknet deep learning famework. AKA -> How to generate YOLO anchors?
github: https://github.com/Jumabek/darknet_scripts

intro: Bounding box labeler tool to generate the training data in the format YOLO v2 requires.
github: https://github.com/Cartucho/yolo-boundingbox-labeler-GUI

YOLOv3

DenseBox

SSD

DSSD

intro: rainbow SSD (R-SSD)
arxiv: https://arxiv.org/abs/1705.09587

keywords: CSSD, DiCSSD, DeCSSD, effective receptive fields (ERFs), theoretical receptive fields (TRFs)
arxiv: https://arxiv.org/abs/1707.08682

https://arxiv.org/abs/1709.05054

FSSD

https://arxiv.org/abs/1712.00960

intro: WeaveNet
keywords: fuse multi-scale information
arxiv: https://arxiv.org/abs/1712.03149

ESSD

intro: Zhengzhou University
arxiv: https://arxiv.org/abs/1805.07009

Inside-Outside Net (ION)

intro: “0.8s per image on a Titan X GPU (excluding proposal generation) without two-stage bounding-box regression and 1.15s per image with it”.
arxiv: http://arxiv.org/abs/1512.04143
slides: http://www.seanbell.ca/tmp/ion-coco-talk-bell2015.pdf
coco-leaderboard: http://mscoco.org/dataset/#detections-leaderboard

Factors in Finetuning Deep Model for object detection

intro: CVPR 2016.rank 3rd for provided data and 2nd for external data on ILSVRC 2015 object detection
project page: http://www.ee.cuhk.edu.hk/~wlouyang/projects/ImageNetFactors/CVPR16.html
arxiv: http://arxiv.org/abs/1601.05150

CRAFT

OHEM

intro: CVPR 2016
keywords: scale-dependent pooling (SDP), cascaded rejection classifiers (CRC)
paper: http://www-personal.umich.edu/~wgchoi/SDP-CRC_camready.pdf

R-FCN

arxiv: http://arxiv.org/abs/1605.06409
github: https://github.com/daijifeng001/R-FCN
github(MXNet): https://github.com/msracver/Deformable-ConvNets/tree/master/rfcn
github: https://github.com/Orpine/py-R-FCN
github: https://github.com/PureDiors/pytorch_RFCN
github: https://github.com/bharatsingh430/py-R-FCN-multiGPU
github: https://github.com/xdever/RFCN-tensorflow

MS-CNN

intro: VOC2007: 78.6%, VOC2012: 74.9%
arxiv: http://arxiv.org/abs/1608.05159

PVANET

intro: Presented at NIPS 2016 Workshop on Efficient Methods for Deep Neural Networks (EMDNN). Continuation of arXiv:1608.08021
arxiv: https://arxiv.org/abs/1611.08588
github: https://github.com/sanghoon/pva-faster-rcnn
leaderboard(PVANet 9.0): http://host.robots.ox.ac.uk:8080/leaderboard/displaylb.php?challengeid=11&compid=4

GBD-Net

intro: winner of the ImageNet object detection challenge of 2016. CUImage and CUVideo
intro: gated bi-directional CNN (GBD-Net)
arxiv: https://arxiv.org/abs/1610.02579
github: https://github.com/craftGBD/craftGBD

intro: CVPR 2017. Google Research
arxiv: https://arxiv.org/abs/1611.10012

Feature Pyramid Network (FPN)

intro: Facebook AI Research
arxiv: https://arxiv.org/abs/1612.03144

intro: CMU & UC Berkeley & Google Research
arxiv: https://arxiv.org/abs/1612.06851

intro: University of Maryland & Mitsubishi Electric Research Laboratories
arxiv: https://arxiv.org/abs/1702.01478

keykwords: CC-Net
intro: chained cascade network (CC-Net). 81.1% mAP on PASCAL VOC 2007
arxiv: https://arxiv.org/abs/1702.07054

intro: ICCV 2017 (poster)
arxiv: https://arxiv.org/abs/1703.10295

intro: CVPR 2017. SenseTime
keywords: Recurrent Rolling Convolution (RRC)
arxiv: https://arxiv.org/abs/1704.05776
github: https://github.com/xiaohaoChen/rrc_detection

intro: Embedded Vision Workshop in CVPR. UC San Diego & Qualcomm Inc
arxiv: https://arxiv.org/abs/1705.05922

intro: Point Linking Network (PLN)
arxiv: https://arxiv.org/abs/1706.03646

https://arxiv.org/abs/1707.05031

intro: BMVC 2017 (oral). Sorbonne Universités & CEDRIC
arxiv: https://arxiv.org/abs/1707.06175

DSOD

intro: ICCV 2017. Fudan University & Tsinghua University & Intel Labs China
arxiv: https://arxiv.org/abs/1708.01241
github: https://github.com/szq0214/DSOD

intro: ICCV 2017 Best student paper award. Facebook AI Research
keywords: RetinaNet
arxiv: https://arxiv.org/abs/1708.02002

https://arxiv.org/abs/1711.05187

intro: NTU, Singapore & Amazon
keywords: multi-instance multi-label domain adaption learning framework
arxiv: https://arxiv.org/abs/1711.05954

MegDet

intro: Peking University & Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1711.07240

intro: Microsoft AI & Research Munich
arxiv: https://arxiv.org/abs/1711.09822

keywords: region selection network, gating network
arxiv: https://arxiv.org/abs/1712.02408

intro: IEEE/CAA Journal of Automatica Sinica
arxiv: https://arxiv.org/abs/1712.08470

keywords: object mining, object tracking, unsupervised object discovery by appearance-based clustering, self-supervised detector adaptation
arxiv: https://arxiv.org/abs/1712.08832

intro: Tsinghua University & JD Group
arxiv: https://arxiv.org/abs/1801.01051

intro: Peking University & MSRA
arxiv: https://arxiv.org/abs/1803.07066

intro: Singapore Management University & Zhejiang University
arxiv: https://arxiv.org/abs/1803.08208

intro: University of Tokyo & National Institute of Informatics, Japan
arxiv: https://arxiv.org/abs/1803.08670

intro: National University of Defense Technology
arxiv: https://arxiv.org/abs/1804.04606

intro: Tsinghua University & Megvii Inc
arxiv: https://arxiv.org/abs/1804.06215

intro: United Technologies Research Center-Ireland
arxiv: https://arxiv.org/abs/1805.06361

intro: CVPR 2018 Deep Vision Workshop
arxiv: https://arxiv.org/abs/1805.11778

intro: Megvii Inc (Face++) & Fudan University
arxiv: https://arxiv.org/abs/1807.00980

intro: ECCV 2018. Middle East Technical University
arxiv: https://arxiv.org/abs/1807.01696
github: https://github.com/cancam/LRP

intro: Rejected by ECCV18
arxiv: https://arxiv.org/abs/1807.02842

intro: Google AI Perception
arxiv: https://arxiv.org/abs/1807.03284

intro: ECCV 2018
keywords: IoU-Net, PreciseRoIPooling
arxiv: https://arxiv.org/abs/1808.01244
github: https://github.com/umich-vl/CornerNet

Non-Maximum Suppression (NMS)

arxiv: http://arxiv.org/abs/1511.06437
Improving Object Detection With One Line of Code

intro: ICCV 2017. University of Maryland
keywords: Soft-NMS
arxiv: https://arxiv.org/abs/1704.04503
github: https://github.com/bharatsingh430/soft-nms

Adversarial Examples

intro: University of Illinois
arxiv: https://arxiv.org/abs/1712.02494

Weakly Supervised Object Detection

intro: TPAMI 2017. National Institutes of Health (NIH) Clinical Center
arxiv: https://arxiv.org/abs/1801.03145

Video Object Detection

intro: Submitted on 12 Jan 2016
keywords: Deep learning, saliency map, optical flow, convolution network, contrast features
paper: https://hal.archives-ouvertes.fr/hal-01251614/document

intro: Winning solution in ILSVRC2015 Object Detection from Video(VID) Task
arxiv: http://arxiv.org/abs/1604.02532
github: https://github.com/myfavouritekk/T-CNN

http://image-net.org/challenges/talks_2017/ilsvrc2017_short(poster).pdf

intro: University of Pennsylvania, 2Dartmouth College
arxiv: https://arxiv.org/abs/1803.05549

intro: Microsoft Research Asia
arxiv: https://arxiv.org/abs/1804.05830

Object Detection on Mobile Devices

intro: ICLR 2018 workshop track
intro: based on the SSD
arxiv: https://arxiv.org/abs/1804.06882
github: https://github.com/Robert-JunWang/Pelee

Object Detection in 3D

intro: Valeo Schalter und Sensoren GmbH & Ilmenau University of Technology
arxiv: https://arxiv.org/abs/1803.06199

Object Detection on RGB-D

Zero-Shot Object Detection

intro: Australian National University
keywords: YOLO
arxiv: https://arxiv.org/abs/1803.07113

intro: Australian National University
arxiv: https://arxiv.org/abs/1803.06049

intro: Middle East Technical University & Hacettepe University
arxiv: https://arxiv.org/abs/1805.06157

Salient Object Detection

This task involves predicting the salient regions of an image given by human eye fixations.

intro: CVPR 2016. recurrent attentional convolutional-deconvolution network (RACDNN)
arxiv: http://arxiv.org/abs/1604.03227

Unconstrained Salient Object Detection

intro: ACMMM 2016. deeply-supervised recurrent convolutional neural network (DSRCNN)
arxiv: http://arxiv.org/abs/1608.05177

intro: IEEE Transactions on Image Processing
arxiv: http://arxiv.org/abs/1609.02077

intro: Nanyang Technological University
arxiv: https://arxiv.org/abs/1611.05345

intro: University of Maryland College Park & eBay Inc
arxiv: https://arxiv.org/abs/1708.00079

intro: Accepted as a poster in ICCV 2017
arxiv: https://arxiv.org/abs/1708.02031

intro: National University of Defense Technology, China & National University of Singapore
arxiv: https://arxiv.org/abs/1708.05595

intro: 2nd Workshop on Visualisation for Deep Learning in the 34th International Conference On Machine Learning
arxiv: https://arxiv.org/abs/1801.04261

Video Saliency Detection

Visual Relationship Detection

intro: Visual Phrase reasoning Convolutional Neural Network (ViP-CNN), Visual Phrase Reasoning Structure (VPRS)
arxiv: https://arxiv.org/abs/1702.07191

intro: CVPR 2017 spotlight paper
arxiv: https://arxiv.org/abs/1703.03054

intro: CVPR 2017 oral. The Chinese University of Hong Kong
arxiv: https://arxiv.org/abs/1704.03114

intro: Google AI & IST Austria
arxiv: https://arxiv.org/abs/1807.02136

intro: 2018 ACM Multimedia Conference
arxiv: https://arxiv.org/abs/1809.06213

intro: ECCV 2018 Workshop
arxiv: https://arxiv.org/abs/1809.09828

Face Deteciton

intro: overlap with CMS-RCNN
arxiv: https://arxiv.org/abs/1612.05322

intro: ACM MM 2016
keywords: IOULoss
arxiv: http://arxiv.org/abs/1608.01471

author: 萬韶華 @ 小米.
intro: Faster R-CNN, hard negative mining. state-of-the-art on the FDDB dataset
arxiv: http://arxiv.org/abs/1608.02236

MTCNN

intro: An extended version of ICCV 2015 paper
arxiv: https://arxiv.org/abs/1701.08393

intro: CVPR 2017. MP-RCNN, MP-RPN
arxiv: https://arxiv.org/abs/1703.09145

intro: CVPR 2017. SenseTime & Tsinghua University
arxiv: https://arxiv.org/abs/1706.09876

intro: ICCV 2017. University of Maryland
arxiv: https://arxiv.org/abs/1708.03979
github(official, Caffe): https://github.com/mahyarnajibi/SSH

intro: IJCB 2017
keywords: Rapidly Digested Convolutional Layers (RDCL), Multiple Scale Convolutional Layers (MSCL)
intro: the proposed detector runs at 20 FPS on a single CPU core and 125 FPS using a GPU for VGA-resolution images
arxiv: https://arxiv.org/abs/1708.05234
github(Caffe): https://github.com/zeusees/FaceBoxes

intro: ICCV 2017. Chinese Academy of Sciences
intro: can run at 36 FPS on a Nvidia Titan X (Pascal) for VGA-resolution images
arxiv: https://arxiv.org/abs/1708.05237
github(Caffe, official): https://github.com/sfzhang15/SFD
github: https://github.com//clcarwin/SFD_pytorch

intro: CVPR 2018. Beihang University & CUHK & Sensetime
arxiv: https://arxiv.org/abs/1804.05197

intro: Beihang University & Megvii Inc. (Face++)
arxiv: https://arxiv.org/abs/1804.06559

intro: The University of Sydney
arxiv: https://arxiv.org/abs/1805.03363

Detect Small Faces

intro: ENS Paris-Saclay. ExtendedTinyFaces
intro: Detecting and counting small objects - Analysis, review and application to counting
arxiv: https://arxiv.org/abs/1801.06504
github: https://github.com/alexattia/ExtendedTinyFaces

intro: WACV 2018
keywords: Face Magnifier Network (Face-MageNet)
arxiv: https://arxiv.org/abs/1803.05258
github: https://github.com/po0ya/face-magnet

Person Head Detection

Pedestrian Detection / People Detection

intro: ICCV 2015. CUHK. DeepParts
intro: Achieving 11.89% average miss rate on Caltech Pedestrian Dataset
paper: http://personal.ie.cuhk.edu.hk/~pluo/pdf/tianLWTiccv15.pdf

intro: “set a new record on the Caltech pedestrian dataset, lowering the log-average miss rate from 11.7% to 8.9%”
arxiv: http://arxiv.org/abs/1603.04525

intro: ECCV Workshop 2016
arxiv: https://arxiv.org/abs/1802.03269

intro: IEEE 2016 ICCE-Berlin
arxiv: http://arxiv.org/abs/1609.02500

intro: ECCV 2016 Workshops
arxiv: https://arxiv.org/abs/1610.08871

intro: CVPR 2017. Tsinghua University & Peking University & Megvii Inc.
keywords: Faster R-CNN, HyperLearner
arxiv: https://arxiv.org/abs/1705.02757
paper: http://openaccess.thecvf.com/content_cvpr_2017/papers/Mao_What_Can_Help_CVPR_2017_paper.pdf

intro: CMU & Volvo Construction
arxiv: https://arxiv.org/abs/1706.08917

intro: The University of North Carolina at Chapel Hill
arxiv: https://arxiv.org/abs/1707.09100

intro: State Key Lab of CAD&CG, Zhejiang University
arxiv: https://arxiv.org/abs/1803.05347

intro: British Machine Vision Conference(BMVC) 2017
arxiv: https://arxiv.org/abs/1804.04483

intro: ECCV 2018. Hikvision Research Institute
arxiv: https://arxiv.org/abs/1807.01438

Vehicle Detection

intro: IEEE Transactions on Intelligent Transportation Systems (T-ITS)
arxiv: https://arxiv.org/abs/1804.00433

Traffic-Sign Detection

intro: IEEE Conference on Information Reuse and Integration (IRI) 2017 oral
arxiv: https://arxiv.org/abs/1706.08574

Skeleton Detection

Fruit Detection

Shadow Detection

intro: The Chinese University of Hong Kong & The Hong Kong Polytechnic University
arxiv: https://arxiv.org/abs/1805.04635

Others Detection

intro: Conference of Computer and Robot Vision. University of Guelph
arxiv: https://arxiv.org/abs/1803.10842

Object Proposal

intro: IEEE Transactions on Image Processing
arxiv: http://arxiv.org/abs/1601.04798

intro: CVPR 2017
keywords: differentiable Determinantal Point Process (DPP) layer, Learning Detection with Diverse Proposals (LDDP)
arxiv: https://arxiv.org/abs/1704.03533

keywords: product detection
arxiv: https://arxiv.org/abs/1704.06752

Localization

intro: ICCV 2015
keywords: Markov Decision Process
arxiv: https://arxiv.org/abs/1511.06015

Tutorials / Talks

intro: Hikvision Research Institute. Supervised Data Augmentation (SDA)
slides: http://image-net.org/challenges/talks/2016/Hikvision_at_ImageNet_2016.pdf

https://docs.google.com/presentation/d/1OTfGn6mLe1VWE8D0q6Tu_WwFTSoLGd4OF8WCYnOWcVo/edit#slide=id.g37418adc7a_0_229

Projects

intro: FAIR’s research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
github: https://github.com/facebookresearch/Detectron

intro: “The basic model implements the simple and robust GoogLeNet-OverFeat algorithm. We additionally provide an implementation of the ReInspect algorithm”
github: https://github.com/Russell91/TensorBox

intro: Full convolution MultiBox Detector (like SSD) implemented in Torch.
github: https://github.com/teaonly/FMD.torch

keywords: MultiNet
intro: KittiBox is a collection of scripts to train out model FastBox on the Kitti Object Detection Dataset
github: https://github.com/MarvinTeichmann/KittiBox

intro: Most popular metrics used to evaluate object detection algorithms
github: https://github.com/rafaelpadilla/Object-Detection-Metrics

Leaderboard

Tools

https://github.com/antingshen/BeaverDam

Blogs

http://rnd.azoft.com/convolutional-neural-networks-object-detection/

https://towardsdatascience.com/understanding-ssd-multibox-real-time-object-detection-in-deep-learning-495ef744fab

http://machinethink.net/blog/object-detection/

https://www.jeremyjordan.me/object-detection-one-stage/

Object Detection(目標檢測神文)

文章目錄

Papers

Deep Neural Networks for Object Detection

OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

R-CNN

Rich feature hierarchies for accurate object detection and semantic segmentation

Fast R-CNN

Fast R-CNN

A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection

Faster R-CNN

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

R-CNN minus R

Faster R-CNN in MXNet with distributed implementation and data parallelization

Contextual Priming and Feedback for Faster R-CNN

An Implementation of Faster RCNN with Study for Region Sampling

Interpretable R-CNN

Light-Head R-CNN

Light-Head R-CNN: In Defense of Two-Stage Object Detector

Cascade R-CNN: Delving into High Quality Object Detection

MultiBox

Scalable Object Detection using Deep Neural Networks

Scalable, High-Quality Object Detection

SPP-Net

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

Object Detectors Emerge in Deep Scene CNNs

segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

Object Detection Networks on Convolutional Feature Maps

Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction

DeepBox: Learning Objectness with Convolutional Networks

MR-CNN

Object detection via a multi-region & semantic segmentation-aware CNN model

YOLO

You Only Look Once: Unified, Real-Time Object Detection

darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++

Start Training YOLO with Our Own Data

YOLO: Core ML versus MPSNNGraph

TensorFlow YOLO object detection on Android

Computer Vision in iOS – Object Detection

YOLOv2

YOLO9000: Better, Faster, Stronger

darknet_scripts

Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2

LightNet: Bringing pjreddie’s DarkNet out of the shadows

YOLO v2 Bounding Box Tool

YOLOv3

YOLOv3: An Incremental Improvement

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

DenseBox

DenseBox: Unifying Landmark Localization with End to End Object Detection

SSD

SSD: Single Shot MultiBox Detector

DSSD

DSSD : Deconvolutional Single Shot Detector

Enhancement of SSD by concatenating feature maps for object detection

Context-aware Single-Shot Detector

Feature-Fused SSD: Fast Detection for Small Objects

FSSD

FSSD: Feature Fusion Single Shot Multibox Detector

Weaving Multi-scale Context for Single Shot Detector

ESSD

Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network

Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection

MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects

Inside-Outside Net (ION)

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

Adaptive Object Detection Using Adjacency and Zoom Prediction

G-CNN: an Iterative Grid Based Object Detector

Factors in Finetuning Deep Model for object detection

Factors in Finetuning Deep Model for Object Detection with Long-tail Distribution

We don’t need no bounding-boxes: Training object class detectors using only human verification

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

A MultiPath Network for Object Detection

CRAFT

CRAFT Objects from Images

OHEM

Training Region-based Object Detectors with Online Hard Example Mining

S-OHEM: Stratified Online Hard Example Mining for Object Detection

Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers