[CVPR 2016] Weakly Supervised Deep Detection Networks论文笔记

zoukankan html css js c++ java

[CVPR 2016] Weakly Supervised Deep Detection Networks论文笔记
Weakly Supervised Deep Detection Networks，Hakan Bilen，Andrea Vedaldi

https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Bilen_Weakly_Supervised_Deep_CVPR_2016_paper.pdf

亮点
- 把弱监督检测问题解释为proposal排序的问题，通过比较所有proposal的类别分数得到一个比较正确的排序，这种思想与检测中评测标准的计算方法一致
相关工作

The MIL strategy results in a non-convex optimization problem; in practice, solvers tend to get stuck in local optima

such that the quality of the solution strongly depends on the initialization.
- developing various initialization strategies [19, 5, 32, 4]
- on regularizing the optimization problem [31, 1].
- Another line of research in WSD is based on the idea of identifying the similarity between image parts.
- [31] propose a discriminative graph-based algorithm that selects a subset of windows such that each window is connected to its nearest neighbors in positive images.
- [32] extend this method to discover multiple co-occurring part configurations.
- [36] propose an iterative technique that applies a latent semantic clustering via latent Semantic Analysis (pLSA)
- [2] propose a formulation that jointly learns a discriminative model and enforces the similarity of the selected object regions via a discriminative convex clustering algorithm
方法

本文采用的方法非常简单易懂，主要分为以下三部：
- 将特征和region proposal的结果输入spatial pyramid pooling层，取出与区域相关的特征向量，并输入两个fc层
- 分类：fc层的输出通过softmax分类器，计算出这一区域类别
- 检测：fc层的输出通过softmax分类器，与上面不同的是归一化的时候不是用类别归一化，而是用所有区域的分数进行归一化，通过区域之间的对比找到包含该类别信息最多的区域
训练的loss function如下

最后一项是一个校准项（按照理解轻微更改了，感觉论文notation有点问题），其目的是通过拉近feature的距离约束解的平滑性（即与正确解相近的proposal也应该得到高分）。

实验结果

本文根据basenet不同给出了4种model：S (VGG-F), M (VGG-M-1024), L (VGG-VD16)和Ens（前三种ensemble的模型）
- Ablation:
- VOC2007
- VOC2010
缺点

本文有一个明显的缺点是只考虑了一张图中某类别物体只出现一次的情况（regulariser中仅限制了最大值及其周围的框），这一点在文中给出的failure cases中也有所体现。
查看全文

相关阅读:
NOIP模拟题管道
 NOIP模拟题序列
 NOIP模拟题栅栏
 NOIP模拟题斐波那契数列
 CodeForces 797F Mice and Holes
CodeForces 589H Tourist Guide
CERC2016 爵士之旅 Jazz Journey
BZOJ3832 Rally
BZOJ1061 NOI2008 志愿者招募
 js数组的操作

原文地址：https://www.cnblogs.com/Xiaoyan-Li/p/8695772.html