Object Detection & Semantic Segmentation (CNN)
RocheL
May 1, 2022
Last edited: 2022-8-28
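Classic CNN-based object detection & semantic segmentation algorithms: R-CNN, SSD, YOLO, DETR (to be written), ViT-FRCNN (to be written), SETR (segmentation, to be written)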
The R-CNN family
R-CNN
Following Mu Li's order of presentation, we start with R-CNN (region-based CNN).
data:image/s3,"s3://crabby-images/6d051/6d051d37b7ba4ddecdcb6984cbc81734e0c4b9a8" alt="notion image"
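Much of the traditional computer-vision pipeline is kept: a heuristic search first picks out candidate boxes, then classical ML (regression and an SVM) is applied to each candidate region. The linear regression predicts the position offset between the original candidate box and the ground-truth (GT) box. The model is actually quite sensitive to input size, so one would like to use pooling to constrain the size (Region of Interest pooling).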
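To make the "offset between candidate box and GT" concrete, here is a minimal sketch of one common parameterization for R-CNN-style box regression (center shift scaled by the proposal size, log ratio for width and height). The note itself only says the regressor predicts the position difference, so the exact formula and the function names `encode_offsets` / `decode_offsets` are assumptions for illustration.

```python
import math

def box_to_center(box):
    """Convert (x1, y1, x2, y2) to (cx, cy, w, h)."""
    x1, y1, x2, y2 = box
    w, h = x2 - x1, y2 - y1
    return x1 + w / 2, y1 + h / 2, w, h

def encode_offsets(proposal, gt):
    """Regression target: offset from a proposal box to its ground-truth box."""
    px, py, pw, ph = box_to_center(proposal)
    gx, gy, gw, gh = box_to_center(gt)
    # Center shift is scaled by the proposal size; size change is a log ratio.
    return ((gx - px) / pw, (gy - py) / ph, math.log(gw / pw), math.log(gh / ph))

def decode_offsets(proposal, offsets):
    """Apply (predicted) offsets to a proposal to get the refined box."""
    px, py, pw, ph = box_to_center(proposal)
    tx, ty, tw, th = offsets
    cx, cy = px + tx * pw, py + ty * ph
    w, h = pw * math.exp(tw), ph * math.exp(th)
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)

# Made-up boxes, only to exercise the two functions.
proposal = (10.0, 10.0, 50.0, 90.0)
gt = (14.0, 18.0, 62.0, 98.0)
targets = encode_offsets(proposal, gt)
refined = decode_offsets(proposal, targets)
```

Decoding the encoded targets gives back the GT box, which is what a perfect regressor would achieve.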
data:image/s3,"s3://crabby-images/99da3/99da3d2512c5ef7364194288dc8cbdb100d44bdb" alt="notion image"
Fast R-CNN
data:image/s3,"s3://crabby-images/8d869/8d869e3d7f955576769da8c446c4398693a2dafb" alt="notion image"
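In effect, the whole image goes through the CNN once and the boxes are then looked up on the feature map. Selective search is still a heuristic search, but the candidate boxes it finds on the original image are mapped onto the feature map.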
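A rough sketch of those two steps, assuming the backbone downsamples by a fixed `stride` and the feature map is a plain 2-D list: first map an image-space box to feature-map cells, then max-pool that region into a fixed-size grid (the RoI pooling idea). The rounding scheme and the helper names are my own simplification, not taken from the paper.

```python
import math

def ceil_div(a, b):
    """Integer ceiling division."""
    return -(-a // b)

def project_box(box, stride):
    """Map an image-space box (x1, y1, x2, y2) to integer feature-map cells."""
    x1, y1, x2, y2 = box
    fx1, fy1 = math.floor(x1 / stride), math.floor(y1 / stride)
    fx2 = max(fx1 + 1, math.ceil(x2 / stride))  # keep at least one cell
    fy2 = max(fy1 + 1, math.ceil(y2 / stride))
    return fx1, fy1, fx2, fy2

def roi_max_pool(feature, roi, out_h, out_w):
    """Max-pool the feature-map region `roi` into a fixed out_h x out_w grid."""
    x1, y1, x2, y2 = roi
    h, w = y2 - y1, x2 - x1
    pooled = []
    for i in range(out_h):
        r0 = y1 + (i * h) // out_h
        r1 = max(r0 + 1, y1 + ceil_div((i + 1) * h, out_h))
        row = []
        for j in range(out_w):
            c0 = x1 + (j * w) // out_w
            c1 = max(c0 + 1, x1 + ceil_div((j + 1) * w, out_w))
            row.append(max(feature[r][c]
                           for r in range(r0, r1) for c in range(c0, c1)))
        pooled.append(row)
    return pooled

# Toy 8x8 feature map for a 128x128 image (stride 16); values are just indices.
feature = [[r * 8 + c for c in range(8)] for r in range(8)]
roi = project_box((20, 36, 100, 120), stride=16)
pooled = roi_max_pool(feature, roi, out_h=2, out_w=2)
```

Whatever the size of the candidate box, the output is always `out_h` x `out_w`, which is what lets the following fully connected layers accept it.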
Faster R-CNN
data:image/s3,"s3://crabby-images/b92c3/b92c32b5e2728b3a8da03ac28182e02006a32451" alt="notion image"
A small attached network (the region proposal network) replaces the heuristic search, which makes it faster.
Results
data:image/s3,"s3://crabby-images/c1bc0/c1bc05507e454dc0ba52d0cbe1c9c807f8f05769" alt="notion image"
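It is a two-stage network after all: the accuracy is good, but it is slow.
The original R-CNN was implemented in MATLAB and also relies on traditional CV methods, so it is fairly complicated to implement.
Mask R-CNN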
data:image/s3,"s3://crabby-images/02f97/02f977af04754278223866f43a2d58ccb417eaa5" alt="notion image"
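Mask R-CNN does pixel-level segmentation, so strictly speaking it goes beyond object detection, which only needs boxes. It is better described as segmentation, more precisely instance segmentation (note: instance segmentation builds on semantic segmentation by numbering individual objects, whether they belong to the same class or to different classes).
Because pixel-level accuracy is required, it switches to RoI Align: instead of rounding the region to whole feature-map cells, it samples the feature map at evenly spaced fractional positions by interpolation (roughly like super-resolving the features first) and then pools uniformly.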
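A minimal sketch of the RoI Align idea, under some simplifying assumptions of mine: the feature map is a 2-D list, `feature[r][c]` is treated as the value at point `(r, c)`, each output bin is averaged over `samples` x `samples` bilinearly interpolated points, and coordinates are clamped at the border. Real implementations differ in details such as the coordinate convention.

```python
import math

def bilinear(feature, y, x):
    """Bilinearly interpolate a 2-D list at a fractional position (y, x)."""
    h, w = len(feature), len(feature[0])
    y = min(max(y, 0.0), h - 1.0)  # clamp to the valid range
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feature[y0][x0] * (1 - dx) + feature[y0][x1] * dx
    bottom = feature[y1][x0] * (1 - dx) + feature[y1][x1] * dx
    return top * (1 - dy) + bottom * dy

def roi_align(feature, roi, out_h, out_w, samples=2):
    """Pool `roi` = (x1, y1, x2, y2), given in fractional feature-map
    coordinates, into out_h x out_w bins without rounding the box."""
    x1, y1, x2, y2 = roi
    bin_h, bin_w = (y2 - y1) / out_h, (x2 - x1) / out_w
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            total = 0.0
            for sy in range(samples):
                for sx in range(samples):
                    # Evenly spaced sample points inside bin (i, j).
                    y = y1 + (i + (sy + 0.5) / samples) * bin_h
                    x = x1 + (j + (sx + 0.5) / samples) * bin_w
                    total += bilinear(feature, y, x)
            row.append(total / (samples * samples))
        out.append(row)
    return out

# Toy feature map whose value is linear in position, so results are easy to check.
feature = [[r * 10 + c for c in range(5)] for r in range(4)]
aligned = roi_align(feature, (1.25, 0.5, 3.25, 2.5), out_h=2, out_w=2)
```

Note that the box corners stay fractional all the way through; that is the difference from RoI pooling, where the box and the bins are snapped to whole cells.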
SSD
Single-shot detection, i.e. one-stage: anchor boxes are generated for every pixel of feature maps at several different resolutions.
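As an illustration of "anchors for every pixel at several resolutions", the sketch below generates normalized anchors centered on each cell of a few feature maps. The pyramid shapes, sizes and aspect ratios are made-up values, and every size is paired with every ratio, which is a simplification (implementations often use a smaller subset of combinations).

```python
import math

def anchors_for_feature_map(fmap_h, fmap_w, sizes, ratios):
    """Anchors (x1, y1, x2, y2), normalized to [0, 1], centered on each cell."""
    # One (w, h) shape per size/ratio pair; ratio is width divided by height.
    shapes = [(s * math.sqrt(r), s / math.sqrt(r)) for s in sizes for r in ratios]
    anchors = []
    for i in range(fmap_h):
        for j in range(fmap_w):
            cx, cy = (j + 0.5) / fmap_w, (i + 0.5) / fmap_h
            for w, h in shapes:
                anchors.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return anchors

# Hypothetical pyramid: coarser feature maps get larger anchors.
pyramid = [((8, 8), [0.2]), ((4, 4), [0.4]), ((2, 2), [0.7])]
ratios = [1.0, 2.0, 0.5]
all_anchors = []
for (h, w), sizes in pyramid:
    all_anchors += anchors_for_feature_map(h, w, sizes, ratios)
```

The fine 8x8 map contributes many small anchors and the coarse 2x2 map a few large ones, so objects of different scales are handled by different layers in a single forward pass.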
data:image/s3,"s3://crabby-images/c762c/c762c410f45cf185595392114f3778c5f3ff12e5" alt="notion image"
data:image/s3,"s3://crabby-images/8b516/8b51679c5d5997640ef87b1ff73e302fdcc5f3b7" alt="notion image"
data:image/s3,"s3://crabby-images/25cfb/25cfbbf2864eb72d3c616f34012f5c4eb60e462d" alt="notion image"
Fast, with mediocre accuracy.
YOLO
data:image/s3,"s3://crabby-images/dee96/dee96a93f8749e02b1836eeac32c5b6a573dc601" alt="notion image"
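The original version mainly relies on a small number of evenly divided anchor regions (a uniform grid) plus a matching bounding-box strategy to do one-stage detection directly. Put simply, it learns a bounding-box generation strategy that fits the dataset.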
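One way to picture the evenly divided grid: each ground-truth box is assigned to the grid cell that contains its center, and the regression target is the center's offset inside that cell plus the box size relative to the image. The sketch below follows that idea; the grid size of 7, the 448x448 image and the function names are assumptions used for illustration, and class scores and confidence are left out.

```python
def encode_yolo_target(box, img_w, img_h, grid=7):
    """Return (row, col, (x, y, w, h)) for a pixel box (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2 / img_w, (y1 + y2) / 2 / img_h  # normalized center
    col = min(int(cx * grid), grid - 1)  # cell that contains the center
    row = min(int(cy * grid), grid - 1)
    x, y = cx * grid - col, cy * grid - row  # offset inside that cell, in [0, 1)
    w, h = (x2 - x1) / img_w, (y2 - y1) / img_h  # size relative to the image
    return row, col, (x, y, w, h)

def decode_yolo_target(row, col, target, img_w, img_h, grid=7):
    """Invert encode_yolo_target back to a pixel box."""
    x, y, w, h = target
    cx, cy = (col + x) / grid * img_w, (row + y) / grid * img_h
    bw, bh = w * img_w, h * img_h
    return (cx - bw / 2, cy - bh / 2, cx + bw / 2, cy + bh / 2)

# Made-up box on a 448x448 image.
box = (100.0, 150.0, 300.0, 350.0)
row, col, target = encode_yolo_target(box, 448, 448)
restored = decode_yolo_target(row, col, target, 448, 448)
```

The network then only has to regress a handful of numbers per cell, which is why a single forward pass is enough.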
data:image/s3,"s3://crabby-images/6202d/6202de6c8fa3bd2924a43e820f84cd76f138201a" alt="notion image"
It is widely used for a reason.