CocoStuff—基於Deeplab訓練資料的標定工具【一、翻譯】（未完）

夜會美發表於2018-06-14

原文網址 : https://flycode.co/archives/252765

一、CocoStuff簡介

CocoStuff是一款為deeplab設計的，執行在Matlab中的語義標定工具，其標定結果和結合Deeplab訓練出的結果均為mat檔案格式，該專案原始碼已在github上進行開源。

二、說明

本文為系列部落格第一篇，主要對專案readme進行簡單的翻譯，主要是為了自己在學習踩坑過程中方便查閱說明，如果能幫到大家便是極好的。
*注：未完，部分只是先扔上來，將來會繼續完善。

筆者在探索之前並未在網上搜尋到關於CocoStuff的相關中文部落格，所以這可能是第一篇，有那裡不到位的請多多指教，互相學習。

三、翻譯

COCO-Stuff 10K資料集 v1.1（目前已過時）

作者：Holger Caesar, Jasper Uijlings, Vittorio Ferrari
CocoStuff—基於Deeplab訓練資料的標定工具【一、翻譯】（未完）

概述

歡迎來到COCO-Stuff資料集的官方主頁。COCO-Stuff新增了畫素級標定（標註）的流行的COCO資料集，這些標註可以用於一些像語義分割、物件檢測和影像字幕的場景理解任務。

亮點

10,000張來自COCO的複雜影像

稠密的畫素級標註

91個things類和91個stuff類

例項級標註來自COCO的的資料

物與物的複雜空間環境

每幅coco的圖片有5句說明

更新

2017年7月11日：新增了Resnet和VGG的Deeplab模型

2017年4月6日：資料集版本1.1：修改標籤指數

2017年3月31日：釋出JSON格式的標註

2017年3月9日：新增標籤層次指令碼

2017年3月8日：更正arXiv 檔案中的table 2

2017年1月10日：在標註工具中新增了提取SLICO超級畫素的指令碼

2016年12月12日：釋出資料集版本1.1和arXiv檔案

成果

目前釋出的COCO-Stuff-10K版本包括訓練和測試的標註，以及我們邀請使用者向我們報告他們的結果用來補充這張表。在不久的將來，我們會將COCO-Stuff擴充套件到COCO的所有影像中，並且我們會組織一個官方比賽，在比賽中的測試標註只會被組織者所知道。
點選此處檢視最新表格

Method	Source	Class-average accuracy	Global accuracy	Mean IOU	FW IOU
FCN-16s [3]	[1]	34.0%	52.0%	22.7%	–
Deeplab VGG-16 (no CRF) [4]	[1]	38.1%	57.8%	26.9%	–
FCN-8s [3]	[6]	38.5%	60.4%	27.2%	–
DAG-RNN + CRF [6]	[6]	42.8%	63.0%	31.2%	–
OHE + DC + FCN+ [5]	[5]	45.8%	66.6%	34.3%	51.2%
Deeplab ResNet (no CRF) [4]	–	45.5%	65.1%	34.4%	50.4%
W2V + DC + FCN+ [5]	[5]	45.1%	66.1%	34.7%	51.0%

資料集

Filename	Description	Size
cocostuff-10k-v1.1.zip	COCO-Stuff dataset v. 1.1, images and annotations	2.0 GB
cocostuff-10k-v1.1.json	COCO-Stuff dataset v. 1.1, annotations in JSON format (optional)	62.3 MB
cocostuff-labels.txt	A list of the 1+91+91 classes in COCO-Stuff	2.3 KB
cocostuff-readme.txt	This document	6.5 KB
Older files
cocostuff-10k-v1.0.zip	COCO-Stuff dataset version 1.0, including images and annotations	2.6 GB

用法

為了使用COCO-Stuff資料集，請按照下面的步驟：

從git上下載專案：git clone https://github.com/nightrome/cocostuff10k.git
在shell中開啟資料集：cd cocostuff10k
如果你有Matlab，執行下面的命令：
把程式碼檔案新增到Matlab路徑中：startup();
在Matlab中執行指令碼demo：demo_cocoStuff();
這個指令碼顯示影像、影像內容、內容加標註以及影像標題。
或者，執行下面的Linux命令或手動下載解壓資料集：
wget --directory-prefix=downloads http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/cocostuff-10k-v1.1.zip
unzip downloads/cocostuff-10k-v1.1.zip -d dataset/

MAT格式

COCO-Stuff的標註是儲存在每張圖對應的.mat檔案中，這些檔案的格式和Tighe等人使用的格式相同。每個檔案包括以下欄位：

S: 畫素化的標籤地圖的尺寸

names:COCO-Stuff中的東西的名字和類別，更多詳情請看Label Names & Indices

captions:圖片標題，由5個不同的人平均標註

regionMapStuff:與S大小相同的對映，恰中包含大約值的索引，1000個區域用來註釋影像。

regionLabelStuff:每個超級畫素級的物品標籤的列表，這些regionMapStuff的索引都對應regionLabelStuff的條目。

JSON格式

另外，我們還提供了JSON格式的標註，這些標註是從COCO中複製出來的。我們使用RLE編碼格式，將影像中呈現的每種東西的類別都標註了單個註釋。要獲取標註：
wget--directory-prefix=dataset/annotations-json http://calvin.inf.ed.ac.uk/wpcontent/uploads/data/cocostuffdataset/cocostuff-10k-v1.1.json

或者用這個指令碼從.mat檔案中提取它們

標籤名稱&指數

為了與COCO相容，版本1.1有91個東西類(1-91)，91個東西類(92-182)和一個類“未標記”(0)。請注意，Coco2015中的11個Thing類沒有任何分段註釋。課桌、門和鏡子既可以是東西，也可以是東西，因此既可以是可可，也可以是可可。為了避免混淆，我們在coco-thing中的那些類中新增了字尾“-東西”。類的完整列表可以找到這裡.。舊版的COCO-東西1.0有80個東西類(2-81個)，91個東西類(82-172個)和一個類“未標記”(1)

Label等級體系

標籤的層次結構儲存在CocoStuffClass‘中。為了使其視覺化，在matlab中執行CocoStuffClasses.showClassHierarchyStuffThings()`(也可用於簡單的東西和東西類)。輸出應該類似於下面的圖
CocoStuff—基於Deeplab訓練資料的標定工具【一、翻譯】（未完）

語義分割模型

為了鼓勵對stuff and things的進一步研究，我們提供了經過訓練的語義分割模型

DeepLab VGG-16

Use the following steps to download and setup the DeepLab [4] semantic segmentation model trained on COCO-Stuff. It requires deeplab-public-ver2, which is built on Caffe:

Install Cuda. I recommend version 7.0. For version 8.0 you will need to apply the fix described here in step 3.
Download deeplab-public-ver2: git submodule update --init models/deeplab/deeplab-public-ver2
Compile and configure deeplab-public-ver2 following the author`s instructions. Depending on your system setup you might have to install additional packages, but a minimum setup could look like this:

cd models/deeplab/deeplab-public-ver2
cp Makefile.config.example Makefile.config
Optionally add CuDNN support or modify library paths in the Makefile.
make all -j8
cd ../..

Configure the COCO-Stuff dataset:

Create folders: mkdir models/deeplab/deeplab-public-ver2/cocostuff && mkdir models/deeplab/deeplab-public-ver2/cocostuff/data
Create a symbolic link to the images: cd models/deeplab/cocostuff/data && ln -s ../../../../dataset/images images && cd ../../../..
Convert the annotations by running the Matlab script: startup(); convertAnnotationsDeeplab();

Download the base VGG-16 model:

wget --directory-prefix=models/deeplab/cocostuff/model/deeplabv2_vgg16 http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/deeplabv2_vgg16_init.caffemodel

Run cd models/deeplab && ./run_cocostuff_vgg16.sh to train and test the network on COCO-Stuff.

DeepLab ResNet 101

The default Deeplab model performs center crops of size 513*513 pixels of an image, if any side is larger than that. Since we want to segment the whole image at test time, we choose to resize the images to 513×513, perform the semantic segmentation and then rescale it elsewhere. Note that without the final step, the performance might differ slightly.
Follow steps 1-4 of the DeepLab VGG-16 section above.
Download the base ResNet model:

wget --directory-prefix=models/deeplab/cocostuff/model/deeplabv2_resnet101 http://calvin.inf.ed.ac.uk/wp-content/uploads/data/cocostuffdataset/deeplabv2_resnet101_init.caffemodel

Rescale the images and annotations:

cd models/deeplab
python rescaleImages.py
python rescaleAnnotations.py

Run ./run_cocostuff_resnet101.sh to train and test the network on COCO-Stuff.

標註工具

在[1]中，我們提出了一個簡單而有效的文字註釋工具，用於註釋可可資料集。它使用畫筆工具對SLICO超級畫素進行註釋(使用帶有內容標籤的codeofAchantet al.)]進行預計算)。這些註釋被現有的來自COCO的畫素級事物註釋所覆蓋。
我們提供了註釋工具的基本版本：

準備需要的資料:
- 指定一個username annotator/data/input/user.txt.
- 建立一個圖片list檔案annotator/data/input/imageLists/<user>.list.
- 在Matlab中提取所有圖片的things標籤: extractThings().
- 在Matlab中提取所有圖片的超畫素: extractSLICOSuperpixels().
- 在這個檔案的最上面的引數設定是否啟用超畫素、things標籤、多邊形繪圖CocoStuffAnnotator.m.
在Matlab中執行標註工具: CocoStuffAnnotator();
- 工具把.mat的標籤檔案放在這裡annotator/data/output/annotations.
- 通過在Matlab中執行這個指令碼將標註預覽為.png格式 annotator/code/exportImages.m.這些預覽圖會存在這裡 annotator/data/output/preview.

CocoStuff—基於Deeplab訓練資料的標定工具【三、標註工具的使用】
2018-06-14
DeepLab 使用 Cityscapes 資料集訓練模型
2019-04-10
模型
基於PYQT5的截圖翻譯工具
2022-05-31
QT
[翻譯] 使用 TensorFlow 進行分散式訓練
2022-04-10
分散式
基於pytorch實現Resnet對本地資料集的訓練
2022-03-19
PyTorch
飛槳帶你瞭解：基於百科類資料訓練的 ELMo 中文預訓練模型
2019-06-06
模型
[論文翻譯] 分散式訓練 Parameter sharding 之 ZeRO
2022-01-11
分散式
[翻譯]基於redis的分散式鎖
2018-12-02
Redis分散式
[翻譯]一個新式的基於文字的瀏覽器 Browsh
2018-07-15
瀏覽器
基於word2vec訓練詞向量(一)
2018-04-11
[翻譯] 除錯 Rxjs（一）：工具
2018-12-17
除錯JS
資料集訓練
2024-03-18
雲原生的彈性 AI 訓練系列之一：基於 AllReduce 的彈性分散式訓練實踐
2021-03-16
AI分散式
關於AI訓練資料侵權的碎碎念
2024-04-05
AI
資料結構的練習day2(未完待續)
2024-04-23
資料結構
練手：一個基於Vue的上下滑動翻月日曆元件
2018-11-25
Vue元件
訓練一個目標檢測模型
2023-12-13
模型
資料集訓練+1
2024-03-18
fashion資料集訓練
2020-11-16
用Python做一個簡單的翻譯工具
2020-07-17
Python
基於高質量訓練資料，GPT-4 Turbo更出色更強大
2023-11-22
GPT
【翻譯】基於 Cypress 測試 React 應用
2018-03-15
React
Github Statistics 一個基於 React 的 GitHub 資料統計工具
2019-12-25
GithubReact
[非專業翻譯] Mapster - 基於規則的對映
2021-06-29
Java 英語單詞本 (基於有道翻譯)
2020-10-19
Java
基於 Fluid+JindoCache 加速大模型訓練的實踐
2024-02-28
UI大模型
獲取和生成基於TensorFlow的MobilNet預訓練模型
2020-11-03
模型
一款基於 Java 開發的微信資料分析工具！
2024-11-18
Java
【翻譯轉載】如果你想要構建一套基於ECS的GUI框架
2024-08-21
GUI框架
資料庫第九周翻譯
2018-05-06
資料庫
【NLP學習其四】如何構建自己用於訓練的資料集？什麼是詞性標註？
2021-08-08
詞性標註
Mxnet R FCN 訓練自己的資料集
2018-09-28
前端工程基礎知識點--Browserslist (基於官方文件翻譯）
2018-09-04
前端
前端工程基礎知識點–Browserslist (基於官方文件翻譯）
2019-03-04
前端
資料分析 | 基於智慧標籤，精準管理資料
2020-05-30
官方翻譯 | 有關基於文件的iOS應用開發
2019-02-01
iOS
記一次翻譯工具的開發-有了它，實現實時翻譯還遠嗎？
2020-10-10
動手學Avalonia：基於SemanticKernel與矽基流動構建AI聊天與翻譯工具
2024-07-03
AI