YOLOv3學習筆記之資料處理

baidu_huihui發表於2020-12-12

原文網址 : https://blog.csdn.net/baidu_41617231/article/details/111086841

YOLOv3學習筆記之資料處理

dataset中的資料處理

1. 圖片處理

將讀取的圖片進行reshape，變為正方形

def pad_to_square(img, pad_value):
    c, h, w = img.shape
    dim_diff = np.abs(h - w)
    # (upper / left) padding and (lower / right) padding
    pad1, pad2 = dim_diff // 2, dim_diff - dim_diff // 2
    # Determine padding
    pad = (0, 0, pad1, pad2) if h <= w else (pad1, pad2, 0, 0)
    #(左填充，右填充，上填充，下填充)
    # Add padding
    img = F.pad(img, pad, "constant", value=pad_value)

    return img, pad

圖片變為正方形

2. 標籤處理

labels中txt檔案標籤資料的來源。在json檔案中給出的是原始圖片的左上角座標和寬高，但是在下載的labels資料夾中的txt檔案，標籤是已經被處理過了，可以看下面對比圖。

處理方式如下：

json中得到的值box左上角座標(w,y)，box寬高w,h，原圖寬高w_factor,h_factor
將左上角座標變為中心點座標(x1,y1)
將中心點座標，寬高等比例縮放，即得到一個相對值(x2,y2,w2,h2)

標籤label的處理方式：

在這裡插入圖片描述

另外還有使用打標工具labelImg得到的資料也是同樣的處理方式，程式碼驗證如下：

import cv2
import numpy as np

img_path='F:/labelimg_test/COCO_train2014_000000000036.jpg'
label_path='F:/labelimg_test/COCO_train2014_000000000036.txt'

image = cv2.imread(img_path)
print(image.shape) #(576, 704, 3) (H,W,C)
h_factor, w_factor = image.shape[:2]
print('原始影像尺寸h:',h_factor,'w:',w_factor)

boxes =np.loadtxt(label_path).reshape(-1,5)

for i in boxes:
    print('label中的標籤：',)
    print('x_label:', i[1],'y_label:', i[2],'w_label:', i[3],'h_label:', i[4])

    x,y,w,h=i[1],i[2],i[3],i[4]
    x=x*w_factor
    y=y*h_factor
    w=w*w_factor
    h=h*h_factor

    x=x-w/2
    y=y-h/2

    print('試著將label中標籤還原到原圖上：')
    print('x:', x,'y: ', y,'w:', w,'h:', h)
    print('\n')

    #左上角座標（下，y），右下角座標（x+w,y+h）
    draw_1=cv2.rectangle(image, (int(x),int(y)), (int(x+w),int(y+h)), color=(0,255,0),thickness=2, lineType=8)

cv2.imwrite("vertical_flip.jpg", draw_1)  #將畫過矩形框的圖片儲存到當前資料夾
cv2.imshow("draw_0", draw_1)  #顯示畫過矩形框的圖片
cv2.waitKey(0)
cv2.destroyWindow("draw_0")

在這裡插入圖片描述

在dataset中label的處理

將label中的標籤(boxes[1],boxes[2],boxes[3],boxes[4])還原得到真實圖片上的標籤，即左上角座標和寬高
找到右下腳座標，即得到(x1,y1,x2,y2)
原圖片經過填充後，相當於圖片的座標系發生了變化，因此呢，標註框也需要同樣的填充，保證相對位置不變
最後再經過上文提到的標籤變換方法，得到中心點座標和寬高，再除以填充後的寬高padded_w,padded_h，得到相對值

# Extract coordinates for unpadded + unscaled image
            x1 = w_factor * (boxes[:, 1] - boxes[:, 3] / 2)
            y1 = h_factor * (boxes[:, 2] - boxes[:, 4] / 2)
            x2 = w_factor * (boxes[:, 1] + boxes[:, 3] / 2)
            y2 = h_factor * (boxes[:, 2] + boxes[:, 4] / 2)

            # Adjust for added padding
            x1 += pad[0]
            y1 += pad[2]
            x2 += pad[1]
            y2 += pad[3]

            # Returns (x, y, w, h)
            boxes[:, 1] = ((x1 + x2) / 2) / padded_w
            boxes[:, 2] = ((y1 + y2) / 2) / padded_h
            boxes[:, 3] *= w_factor / padded_w
            boxes[:, 4] *= h_factor / padded_h

對於一個batch資料而言，不同圖片的label都是連線在一起的，因此需要再每一個box資料前新增一個index，用來表示該box屬於batch[index]圖片的，該功能是在自定義的collate_fn(self, batch)函式中實現

給每一個box框新增索引
多尺度訓練，在同一個batch中圖片大小要一致，需要將圖片進行resize到相同的尺寸，在resize時，每10個batch進行一次多尺度訓練，在[min=416-3x32,max=416+3x32]之間隨機選擇一個目標尺寸值
將圖片，標籤以及路徑連線起來，最後返回
paths=[path0,path1,…]
imgs=[batch_size,3,w,h]
targets=[N,6]
預設的collate_fn最後返回的是用棧存起來的，自定義collate返回的使用列表存起來的

    def collate_fn(self, batch):
        """
        :param batch: list 一個批次的資料 [(path,img,target),(),...]
        targets :tuple(target0,target1,...)
        :return:
        """
        paths, imgs, targets = list(zip(*batch)) #*表示以元組形式接收引數  zip打包為元組的列表
       
        # Remove empty placeholder targets
        targets = [boxes for boxes in targets if boxes is not None]
        # Add sample index to targets
        for i, boxes in enumerate(targets):
            boxes[:, 0] = i
        targets = torch.cat(targets, 0)
        # print('拼接後的targets:',targets.shape)

        # Selects new image size every tenth batch
        if self.multiscale and self.batch_count % 10 == 0:
            self.img_size = random.choice(range(self.min_size, self.max_size + 1, 32))
        # Resize images to input shape
        imgs = torch.stack([resize(img, self.img_size) for img in imgs])
        self.batch_count += 1
        return paths, imgs, targets

再經過dataloader將dataset資料進行包裝，輸入到模型中。

機器學習筆記---資料預處理
2022-04-30
機器學習筆記
Python深度學習（處理文字資料）--學習筆記（十二）
2020-11-12
Python深度學習筆記
swoft 學習筆記之異常處理
2019-08-13
筆記
機器學習演算法筆記之6：資料預處理
2020-04-06
機器學習演算法筆記
【Pandas學習筆記02】-資料處理高階用法
2021-12-01
筆記
【Pandas學習筆記02】處理資料實用操作
2021-11-26
筆記
React學習筆記-事件處理
2020-12-26
React筆記事件
Vue學習筆記之事件處理
2018-06-20
Vue筆記事件
大資料之 Hadoop學習筆記
2018-12-14
大資料Hadoop筆記
JSP筆記-XML 資料處理
2021-08-06
JS筆記XML
大資料學習之Hadoop如何高效處理大資料
2018-09-20
大資料Hadoop
Flutter學習筆記（32）--PointerEvent事件處理
2020-06-18
Flutter筆記事件
vue學習筆記3-事件處理
2024-06-30
Vue筆記事件
swoft 學習筆記之資料庫操作
2019-08-29
筆記資料庫
資訊理論理論學習筆記
2019-02-22
筆記
OpenCV影像處理學習筆記-Day1
2020-09-28
OpenCV筆記
SpringMVC學習筆記10-異常處理
2020-12-16
SpringMVC筆記
Python 3 學習筆記之——資料型別
2018-10-23
Python筆記資料型別
Python學習筆記|Python之pycache資料夾
2018-12-21
Python筆記
資料庫學習筆記之查詢表
2021-01-03
資料庫筆記
深度學習--資料預處理
2024-07-28
深度學習
達夢資料庫日常管理之問題處理筆記1
2021-12-21
資料庫筆記
資料庫學習筆記
2018-10-18
資料庫筆記
【大資料】離線批處理計算MapReduce | 複習筆記
2020-12-11
大資料筆記
React學習筆記之雙向資料繫結
2020-11-21
React筆記
深度學習——資料預處理篇
2019-02-18
深度學習
學習筆記（1):Python零基礎入門到實戰-資料科學原理與資料處理流程
2020-11-16
筆記Python資料科學
Dotnetty學習筆記——自定義初始化處理器
2024-09-07
Netty筆記
Unity學習筆記--資料持久化之PlayerPrefs的使用
2023-11-19
Unity筆記持久化
飛機的 PHP 學習筆記之資料庫篇
2020-01-30
PHP筆記資料庫
PHP 資料加密 (學習筆記)
2019-07-30
PHP加密筆記
1029學習筆記資料庫
2020-11-03
筆記資料庫
資料結構學習筆記
2018-04-22
資料結構筆記
python學習筆記：資料庫
2018-04-19
Python筆記資料庫
MySQL資料庫學習筆記
2020-12-10
MySql資料庫筆記
小王的學習筆記（十四）——vue資料渲染、事件處理、表單輸入與繫結
2020-11-19
筆記Vue事件
機器學習一：資料預處理
2019-02-27
機器學習
SpringMVC入門學習---資料的處理
2019-05-11
SpringMVC

YOLOv3學習筆記之資料處理

YOLOv3學習筆記之資料處理

dataset中的資料處理

1. 圖片處理

2. 標籤處理

相關文章