聚類kmeans演算法在yolov3中的應用

sdu20112013發表於2019-05-28

原文網址 : https://www.cnblogs.com/sdu20112013/p/10937717.html

yolov3 kmeans

yolov3在做boundingbox預測的時候,用到了anchor boxes.這個anchors的含義即最有可能的object的width,height.事先通過聚類得到.比如某一個畫素單元,我想對這個畫素單元預測出一個object,圍繞這個畫素單元,可以預測出無數種object的形狀,並不是隨便預測的,要參考anchor box的大小,即從已標註的資料中通過聚類統計到的最有可能的object的形狀.

.cfg檔案內的配置如下:

[yolo]
mask = 3,4,5
anchors = 10,14,  23,27,  37,58,  81,82,  135,169,  344,319

在用我們自己的資料做訓練的時候,要先修改anchors,匹配我們自己的資料.anchors大小通過聚類得到.

通俗地說,聚類就是把捱得近的資料點劃分到一起.
kmeans演算法的思想很簡單

隨便指定k個cluster
把點劃分到與之最近的一個cluster
上面得到的cluster肯定是不好的,因為一開始的cluster是亂選的嘛
更新每個cluster為當前cluster的點的均值.
這時候cluster肯定變準了,為什麼呢?比如當前這個cluster裡有3個點,2個點靠的很近,還有1個點離得稍微遠點,那取均值的話,那相當於靠的很近的2個點有更多投票權,新算出來的cluster的中心會更加靠近這兩個點.你要是非要抬槓:那萬一一開始我隨機指定的cluster中心點就特別準呢,重新取均值反而把中心點弄的不準了?事實上這是kmeans的一個缺陷:比較依賴初始的k個cluster的位置.選擇不恰當的k值可能會導致糟糕的聚類結果。這也是為什麼要進行特徵檢查來決定資料集的聚類數目了。
重新執行上述過程
- 把點劃分到與之最近的一個cluster
- 更新每個cluster為當前cluster的點的均值
不斷重複上述過程,直至cluster中心變化很小

yolov3要求的label檔案格式

<object-class> <x_center> <y_center> <width> <height>
Where:
<object-class> - integer object number from 0 to (classes-1)
<x_center> <y_center> <width> <height> - float values relative to width and height of image, it can be equal from (0.0 to 1.0]
> for example: <x> = <absolute_x> / <image_width> or <height> = <absolute_height> / <image_height>
atention: <x_center> <y_center> - are center of rectangle (are not top-left corner)

舉例:
1 0.716797 0.395833 0.216406 0.147222
所有的值都是比例.(中心點x,中心點y,目標寬,目標高)

kmeans實現

一般來說,計算樣本點到質心的距離的時候直接算的是兩點之間的距離,然後將樣本點劃歸為與之距離最近的一個質心.
在yolov3中樣本點的資料是有具體的業務上的含義的,我們其實最終目的是想知道最有可能的object對應的bounding box的形狀是什麼樣子的. 所以這個距離的計算我們並不是直接算兩點之間的距離,我們計算兩個box的iou,即2個box的相似程度.d=1-iou(box1,box_cluster). 這樣d越小,說明box1與box_cluster越類似.將box劃歸為box_cluster.

資料載入

    f = open(args.filelist)
  
    lines = [line.rstrip('\n') for line in f.readlines()]
    
    annotation_dims = []

    size = np.zeros((1,1,3))
    for line in lines:
                    
        #line = line.replace('images','labels')
        #line = line.replace('img1','labels')
        line = line.replace('JPEGImages','labels')        
        

        line = line.replace('.jpg','.txt')
        line = line.replace('.png','.txt')
        print(line)
        f2 = open(line)
        for line in f2.readlines():
            line = line.rstrip('\n')
            w,h = line.split(' ')[3:]            
            #print(w,h)
            annotation_dims.append(tuple(map(float,(w,h))))
    annotation_dims = np.array(annotation_dims)

看著一大段,其實重點就一句

w,h = line.split(' ')[3:]            
annotation_dims.append(tuple(map(float,(w,h))))

這裡涉及到了python的語法,map用法https://www.runoob.com/python/python-func-map.html
這樣就生成了一個N*2矩陣. N代表你的樣本個數.

定義樣本點到質心點的距離
計算樣本x代表的box和k個質心box的IOU.(即比較box之間的形狀相似程度).
這裡涉及到一個IOU的概念:即交併集比例.交叉面積/總面積.

def IOU(x,centroids):
    similarities = []
    k = len(centroids)
    for centroid in centroids:
        c_w,c_h = centroid
        w,h = x
        if c_w>=w and c_h>=h:     #box(c_w,c_h)完全包含box(w,h)
            similarity = w*h/(c_w*c_h)
        elif c_w>=w and c_h<=h:   #box(c_w,c_h)寬而扁平
            similarity = w*c_h/(w*h + (c_w-w)*c_h)
        elif c_w<=w and c_h>=h:
            similarity = c_w*h/(w*h + c_w*(c_h-h))
        else: #means both w,h are bigger than c_w and c_h respectively
            similarity = (c_w*c_h)/(w*h)
        similarities.append(similarity) # will become (k,) shape
    return np.array(similarities)

kmeans實現

def kmeans(X,centroids,eps,anchor_file):
    
    N = X.shape[0]
    iterations = 0
    k,dim = centroids.shape
    prev_assignments = np.ones(N)*(-1)    
    iter = 0
    old_D = np.zeros((N,k)) #距離矩陣  N個點,每個點到k個質心 共計N*K個距離

    while True:
        D = [] 
        iter+=1           
        for i in range(N):
            d = 1 - IOU(X[i],centroids)  #d是一個k維的   
            D.append(d)   
        D = np.array(D) # D.shape = (N,k)
        
        print("iter {}: dists = {}".format(iter,np.sum(np.abs(old_D-D))))
            
        #assign samples to centroids 
        assignments = np.argmin(D,axis=1) #返回每一行的最小值的下標.即當前樣本應該歸為k個質心中的哪一個質心.
        
        if (assignments == prev_assignments).all() :  #質心已經不再變化
            print("Centroids = ",centroids)
            write_anchors_to_file(centroids,X,anchor_file)
            return

        #calculate new centroids   
        centroid_sums=np.zeros((k,dim),np.float)  #(k,2)
        for i in range(N):
            centroid_sums[assignments[i]]+=X[i]        #將每一個樣本劃分到對應質心
        for j in range(k):            
            centroids[j] = centroid_sums[j]/(np.sum(assignments==j)) #更新質心
        
        prev_assignments = assignments.copy()     
        old_D = D.copy()

計算每個樣本點到每一個cluster質心的距離 d = 1- IOU(X[i],centroids)表示樣本點到每個cluster質心的距離.
np.argmin(D,axis=1)得到每一個樣本點離哪個cluster質心最近
argmin函式用法參考https://docs.scipy.org/doc/numpy/reference/generated/numpy.argmin.html
計算每一個cluster中的樣本點總和,取平均,更新cluster質心.

for i in range(N):
    centroid_sums[assignments[i]]+=X[i]        #將每一個樣本劃分到對應質心
for j in range(k):            
    centroids[j] = centroid_sums[j]/(np.sum(assignments==j)) #更新質心

不斷重複上述過程,直到質心不再變化聚類完成.

儲存聚類得到的anchor box大小

def write_anchors_to_file(centroids,X,anchor_file):
    f = open(anchor_file,'w')
    
    anchors = centroids.copy()
    print(anchors.shape)

    for i in range(anchors.shape[0]):
        anchors[i][0]*=width_in_cfg_file/32.
        anchors[i][1]*=height_in_cfg_file/32.
         

    widths = anchors[:,0]
    sorted_indices = np.argsort(widths)

    print('Anchors = ', anchors[sorted_indices])
        
    for i in sorted_indices[:-1]:
        f.write('%0.2f,%0.2f, '%(anchors[i,0],anchors[i,1]))

    #there should not be comma after last anchor, that's why
    f.write('%0.2f,%0.2f\n'%(anchors[sorted_indices[-1:],0],anchors[sorted_indices[-1:],1]))
    
    f.write('%f\n'%(avg_IOU(X,centroids)))
    print()

由於yolo要求的label檔案中,填寫的是相對於width,height的比例.所以得到的anchor box的大小要乘以模型輸入圖片的尺寸.
上述程式碼裡

        anchors[i][0]*=width_in_cfg_file/32.
        anchors[i][1]*=height_in_cfg_file/32.

這裡除以32是yolov2的演算法要求. yolov3實際上不需要.參見以下連結https://github.com/pjreddie/darknet/issues/911

for Yolo v2: width=704 height=576 in cfg-file
./darknet detector calc_anchors data/hand.data -num_of_clusters 5 -width 22 -height 18 -show
for Yolo v3: width=704 height=576 in cfg-file
./darknet detector calc_anchors data/hand.data -num_of_clusters 9 -width 704 -height 576 -show
And you can use any images with any sizes.

完整程式碼見https://github.com/AlexeyAB/darknet/tree/master/scripts
用法:python3 gen_anchors.py -filelist ../build/darknet/x64/data/park_train.txt

KMeans演算法與GMM混合高斯聚類
2023-04-16
演算法聚類
聚類演算法在 D2C 佈局中的應用
2022-02-17
聚類演算法
利用python的KMeans和PCA包實現聚類演算法
2019-09-15
PythonPCA聚類演算法
Kmeans如何初始化聚類中心
2018-10-12
聚類
ML.NET技術研究系列-2聚類演算法KMeans
2019-07-14
聚類演算法
KMeans演算法全面解析與應用案例
2023-11-16
演算法
Spark中的聚類演算法
2020-09-27
Spark聚類演算法
統計分析和智慧聚類在遊戲資料中的應用
2021-01-20
聚類遊戲
聚類演算法
2020-04-26
聚類演算法
資料探勘之KMeans演算法應用與簡單理解
2019-07-23
演算法
聚類(part3)--高階聚類演算法
2020-10-11
聚類演算法
聚類之K均值聚類和EM演算法
2019-05-13
聚類演算法
文字圖Tranformer在文字分類中的應用
2022-01-31
ORM文字分類
吳恩達《Machine Learning》精煉筆記 8：聚類 KMeans 及其 Python實現
2021-01-16
吳恩達Mac筆記聚類Python
聚類演算法綜述
2018-12-09
聚類演算法
OPTICS聚類演算法原理
2020-05-14
聚類演算法
初探DBSCAN聚類演算法
2021-05-22
聚類演算法
14聚類演算法-程式碼案例六-譜聚類(SC)演算法案例
2018-12-16
聚類演算法
09聚類演算法-層次聚類-CF-Tree、BIRCH、CURE
2018-12-11
聚類演算法
04聚類演算法-程式碼案例一-K-means聚類
2018-12-08
聚類演算法
python Kmeans演算法解析
2018-11-05
Python演算法
可伸縮聚類演算法綜述（可伸縮聚類演算法開篇）
2018-10-30
聚類演算法
聚類模型的演算法效能評價
2024-06-27
聚類模型演算法
深度聚類演算法敘談
2021-05-18
聚類演算法
深度聚類演算法淺談
2021-04-15
聚類演算法
AI 演算法在大資料治理中的應用
2023-03-09
AI演算法大資料
【機器學習】--譜聚類從初始到應用
2018-04-06
機器學習聚類
用電負荷相關聚類演算法總結（1）
2018-09-09
聚類演算法
推薦系統中的產品聚類：一種文字聚類的方法
2020-01-02
聚類
Kmeans演算法優缺點
2020-02-23
演算法
Calendar類在Java中的應用與日期時間處理
2024-08-08
Java
聚類演算法——DBSCAN演算法原理及公式
2020-05-20
聚類演算法公式
【Python機器學習實戰】聚類演算法（1）——K-Means聚類
2021-12-06
Python機器學習聚類演算法
應用聚類模型獲得聊天機器人語料
2018-04-12
聚類模型機器人
線性蒙皮分解演算法及其在遊戲中的應用
2020-03-04
演算法遊戲
AI 演算法在視訊可分級編碼中的應用
2021-11-01
AI演算法
【python資料探勘課程】二十四.KMeans文字聚類分析互動百科語料
2018-07-06
Python聚類
【長篇乾貨】深度學習在文字分類中的應用
2018-04-04
深度學習文字分類

聚類kmeans演算法在yolov3中的應用

yolov3 kmeans

yolov3要求的label檔案格式

kmeans實現

資料載入

kmeans實現

儲存聚類得到的anchor box大小

相關文章