在python中如何打亂資料？

在玩python學習機器時，對於那種對隨機性不太敏感的模型，理論上說可以不打亂。但敏感不敏感也跟資料量級，複雜度，演算法內部計算機制都有關，目前並沒有一個經緯分明的演算法隨機度敏感度列表。既然打亂資料並不會得到一個更差的結果，一般推薦的做法就是打亂全量資料。那怎麼打亂呢？今天小編就教大家在python中打亂資料集和標籤，來看看吧。

方法一、打亂索引Index

import numpy as np

index = [i for i in range(len(test_data))] # test_data為測試資料

np.random.shuffle(index) # 打亂索引

test_data = test_data[index]

test_label = test_label[index]

方法二：透過陣列來shuffle來打亂

image_list=[]           # list of images

label_list=[]           # list of labels

 

temp = np.array([image_list, label_list])

temp = temp.transpose()

np.random.shuffle(temp)

 

images = temp[:, 0]     # array of images   (N,)

labels = temp[:, 1]

方法三：透過隨機數打亂

import numpy as np

np.random.seed(12)

np.random.shuffle(test_data)

np.random.seed(12)

np.random.shuffle(test_label)

以上就是小編整理的用python打亂資料的方法，如果你在玩python機器學習的話，可以採取以上方法打亂資料哦~更多學習推薦

在python中如何打亂資料？

相關文章