PyTorch - DataLoader

Posted by kingchou007 on 2024-04-01

Basically, the DataLoader works with a Dataset object, so to use the DataLoader you need to get your data into this Dataset wrapper. To do this you only need to implement two magic methods: __getitem__ and __len__. __getitem__ takes an index and returns an (x, y) pair; __len__ just returns the size of the data. And that's that. [1]
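
As a minimal sketch of such a wrapper (the class name PairDataset is illustrative, not an existing API):

import torch
from torch.utils.data import Dataset

class PairDataset(Dataset):
    """Hypothetical Dataset wrapping two equally long tensors."""
    def __init__(self, x, y):
        self.x = x
        self.y = y

    def __getitem__(self, index):
        # Return one (x, y) pair for the given index
        return self.x[index], self.y[index]

    def __len__(self):
        # Number of samples in the dataset
        return len(self.x)

For plain tensor pairs like the ones below, PyTorch already ships this pattern as torch.utils.data.TensorDataset, which is what the following code uses.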

How the DataLoader reads data

import torch
from torch.utils.data import DataLoader, TensorDataset

# Define some sample data
X = torch.randn(5, 3)  # inputs
y = torch.randn(5, 3)  # labels

# Wrap the tensors in a Dataset so the DataLoader can index them
dataset = TensorDataset(X, y)

print(X, y)

Our data looks like this:

tensor([[-0.5138, -1.7766, -0.6183],
        [ 0.2235,  0.1974,  0.2892],
        [ 1.6249, -0.5768, -1.5081],
        [ 0.5972, -0.1788,  0.7579],
        [ 1.3844, -0.5480, -1.5612]]) 
tensor([[-0.5818,  0.1668,  0.5073],
        [-1.7707, -0.2907,  1.4918],
        [ 1.2157, -2.8250, -0.0247],
        [ 0.2748,  0.1086,  1.6052],
        [-0.7613, -1.3326, -0.5267]])

Then we read it through the DataLoader:

# batch_size=1 means we read one sample at a time
# shuffle=True means the data is reshuffled at the start of every epoch
dataloader = DataLoader(dataset, batch_size=1, shuffle=True)

for i, (batch_x, batch_y) in enumerate(dataloader):
    print(f"Batch {i}: input {batch_x}, label {batch_y}")

We get:

Batch 0: input tensor([[-0.5138, -1.7766, -0.6183]]), label tensor([[-0.5818,  0.1668,  0.5073]])
Batch 1: input tensor([[ 0.5972, -0.1788,  0.7579]]), label tensor([[0.2748, 0.1086, 1.6052]])
Batch 2: input tensor([[ 1.6249, -0.5768, -1.5081]]), label tensor([[ 1.2157, -2.8250, -0.0247]])
Batch 3: input tensor([[ 1.3844, -0.5480, -1.5612]]), label tensor([[-0.7613, -1.3326, -0.5267]])
Batch 4: input tensor([[0.2235, 0.1974, 0.2892]]), label tensor([[-1.7707, -0.2907,  1.4918]])
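
For comparison, a larger batch_size stacks several rows into each batch along a new leading dimension. A quick sketch, reusing the dataset above (the shapes in the comments are what it would print):

dataloader = DataLoader(dataset, batch_size=2, shuffle=False)

for i, (batch_x, batch_y) in enumerate(dataloader):
    print(f"Batch {i}: input shape {tuple(batch_x.shape)}, label shape {tuple(batch_y.shape)}")

# Batch 0: input shape (2, 3), label shape (2, 3)
# Batch 1: input shape (2, 3), label shape (2, 3)
# Batch 2: input shape (1, 3), label shape (1, 3)  <- last batch is smaller

By default the final, smaller batch is kept; pass drop_last=True to discard it.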

Is the order in which samples come out of the DataLoader the same as the order they went in?
Answer: Not if shuffle=True: the data is randomly reshuffled at the start of every epoch, which is why the batches above are out of order. With shuffle=False, the original order is preserved.
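
A quick way to verify this yourself, as a sketch building on the dataset above:

# With shuffle=False, the DataLoader walks indices 0..len-1 in order
ordered = DataLoader(dataset, batch_size=1, shuffle=False)
assert all(torch.equal(bx[0], X[i]) for i, (bx, by) in enumerate(ordered))

# With shuffle=True the order changes (re-randomized each epoch),
# but every input still arrives paired with its own label
shuffled = DataLoader(dataset, batch_size=1, shuffle=True)
for bx, by in shuffled:
    j = (X == bx[0]).all(dim=1).nonzero().item()  # the row's original index
    assert torch.equal(by[0], y[j])               # its label came along with it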
