Pytorch | Tutorial-03 Transforms

Published by 一碗给力嗯 on 2024-03-20

This is a Chinese translation of the tutorial from the official PyTorch website.

Data does not always come in the final processed form required for training machine learning algorithms. We use transforms to perform some manipulation of the data and make it suitable for training.

All TorchVision datasets have two parameters: transform to modify the features and target_transform to modify the labels. Both accept callables containing the transformation logic. The torchvision.transforms module offers several commonly used transforms out of the box.

The FashionMNIST features are in PIL Image format, and the labels are integers. For training, we need the features as normalized tensors and the labels as one-hot encoded tensors. To make these transformations, we use ToTensor and Lambda.

import torch
from torchvision import datasets
from torchvision.transforms import ToTensor, Lambda

ds = datasets.FashionMNIST(
    root="data",
    train=True,
    download=True,
    transform=ToTensor(),
    target_transform=Lambda(lambda y: torch.zeros(10, dtype=torch.float).scatter_(0, torch.tensor(y), value=1))
)

Output:

Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/train-images-idx3-ubyte.gz
Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/train-images-idx3-ubyte.gz to data/FashionMNIST/raw/train-images-idx3-ubyte.gz

  0%|          | 0/26421880 [00:00<?, ?it/s]
  0%|          | 65536/26421880 [00:00<01:12, 362470.31it/s]
  1%|          | 229376/26421880 [00:00<00:38, 681259.72it/s]
  4%|3         | 950272/26421880 [00:00<00:11, 2185553.59it/s]
 15%|#4        | 3833856/26421880 [00:00<00:02, 7599317.20it/s]
 34%|###4      | 9109504/26421880 [00:00<00:00, 18310296.11it/s]
 46%|####5     | 12091392/26421880 [00:00<00:00, 17936658.84it/s]
 68%|######7   | 17924096/26421880 [00:01<00:00, 22974578.28it/s]
 89%|########9 | 23592960/26421880 [00:01<00:00, 25758355.11it/s]
100%|##########| 26421880/26421880 [00:01<00:00, 18198564.66it/s]
Extracting data/FashionMNIST/raw/train-images-idx3-ubyte.gz to data/FashionMNIST/raw

Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/train-labels-idx1-ubyte.gz
Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/train-labels-idx1-ubyte.gz to data/FashionMNIST/raw/train-labels-idx1-ubyte.gz

  0%|          | 0/29515 [00:00<?, ?it/s]
100%|##########| 29515/29515 [00:00<00:00, 325487.35it/s]
Extracting data/FashionMNIST/raw/train-labels-idx1-ubyte.gz to data/FashionMNIST/raw

Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/t10k-images-idx3-ubyte.gz
Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/t10k-images-idx3-ubyte.gz to data/FashionMNIST/raw/t10k-images-idx3-ubyte.gz

  0%|          | 0/4422102 [00:00<?, ?it/s]
  1%|1         | 65536/4422102 [00:00<00:12, 362947.95it/s]
  5%|5         | 229376/4422102 [00:00<00:06, 682324.89it/s]
 21%|##1       | 950272/4422102 [00:00<00:01, 2189897.25it/s]
 87%|########6 | 3833856/4422102 [00:00<00:00, 7611069.08it/s]
100%|##########| 4422102/4422102 [00:00<00:00, 6093636.48it/s]
Extracting data/FashionMNIST/raw/t10k-images-idx3-ubyte.gz to data/FashionMNIST/raw

Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/t10k-labels-idx1-ubyte.gz
Downloading http://fashion-mnist.s3-website.eu-central-1.amazonaws.com/t10k-labels-idx1-ubyte.gz to data/FashionMNIST/raw/t10k-labels-idx1-ubyte.gz

  0%|          | 0/5148 [00:00<?, ?it/s]
100%|##########| 5148/5148 [00:00<00:00, 39985698.13it/s]
Extracting data/FashionMNIST/raw/t10k-labels-idx1-ubyte.gz to data/FashionMNIST/raw

ToTensor()

ToTensor converts a PIL Image or NumPy ndarray into a FloatTensor and scales the image's pixel intensity values into the range [0., 1.].

Lambda Transforms

Lambda transforms apply any user-defined lambda function. Here, we define a function to turn the integer label into a one-hot encoded tensor. It first creates a zero tensor of size 10 (the number of labels in our dataset) and then calls scatter_, which assigns value=1 at the index given by the label y.

target_transform = Lambda(lambda y: torch.zeros(
    10, dtype=torch.float).scatter_(dim=0, index=torch.tensor(y), value=1))
