Machine Learning (6) - 關於 Logistic Regression (Multiclass Classification) 的小練習

Rachel發表於2019-04-14

原文網址 : https://learnku.com/articles/27482?order_by=created_at&

Iris flower data set 是關於一種花的資料集. 這種花有三個品種, 分別是 setosa, virginica 和 versicolor. 每朵花都有兩種花瓣(sepals 和 petals).早在 20 世紀 30 年代, 一位學者對每個品種收集了 50 個樣本, 分別測量兩種花瓣的長度和寬度, 最終形成了一個有 150 條資料的資料集. 這個資料集被廣泛用於機器學習的初學者做資料分析的練習.

import pandas as pd
import matplotlib.pyplot as plt
// 引入 iris 資料集
from sklearn.datasets import load_iris
iris = load_iris()

// 檢視 iris 資料集的屬性
dir(iris)
['DESCR', 'data', 'feature_names', 'filename', 'target', 'target_names']

// 檢視 iris 資料集的前5條資料
iris.data[0:5]
// 輸出, 分別是每朵花的每種花瓣的長度和寬度
array([[5.1, 3.5, 1.4, 0.2],
       [4.9, 3. , 1.4, 0.2],
       [4.7, 3.2, 1.3, 0.2],
       [4.6, 3.1, 1.5, 0.2],
       [5. , 3.6, 1.4, 0.2]])

// 檢視 iris 資料集的屬性名稱
iris.feature_names
// 輸出
['sepal length (cm)',
 'sepal width (cm)',
 'petal length (cm)',
 'petal width (cm)']

iris.target
// 輸出
array([0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
       0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
       0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
       1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1,
       1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
       2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
       2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2])

// 這裡就是 iris 花的三個品種的名字, 應該是分別對應了 target 值的 0, 1, 2
iris.target_names
//輸出
array(['setosa', 'versicolor', 'virginica'], dtype='<U10')

// 把資料集拆分為訓練資料和測試資料
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(iris.data, iris.target, test_size=0.2)
len(X_train) // 120
len(X_test) // 30

// 訓練模型
from sklearn.linear_model import LogisticRegression
model = LogisticRegression()
model.fit(X_train, y_train)

// 檢視模型準確度
model.score(X_test, y_test) // 0.9

// 透過模型進行預測
model.predict([[4.4, 3., 1.6, 0.9]])
// 輸出
array([0])

想要更加細緻地瞭解誤差的位置, 可以透過 confusion_matrix 類實現:

// 透過模型預測的值
y_predicted = model.predict(X_test)

// 引入 confusion_matrix 包
from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_test, y_predicted)
cm
// 輸出
array([[ 7,  0,  0],
       [ 0,  8,  2],
       [ 0,  0, 13]])

// 為了將上面的輸出視覺化更強, 引入 seaborn 包     
import seaborn as sn
plt.figure(figsize = (10, 7))
sn.heatmap(cm, annot=True)
plt.xlabel('Predicted')
plt.ylabel('Truth')

Machine Learning (6) - 關於 Logistic Regression (Multiclass Classification) 的小練習

本作品採用《CC 協議》，轉載必須註明作者和本文連結

Machine Learning (9) - 關於 Logistic Regression (Multiclass Classification) 的小練習
2019-06-08
Mac
Machine Learning (8) - Logistic Regression (Multiclass Classification)
2019-06-07
Mac
Machine Learning (7) - 關於 Logistic Regression (Binary Classification) 的小練習
2019-06-07
Mac
Machine Learning (6) - Logistic Regression (Binary Classification)
2019-06-07
Mac
Machine Learning (11) - 關於 Decision Tree 的小練習
2019-06-09
Mac
吳恩達機器學習第一課 Supervised Machine Learning Regression and Classification
2024-06-10
吳恩達機器學習Mac
Machine Learning（16） - 關於 K Means Clustering 的練習題
2019-06-15
Mac
Machine Learning (1) - Linear Regression
2019-04-14
Mac
吳恩達《Machine Learning》精煉筆記 6：關於機器學習的建議
2021-01-16
吳恩達Mac筆記機器學習
【機器學習】Logistic Regression 的前世今生（理論篇）
2019-02-22
機器學習
《Machine Learning in Action》—— Taoye給你講講Logistic迴歸是咋回事
2020-12-07
Mac
吳恩達機器學習課程：程式設計練習 | (2) ex2-logistic regression
2020-09-23
吳恩達機器學習程式設計
Logistic regression 為什麼用 sigmoid ？
2018-05-29
Sigmoid
4.邏輯迴歸（Logistic Regression）
2020-11-16
邏輯迴歸
《深度學習》PDF Deep Learning: Adaptive Computation and Machine Learning series
2019-12-17
深度學習APTMac
【深度學習基礎-13】非線性迴歸 logistic regression
2019-01-14
深度學習
Paper Reading: Random Balance ensembles for multiclass imbalance learning
2024-10-29
random
【題解】程式設計作業ex5: Regularized Linear Regression and Bias/Variance (Machine Learning)
2020-10-09
程式設計ZedMac
《machine learning》引言
2020-10-13
Mac
Machine Learning with Sklearn
2020-12-11
Mac
Machine Learning（機器學習）之二
2018-10-25
Mac機器學習
Machine Learning（機器學習）之一
2019-02-27
Mac機器學習
使用Octave來學習Machine Learning(二)
2019-02-27
Mac
Machine Learning 機器學習筆記
2018-03-27
Mac機器學習筆記
WEKA把分類(Classification)和迴歸(Regression)
2018-06-17
Machine Learning (12) - Support Vector Machine (SVM)
2019-06-10
Mac
Matlab機器學習3（Machine Learning Onramp）
2020-10-27
Matlab機器學習Mac
林軒田機器學習基石課程學習筆記10 — Logistic Regression
2018-07-24
機器學習筆記
Machine Learning－Introduction
2019-04-03
Mac
Machine Learning - Basic points
2020-01-17
Mac
邏輯迴歸（Logistic Regression）原理及推導
2019-02-22
邏輯迴歸
林軒田機器學習技法課程學習筆記5 — Kernel Logistic Regression
2018-07-25
機器學習筆記
三、邏輯迴歸logistic regression——分類問題
2024-08-06
邏輯迴歸
Extreme Learning Machine 翻譯
2019-01-20
REMMac
pages bookmarks for machine learning domain
2018-12-05
MacAI
Machine Learning（13）- Random Forest
2019-06-12
MacrandomREST
Machine Learning (10) - Decision Tree
2019-06-09
Mac
Machine learning terms_01
2021-04-07
Mac

Machine Learning (6) - 關於 Logistic Regression (Multiclass Classification) 的小練習

相關文章