Machine Learning (1) - Linear Regression

Rachel發表於2019-04-14

原文網址 : https://learnku.com/articles/27483?order_by=created_at&

Pandas 是學習 Machine Learning 的利器，這裡假設你已經對 Pandas 基礎有所瞭解。

這一節主要以預測一個地區的房價為例，學習 ML 的模型之一 Linear Regression：

1. 引入需要用的包以及資料檔案

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from sklearn import linear_model

df = pd.read_csv('/Users/rachel/Downloads/py-master/ML/1_linear_reg/homeprices.csv')
df

輸出:

Machine Learning (1) - Linear Regression

2. 訓練 Linear Regression 模型

reg = linear_model.LinearRegression() // 初始化資料模型

// 訓練這個模型，第一個引數是已知的資料，第二個引數是未來要預測的值
reg.fit(df[['area']], df.price)

現在就可以用模型來預測值

reg.predict([[5000]])

輸出:

array([859554.79452055])

這裡大概介紹一下 Linear Regression 的建模公式:

y = m * x + b

m 和 b 就是模型的係數, 通過提供大量的 x 和 y 的值，來求出最佳的 m 和 b 的值，這也就是訓練模型的過程。

m 被稱作 Coefficients
b 被稱作 Intercept
通過下面兩個命令就可以檢視 Linear Regression 模型的 m 和 b 的值:
reg.coef_  // m 的值
reg.intercept_  // b 的值

3. 輸出圖形資料:

%matplotlib inline
plt.xlabel('area(sqr ft)', fontsize=20) // x 軸
plt.ylabel('price(US$)', fontsize=20) // y 軸
plt.scatter(df.area, df.price, color='red', marker='+') // 以 “點” 輸出已知資料
plt.plot(df.area, reg.predict(df[['area']]), color='blue') // 以 “線” 輸出預測的資料，第二個引數是根據模型預測的值

Machine Learning (1) - Linear Regression

上面這條線，就是我們最終得到的 Linear Regression 模型，得到這條線，我們就可以輕鬆預測任何尺寸的房價，也就相當於模型訓練完成。

4. 應用模型

下面就用前面訓練好的模型來快速預測房價：

先建一個新的 csv 檔案，裡面填充一些房子的面積值：

df_new = pd.read_csv('/Users/rachel/Sites/py-master/ML/1_linear_reg/areas.csv')
df_new.head()

輸出（大家可以按照這個輸出格式，隨便建一個側表來做測試）：

Machine Learning (1) - Linear Regression

用訓練好的模型 reg 做房價預測

p = reg.predict(df_new[['area']])
p

// 輸出
array([ 316404.10958904,  384297.94520548,  492928.08219178,
        661304.79452055,  740061.64383562,  799808.21917808,
        926090.75342466,  650441.78082192,  825607.87671233,
        492928.08219178, 1402705.47945205, 1348390.4109589 ,
       1144708.90410959])

// 用預測出來的房價資料完善原表     
df_new['price'] = p
df_new

// 輸出完善好的資料到 prediction.csv 檔案
//至於這個檔案生成在哪裡, 還是去終端看下, 你此時的 jupyter notebook 執行在哪裡
df_new.to_csv('prediction.csv', index = False)

輸出：

Machine Learning (1) - Linear Regression

今天開始第二次機器學習，對一些知識點有了更深入的瞭解，把之前的筆記完善一下。

本作品採用《CC 協議》，轉載必須註明作者和本文連結

【題解】程式設計作業ex5: Regularized Linear Regression and Bias/Variance (Machine Learning)
2020-10-09
程式設計ZedMac
Machine Learning (6) - Logistic Regression (Binary Classification)
2019-06-07
Mac
Machine Learning (8) - Logistic Regression (Multiclass Classification)
2019-06-07
Mac
閱讀翻譯Mathematics for Machine Learning之2.7 Linear Mappings
2024-07-23
MacAPP
閱讀翻譯Mathematics for Machine Learning之2.5 Linear Independence
2024-07-18
Mac
Machine Learning (6) - 關於 Logistic Regression (Multiclass Classification) 的小練習
2019-04-14
Mac
Machine Learning (7) - 關於 Logistic Regression (Binary Classification) 的小練習
2019-06-07
Mac
Machine Learning (9) - 關於 Logistic Regression (Multiclass Classification) 的小練習
2019-06-08
Mac
吳恩達機器學習第一課 Supervised Machine Learning Regression and Classification
2024-06-10
吳恩達機器學習Mac
通俗理解線性迴歸(Linear Regression)
2020-09-11
《machine learning》引言
2020-10-13
Mac
Machine Learning with Sklearn
2020-12-11
Mac
Machine Learning (12) - Support Vector Machine (SVM)
2019-06-10
Mac
Machine Learning－Introduction
2019-04-03
Mac
Machine Learning - Basic points
2020-01-17
Mac
Extreme Learning Machine 翻譯
2019-01-20
REMMac
pages bookmarks for machine learning domain
2018-12-05
MacAI
Machine Learning（13）- Random Forest
2019-06-12
MacrandomREST
Machine Learning (10) - Decision Tree
2019-06-09
Mac
Machine learning terms_01
2021-04-07
Mac
機器學習-----線性迴歸淺談（Linear Regression）
2019-02-22
機器學習
statsmodels中的summary解讀(以linear regression模型為例)
2018-08-17
模型
線性迴歸（Linear Regression）演算法優缺點
2020-02-24
演算法
Machine Learning (5) - Training and Testing Data
2019-06-06
MacAI
SciTech-BigDataAIML-Machine Learning Tutorials
2024-08-12
AIMac
《深度學習》PDF Deep Learning: Adaptive Computation and Machine Learning series
2019-12-17
深度學習APTMac
Machine Learning Yearning 要點筆記
2018-10-24
Mac筆記
Machine Learning（14） - K Fold Cross Validation
2019-06-18
MacROS
MATH38161 Multivariate Statistics and Machine Learning
2024-11-23
Mac
MPHY0041 Machine Learning in Medical Imaging
2024-12-01
Mac
ml-10-1-規模機器學習( ( Large Scale Machine Learning) )
2020-10-14
機器學習Mac
大資料分析筆記 (4.1) - 線性迴歸分析(Linear Regression)
2020-11-19
大資料筆記
軒田機器學習技法課程學習筆記1 — Linear Support Vector Machine
2018-07-25
機器學習筆記Mac
Machine Learning（機器學習）之二
2018-10-25
Mac機器學習
Machine Learning（機器學習）之一
2019-02-27
Mac機器學習
使用Octave來學習Machine Learning(二)
2019-02-27
Mac
Machine Learning 機器學習筆記
2018-03-27
Mac機器學習筆記
Machine Learning With Go 第4章：迴歸
2022-06-01
MacGo

Machine Learning (1) - Linear Regression

1. 引入需要用的包以及資料檔案

2. 訓練 Linear Regression 模型

3. 輸出圖形資料:

4. 應用模型

相關文章