ML.NET 示例：推薦之One Class 矩陣分解

feiyun0112發表於2018-12-12

原文網址 : https://www.cnblogs.com/feiyun0112/p/10110581.html

寫在前面

準備近期將微軟的machinelearning-samples翻譯成中文，水平有限，如有錯漏，請大家多多指正。
如果有朋友對此感興趣，可以加入我：https://github.com/feiyun0112/machinelearning-samples.zh-cn

產品推薦 - 矩陣分解問題示例

ML.NET 版本	API 型別	狀態	應用程式型別	資料型別	場景	機器學習任務	演算法
v0.8	動態 API	最新版本	控制檯應用程式	.txt 檔案	推薦	矩陣分解	MatrixFactorizationTrainer (One Class)

在這個示例中，您可以看到如何使用ML.NET來構建產品推薦方案。

本示例中的推薦方式基於共同購買或經常一起購買的產品，這意味著它將根據客戶的購買歷史向客戶推薦一組產品。

替代文字

在這個示例中，基於經常一起購買的學習模型來推薦產品。

問題

在本教程中，我們將使用亞馬遜共同購買產品資料集。

我們將使用One-Class因式分解機來構建我們的產品推薦器，它使用協同過濾方法。

我們介紹的one-class和其他因式分解機的區別在於，在這個資料集中，我們只有購買歷史的資訊。

我們沒有評分或其他詳細資訊，如產品描述等。

“協同過濾”是在一個基本假設的情況下運作的，即如果某人A在一個問題上與某人B具有相同的意見，則在另一個問題上，相對其他隨機選擇的人，A更傾向於B的觀點。

資料集

原始資料來自SNAP:
https://snap.stanford.edu/data/amazon0302.html

ML 任務 - 矩陣分解 (推薦)

這個示例的ML任務是矩陣分解，它是一個執行協同過濾的有監督的機器學習任務。

解決方案

要解決此問題，您需要在現有訓練資料上建立和訓練ML模型，評估其有多好（分析獲得的指標），最後您可以使用/測試模型來預測給定輸入資料變數的需求。

建立 -> 訓練 -> 評估 -> 使用

1. 建立模型

建立模型包括:

從 https://snap.stanford.edu/data/amazon0302.html 下載並複製資料集檔案Amazon0302.txt。
使用以下內容替換列名：ProductID ProductID_Copurchased
在讀取器中，我們已經提供了KeyRange，並且產品ID已經編碼，我們需要做的就是使用幾個額外的引數呼叫MatrixFactorizationTrainer。

下面是用於建立模型的程式碼：

 
    //STEP 1: Create MLContext to be shared across the model creation workflow objects 
    var ctx = new MLContext();

    //STEP 2: Create a reader by defining the schema for reading the product co-purchase dataset
    //        Do remember to replace amazon0302.txt with dataset from 
              https://snap.stanford.edu/data/amazon0302.html
    var reader = ctx.Data.TextReader(new TextLoader.Arguments()
    {
        Separator = "tab",
        HasHeader = true,
        Column = new[]
        {
                new TextLoader.Column("Label", DataKind.R4, 0),
                new TextLoader.Column("ProductID", DataKind.U4, new [] { new TextLoader.Range(0) }, new KeyRange(0, 262110)),
                new TextLoader.Column("CoPurchaseProductID", DataKind.U4, new [] { new TextLoader.Range(1) }, new KeyRange(0, 262110))
            }
        });

        //STEP 3: Read the training data which will be used to train the movie recommendation model
        var traindata = reader.Read(new MultiFileSource(TrainingDataLocation));


        //STEP 4: Your data is already encoded so all you need to do is call the MatrixFactorization Trainer with a few extra hyperparameters:
        //        LossFunction, Alpa, Lambda and a few others like K and C as shown below. 
        var est = ctx.Recommendation().Trainers.MatrixFactorization("ProductID", "CoPurchaseProductID",  
                                     labelColumn: "Label",
                                     advancedSettings: s =>
                                     {
                                         s.LossFunction = MatrixFactorizationTrainer.LossFunctionType.SquareLossOneClass;
                                         s.Alpha = 0.01;
                                         s.Lambda = 0.025;
                                         // For better results use the following parameters
                                         //s.K = 100;
                                         //s.C = 0.00001;
                                     });

2. 訓練模型

一旦定義了評估器，就可以根據可用的訓練資料對評估器進行訓練。

這將返回一個訓練過的模型。


    //STEP 5: Train the model fitting to the DataSet
    //Please add Amazon0302.txt dataset from https://snap.stanford.edu/data/amazon0302.html to Data folder if FileNotFoundException is thrown.
    var model = est.Fit(traindata);

3. 使用模型

我們將通過建立預測引擎/函式來執行此模型的預測，如下所示。

    public class Copurchase_prediction
    {
        public float Score { get; set; }
    }

    public class ProductEntry
    {
        [KeyType(Contiguous = true, Count = 262111, Min = 0)]
        public uint ProductID { get; set; }

        [KeyType(Contiguous = true, Count = 262111, Min = 0)]
        public uint CoPurchaseProductID { get; set; }
        }

一旦建立了預測引擎，就可以預測兩個產品被共同購買的分數。

    //STEP 6: Create prediction engine and predict the score for Product 63 being co-purchased with Product 3.
    //        The higher the score the higher the probability for this particular productID being co-purchased 
    var predictionengine = model.MakePredictionFunction<ProductEntry, Copurchase_prediction>(ctx);
    var prediction = predictionengine.Predict(
                             new ProductEntry()
                             {
                             ProductID = 3,
                             CoPurchaseProductID = 63
                             });

ML.NET 示例：推薦之矩陣分解
2018-12-11
矩陣
ML.NET 示例：推薦之場感知分解機
2018-12-13
用Spark學習矩陣分解推薦演算法
2018-09-30
Spark矩陣演算法
推薦系統實踐 0x0b 矩陣分解
2020-12-04
矩陣
矩陣分解
2020-12-06
矩陣
（十八）從零開始學人工智慧-智慧推薦系統：矩陣分解
2021-09-09
人工智慧矩陣
矩陣分解--超詳細解讀
2020-10-03
矩陣
ML.NET 示例：聚類之鳶尾花
2018-12-15
聚類
矩陣的奇異值分解（SVD）及其應用
2024-07-26
矩陣
ML.NET 示例：深度學習之整合TensorFlow
2018-12-16
深度學習
達觀資料周顥鈺：想寫出人見人愛的推薦系統，先了解經典矩陣分解技術
2018-10-09
矩陣
ML.NET 示例：迴歸之價格預測
2018-12-08
ML.NET 示例：迴歸之銷售預測
2018-12-09
協方差矩陣推導1
2024-10-19
矩陣
矩陣加速線性遞推
2024-08-19
矩陣
【矩陣乘法】【快速冪】遞推
2020-12-19
矩陣
基於矩陣分解的協同過濾演算法
2024-04-11
矩陣演算法
三維旋轉矩陣推導
2019-03-15
矩陣
二維旋轉矩陣推導
2024-04-02
矩陣
動態dp & 矩陣加速遞推
2024-08-19
矩陣
ML.NET 示例：多類分類之問題分類
2018-12-06
ML.NET 示例：多類分類之鳶尾花分類
2018-12-07
數學建模例題2.28 矩陣合併示例
2024-10-28
矩陣
數學建模例題例 2.29 矩陣分割示例
2024-10-28
矩陣
數學建模例題2.30 矩陣元素求和示例
2024-10-28
矩陣
資料結構之陣列和矩陣--矩陣&不規則二維陣列
2020-10-02
資料結構陣列矩陣
ML.NET呼叫Tensorflow模型示例——MNIST
2019-05-21
模型
小紅書矩陣投放產品推廣做紅書矩陣上海氖天
2023-03-14
矩陣
MKL稀疏矩陣運算示例及函式封裝
2023-04-23
矩陣函式封裝
巨大的矩陣（矩陣加速）
2024-08-16
矩陣
鄰接矩陣、度矩陣
2021-12-07
矩陣
人工智慧-機器學習-演算法：非負矩陣分解(NMF)
2020-12-27
人工智慧機器學習演算法矩陣
奇異矩陣，非奇異矩陣，偽逆矩陣
2020-09-29
矩陣
JavaScript陣列方法大全(推薦)
2018-10-25
JavaScript陣列
Netflix 推薦系統 (Part One)-排序演算法
2018-09-27
排序演算法
推薦系統與協同過濾、奇異值分解
2019-03-04
3D旋轉矩陣的推導
2019-09-16
3D矩陣
ML.NET 示例：二元分類之垃圾簡訊檢測
2018-12-03