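Inference problems after fine-tuning llama3-70b with llamafactory

Posted by sss1001 on 2024-05-28

Original URL: https://www.cnblogs.com/livysong/p/18218332

Problem description

After LoRA fine-tuning llama3-70b with llamafactory on NPUs, the final inference output contained garbled text and generation did not stop on its own. For example: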

derrick rose of the chicago bulls has the most career assists among players who have never been named to an all-star game with 3,339 assists.  IICIII.џџџ. 3,339 assists(stypyuseRal;\r\r\n

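Diagnosis

The garbled text always appeared at the very end of an otherwise complete answer, and generation never stopped at the eos_token; it only stopped when it hit the length limit. That pointed to a problem with the eos_token.
Comparing the tokenizer configuration files of the original model and the fine-tuned model showed that special_tokens_map.json differed between the two. This mismatch seemed a likely cause: either the model did not learn during fine-tuning to use the correct end marker to terminate generation, or the configurations conflicted when the weights were merged.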
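To make the "never stops" symptom concrete, here is a toy sketch of how a decoding loop decides when to halt. It is not LLaMA-Factory or transformers code: `generate_until_stop` and the token stream are hypothetical, and which token the model really emitted in this case is an assumption made only for illustration.

```python
from typing import List, Sequence, Set


def generate_until_stop(
    stream: Sequence[str], stop_tokens: Set[str], max_new_tokens: int
) -> List[str]:
    """Collect tokens until a stop token appears or the length limit is hit."""
    output: List[str] = []
    for token in stream:
        if len(output) >= max_new_tokens:
            break  # length limit reached without seeing a stop token
        if token in stop_tokens:
            break  # normal termination on a stop token
        output.append(token)
    return output


# Hypothetical stream: an answer, an end marker, then whatever the model emits next.
stream = ["3,339", " assists", ".", "<|end_of_text|>", " IICIII", ".", "\r\n", "junk"]

# The decoder watches for the marker that is actually emitted: clean stop.
clean = generate_until_stop(stream, {"<|end_of_text|>"}, max_new_tokens=6)

# The decoder watches for a different marker: output runs to the length limit.
runaway = generate_until_stop(stream, {"<|eot_id|>"}, max_new_tokens=6)

print(clean)
print(runaway)
```

When the stop set and the emitted end marker disagree, everything after the real answer is kept, which matches the pattern of a correct answer followed by trailing junk.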

Original model's configuration file

{
"bos_token": "<|begin_of_text|>",
"eos_token": "<|end_of_text|>"
}

Configuration file after fine-tuning

{
"bos_token": {
  "content": "<|begin_of_text|>",
  "lstrip": false,
  "normalized": false,
  "rstrip": false,
  "single_word": false
},
"eos_token": {
  "content": "<|eot_id|>",
  "lstrip": false,
  "normalized": false,
  "rstrip": false,
  "single_word": false
},
"pad_token": "<|eot_id|>"
}
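The two files above use different shapes for the same field (a plain string versus an object with a "content" key), so a naive text diff is noisy. A minimal sketch of a normalized comparison follows; `token_content` and `diff_special_tokens` are hypothetical helper names, and the JSON is inlined from the two listings rather than read from disk.

```python
import json
from typing import Any, Dict, Optional, Tuple


def token_content(entry: Any) -> Optional[str]:
    """Return the token string for a plain-string or dict-style entry."""
    if isinstance(entry, dict):
        return entry.get("content")
    return entry


def diff_special_tokens(
    base: Dict[str, Any], tuned: Dict[str, Any]
) -> Dict[str, Tuple[Optional[str], Optional[str]]]:
    """Map each differing key to (value in base, value in tuned)."""
    diffs = {}
    for key in sorted(set(base) | set(tuned)):
        before = token_content(base.get(key))
        after = token_content(tuned.get(key))
        if before != after:
            diffs[key] = (before, after)
    return diffs


base_map = json.loads(
    '{"bos_token": "<|begin_of_text|>", "eos_token": "<|end_of_text|>"}'
)
tuned_map = json.loads("""
{
  "bos_token": {"content": "<|begin_of_text|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "eos_token": {"content": "<|eot_id|>", "lstrip": false,
                "normalized": false, "rstrip": false, "single_word": false},
  "pad_token": "<|eot_id|>"
}
""")

differences = diff_special_tokens(base_map, tuned_map)
for name, (before, after) in differences.items():
    print(f"{name}: {before!r} -> {after!r}")
```

In practice the two dictionaries would come from `json.load` on each model directory's special_tokens_map.json.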

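Why do the two files differ? Fine-tuning llama3 uses the llama3 template defined in the template file. Opening LLaMA-Factory/src/llmtuner/data/template.py and looking at the llama3 entry shows that its stop_words do not match the eos_token in the original model's configuration. Because the template also sets replace_eos=True, LLaMA-Factory presumably overwrites the tokenizer's eos_token with this stop word, which would account for the <|eot_id|> in the fine-tuned file.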

_register_template(
    name="llama3",
    format_user=StringFormatter(
        slots=[
            (
                "<|start_header_id|>user<|end_header_id|>\n\n{{content}}<|eot_id|>"
                "<|start_header_id|>assistant<|end_header_id|>\n\n"
            )
        ]
    ),
    format_system=StringFormatter(
        slots=[{"bos_token"}, "<|start_header_id|>system<|end_header_id|>\n\n{{content}}<|eot_id|>"]
    ),
    format_observation=StringFormatter(
        slots=[
            (
                "<|start_header_id|>tool<|end_header_id|>\n\n{{content}}<|eot_id|>"
                "<|start_header_id|>assistant<|end_header_id|>\n\n"
            )
        ]
    ),
    default_system="You are a helpful assistant.",
    stop_words=["<|eot_id|>"],  # does not match the original model's eos_token
    replace_eos=True,
)

Solution

Use the eos_token from the original model's configuration instead. Modify the llama3 template in LLaMA-Factory/src/llmtuner/data/template.py as follows:

_register_template(
    name="llama3",
    format_user=StringFormatter(
        slots=[
            (
                "<|start_header_id|>user<|end_header_id|>\n\n{{content}}<|end_of_text|>"  # sss: was <|eot_id|>
                "<|start_header_id|>assistant<|end_header_id|>\n\n"
            )
        ]
    ),
    format_system=StringFormatter(
        slots=[{"bos_token"}, "<|start_header_id|>system<|end_header_id|>\n\n{{content}}<|end_of_text|>"]  # sss: was <|eot_id|>
    ),
    format_observation=StringFormatter(
        slots=[
            (
                "<|start_header_id|>tool<|end_header_id|>\n\n{{content}}<|end_of_text|>"  # sss: was <|eot_id|>
                "<|start_header_id|>assistant<|end_header_id|>\n\n"
            )
        ]
    ),
    default_system="You are a helpful assistant.",
    stop_words=["<|end_of_text|>"],  # sss: was <|eot_id|>
    replace_eos=True,
)
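Before starting another long training run, it may be worth confirming that the template's stop word really matches the original model's eos_token. The check below is a small sketch with a hypothetical helper, `eos_in_stop_words`; the values are copied from the listings in this post, not read from LLaMA-Factory.

```python
from typing import Any, Dict, List


def eos_in_stop_words(stop_words: List[str], special_tokens_map: Dict[str, Any]) -> bool:
    """Return True if the map's eos_token is one of the template's stop words."""
    eos = special_tokens_map.get("eos_token")
    if isinstance(eos, dict):  # dict-style entry: the token is under "content"
        eos = eos.get("content")
    return eos in stop_words


# eos_token as it appears in the original model's special_tokens_map.json
original_map = {"bos_token": "<|begin_of_text|>", "eos_token": "<|end_of_text|>"}

before_fix = eos_in_stop_words(["<|eot_id|>"], original_map)
after_fix = eos_in_stop_words(["<|end_of_text|>"], original_map)

print("template before the change matches:", before_fix)
print("template after the change matches:", after_fix)
```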

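Then fine-tune again with the modified template and run inference.

Results

After fine-tuning again, inference produces no garbled text and generation stops on its own.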

kourtney kardashian, kim kardashian, khloe kardashian, rob kardashian, kendall jenner, and kylie jenner.

Still to explore

Whether the quantized model shows the same problem has not been tested yet.
