100 tok/s generation speed — is that fast enough? Anyone who has used Cursor will remember one feature in particular: fast apply. With a single click, the AI-generated code in the chat panel is applied directly to the file currently open in the editor; you then review the diff and accept or reject each change block. Compared with manually copy-pasting code from the chat panel into the editor, this is dramatically more efficient, and it is one of Cursor's killer features.
You can now get the same feature through the VSCode extension Continue, using a small local model: Qwen2.5-Coder-1.5B. Running the 1.5B GGUF quantized builds through LMStudio on my local M2 Max, I measured roughly 100 tok/s for q8_0, 140 tok/s for q4_0, and 70 tok/s for fp16; the 7B q4_0 runs at about 40 tok/s. Balancing quality against speed, I settled on the 1.5B q8_0 version.
This all started when I came across FastApply-1.5B-v1.0, a model fine-tuned from qwen2.5-coder-1.5B (with a 7B variant as well) specifically for the fast-apply code-merging task, with better accuracy than the base model.
I tried to wire it into Continue (if you are new to Continue, this video is a good introduction: continue開源AI程式碼程式設計助手-自定義api-SiliconFlow矽基流動與deepseek配置教程, on Bilibili). Unfortunately, its output format is <updated-code>[Full-complete updated file]</updated-code>, so parsing the model's output would require modifying Continue's source code. That was more trouble than it was worth, so I gave up on the fine-tune and went with the stock qwen2.5-coder-1.5B instead.
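To make the incompatibility concrete, here is a minimal sketch (not Continue's actual parser — both function names are mine) contrasting the markdown-fence output Continue's apply flow can consume with the custom tags FastApply emits:

```typescript
// Continue's apply flow extracts code from a markdown fence:
function extractFromMarkdown(output: string): string | null {
  const m = output.match(/```(?:\w+)?\n([\s\S]*?)```/);
  return m ? m[1] : null;
}

// FastApply-1.5B-v1.0 instead wraps the whole file in custom tags:
function extractFromTags(output: string): string | null {
  const m = output.match(/<updated-code>([\s\S]*?)<\/updated-code>/);
  return m ? m[1] : null;
}

const fastApplyOutput = "<updated-code>const x = 1;</updated-code>";
console.log(extractFromMarkdown(fastApplyOutput)); // null — no fence found
console.log(extractFromTags(fastApplyOutput));     // "const x = 1;"
```

The tag extraction itself is trivial; the cost is in patching it into Continue's apply pipeline, which is why switching the prompt to markdown fences (shown later in config.ts) is the simpler route.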
In my rough comparison, the stock model is less disciplined — it tends to drop comments, newlines, and whitespace — while the fine-tune's output is more faithful. But the stock model is perfectly usable: merging simple code under 200 lines is effortless, and the same 1.5B model handles both fast apply and FIM code completion. One model, two jobs — a great deal for local inference.
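For reference, the FIM (fill-in-the-middle) side uses Qwen2.5-Coder's special tokens — this is what Continue's `"template": "qwen"` autocomplete setting builds under the hood. A minimal sketch (the function name is my own; the token names follow Qwen2.5-Coder's documented FIM format):

```typescript
// Build a Qwen2.5-Coder FIM prompt: the model generates the code
// that belongs between the prefix and the suffix.
function buildQwenFimPrompt(prefix: string, suffix: string): string {
  return `<|fim_prefix|>${prefix}<|fim_suffix|>${suffix}<|fim_middle|>`;
}

const prompt = buildQwenFimPrompt("function add(a, b) {\n  return ", ";\n}");
console.log(prompt);
```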
Here is how to configure Continue:
// ~/.continue/config.json
{
  "models": [{
    "title": "fastapply-1.5b-v1.0@f16",
    "model": "qwen2.5-coder-1.5b-instruct@q8_0",
    "apiBase": "http://192.168.8.110:5000/v1",
    "provider": "lmstudio",
    "contextLength": 4000,
    "completionOptions": {
      "maxTokens": 4000,
      "stop": ["<|endoftext|>"],
      "temperature": 0.01
    }
  }],
  "tabAutocompleteModel": {
    "title": "ollama_model",
    "provider": "lmstudio",
    "model": "qwen2.5-coder-1.5b-instruct@q8_0",
    "template": "qwen",
    "apiBase": "http://192.168.8.110:5000/v1"
  },
  "modelRoles": {
    "applyCodeBlock": "fastapply-1.5b-v1.0@f16",
    "inlineEdit": "fastapply-1.5b-v1.0@f16"
  }
}
// ~/.continue/config.ts
export function modifyConfig(config: Config): Config {
  const gptEditPrompt: PromptTemplate = (_, otherData) => {
    // The original FastApply prompt asks for output "enclosed within
    // <updated-code> and </updated-code> tags":
    //   system: You are a coding assistant that helps merge code updates
    //   Do not include any additional text, explanations, placeholders, ellipses, or code fences.
    // For easier compatibility with Continue, I changed it to markdown format:
    //   enclosed within markdown \`\`\`your update code\`\`\`
    const systemMessage =
      `<|im_start|>system You are a coding assistant that helps fix code and merge code updates, ensuring every modification is fully integrated.<|im_end|>`;
    const userMessage =
      `<|im_start|>user Merge all changes from the <update> snippet into the <code> below. - Preserve the code's structure, order, comments, and indentation exactly. - Output only the updated code, enclosed within markdown \`\`\`your update code\`\`\`. - Do not include any additional text, explanations, placeholders, ellipses.`;
    if (otherData?.codeToEdit?.trim().length === 0) {
      return `${systemMessage}
${userMessage}
<code>${otherData.prefix}[BLANK]${otherData.suffix}</code>
<update>${otherData.userInput}</update>
Provide the complete updated code.<|im_end|>
<|im_start|>assistant `;
    }
    // const codeBlock = `${otherData.prefix}<code>${otherData.codeToEdit}</code>${otherData.suffix}`; // alternative: include prefix/suffix context
    const codeBlock = `<code>${otherData.codeToEdit}</code>`;
    const updateBlock = `<update>${otherData.userInput}</update>`;
    return `${systemMessage}
${userMessage}
${codeBlock}
${updateBlock}
Provide the complete updated code.<|im_end|>
<|im_start|>assistant `;
  };

  const modelName = "fastapply-1.5b-v1.0@f16";
  // Attach the edit template to the model whose title matches modelName;
  // the title here must match the "title" field in config.json exactly.
  const applyModel = config.models.find(model => model.title === modelName);
  if (applyModel) {
    applyModel.promptTemplates = {
      edit: gptEditPrompt,
    };
  } else {
    console.warn(`Model "${modelName}" not found in config.models`);
  }
  return config;
}
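To sanity-check what the model actually receives, here is a standalone sketch (types stripped, function name hypothetical) that reproduces the ChatML prompt the gptEditPrompt template above builds for a non-empty selection:

```typescript
// Mirror of the edit template's non-empty-selection branch,
// extracted so the rendered prompt can be inspected on its own.
function renderEditPrompt(codeToEdit: string, userInput: string): string {
  const systemMessage =
    `<|im_start|>system You are a coding assistant that helps fix code and merge code updates, ensuring every modification is fully integrated.<|im_end|>`;
  const userMessage =
    `<|im_start|>user Merge all changes from the <update> snippet into the <code> below. - Output only the updated code, enclosed within markdown \`\`\`your update code\`\`\`.`;
  return `${systemMessage}
${userMessage}
<code>${codeToEdit}</code>
<update>${userInput}</update>
Provide the complete updated code.<|im_end|>
<|im_start|>assistant `;
}

const prompt = renderEditPrompt("const a = 1;", "rename a to count");
console.log(prompt);
```

The final `<|im_start|>assistant ` turn is left open so the model's completion is the merged code itself, fenced in markdown for Continue to extract.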
I have also opened an issue on the continue repository asking for native compatibility with fastApply fine-tuned models — feel free to follow its progress there.