A Code Walkthrough of an Open-Source Project for Fine-Tuning Tsinghua University's ChatGLM-6B with LoRA

Author: 大狗在海裡 2023-04-24 11:04:00

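Two statements up front:

1. Without open source, no industry would make progress; the authors of open-source projects deserve everyone's respect.

2. This article is not meant to belittle the project's author. As an evangelist, one should reduce as far as possible the cost that misunderstandings and mistaken operations impose on learners.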

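This is the era in which "open source is king, data is king, models are king". Only the ability to keep learning wards off the "age-35 crisis".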

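I came across an open-source project that applies LoRA to ChatGLM-6B and helps you quickly build your own private chatbot on top of ChatGLM-6B. My walkthrough follows: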

1. Data preparation (data is what matters most)

cover_alpaca2jsonl.py

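This script converts the Stanford Alpaca data directly into the project's own format. The core mapping is: instruction --> "Instruction: ", input --> "Input: ", followed by "Answer: ", all of which form context; output --> target.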

def format_example(example: dict) -> dict:
    context = f"Instruction: {example['instruction']}\n"
    if example.get("input"):
        context += f"Input: {example['input']}\n"
    context += "Answer: "
    target = example["output"]
    return {"context": context, "target": target}

Sample record in the Stanford Alpaca format:

{
    "instruction": "Give three tips for staying healthy.",
    "input": "",
    "output": "1.Eat a balanced diet and make sure to include plenty of fruits and vegetables. \n2. Exercise regularly to keep your body active and strong. \n3. Get enough sleep and maintain a consistent sleep schedule."
}

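Sample record after conversion by cover_alpaca2jsonl.py. Note: be sure to read the source code! Whether through the author's oversight or for some other reason, the generated jsonl data in the git repository uses "Response:", while the code uses "Answer: ".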

{
    "context": "Instruction: Give three tips for staying healthy.\nAnswer: ",
    "target": "1.Eat a balanced diet and make sure to include plenty of fruits and vegetables. \n2. Exercise regularly to keep your body active and strong. \n3. Get enough sleep and maintain a consistent sleep schedule."
}
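To check the mapping described above, here is a minimal sketch that applies the quoted format_example to two records and serializes them as JSON lines. The to_jsonl helper and the second sample record are hypothetical, introduced only for illustration; the project's script may read and write its files differently.

```python
import json


def format_example(example: dict) -> dict:
    # Same mapping as the function quoted from cover_alpaca2jsonl.py.
    context = f"Instruction: {example['instruction']}\n"
    if example.get("input"):
        # The "Input:" line is added only when the input field is non-empty.
        context += f"Input: {example['input']}\n"
    context += "Answer: "
    target = example["output"]
    return {"context": context, "target": target}


def to_jsonl(examples: list) -> str:
    # Hypothetical helper: one JSON object per line.
    return "\n".join(
        json.dumps(format_example(e), ensure_ascii=False) for e in examples
    )


samples = [
    # Instruction taken from the Alpaca sample; the output is shortened here.
    {
        "instruction": "Give three tips for staying healthy.",
        "input": "",
        "output": "1.Eat a balanced diet. \n2. Exercise regularly. \n3. Get enough sleep.",
    },
    # Hypothetical record with a non-empty input field.
    {
        "instruction": "Translate the sentence into French.",
        "input": "Good morning.",
        "output": "Bonjour.",
    },
]

records = [format_example(s) for s in samples]
jsonl_text = to_jsonl(samples)
print(jsonl_text)
```

Because json.dumps escapes the newlines inside each string, every record stays on a single line, which is what the jsonl format requires.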

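2. tokenize_dataset_rows.py

Note: the documentation gets this option wrong.

Given how the code implements the option, the documentation should pass it a value:

--skip_overlength true

Be aware that if the option is declared with type=bool, argparse applies bool() to the raw string, so any non-empty value, including "false", parses as True; to leave the option off, omit it.

Alternatively, change the code so that the option is a plain flag. Drop type=bool when doing so, because argparse raises a TypeError when type is combined with action="store_true":

parser.add_argument("--skip_overlength", action="store_true")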
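A minimal sketch of how argparse treats the possible declarations of this option. The parser names are hypothetical and the snippet is independent of the project's script; it only exercises standard-library behavior.

```python
import argparse

# Variant A: type=bool converts the raw string with bool(),
# so every non-empty string becomes True.
parser_a = argparse.ArgumentParser()
parser_a.add_argument("--skip_overlength", type=bool, default=False)

# Variant B: a store_true flag takes no value at all.
parser_b = argparse.ArgumentParser()
parser_b.add_argument("--skip_overlength", action="store_true")

a_false = parser_a.parse_args(["--skip_overlength", "false"]).skip_overlength
a_default = parser_a.parse_args([]).skip_overlength
b_set = parser_b.parse_args(["--skip_overlength"]).skip_overlength
b_default = parser_b.parse_args([]).skip_overlength

# Variant C: combining type with action="store_true" is rejected
# as soon as the argument is declared.
try:
    argparse.ArgumentParser().add_argument(
        "--skip_overlength", type=bool, action="store_true"
    )
    combined_error = None
except TypeError as exc:
    combined_error = str(exc)

print(a_false, a_default, b_set, b_default)  # True False True False
print(combined_error)
```

With variant A, passing "false" still enables the option, which is why the flag form of variant B is usually the less surprising choice for an on/off switch.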

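3. Fine-tuning with finetune.py

Choose the data type (fp32, fp16/half, int8, and so on) according to your actual hardware and what the code requires, and adjust it as needed.

If you hit RuntimeError: expected scalar type Half but found Float, simply remove --fp16.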

python finetune.py \
    --dataset_path data/alpaca \
    --lora_rank 8 \
    --per_device_train_batch_size 6 \
    --gradient_accumulation_steps 1 \
    --max_steps 52000 \
    --save_steps 1000 \
    --save_total_limit 2 \
    --learning_rate 1e-4 \
    --fp16 \
    --remove_unused_columns false \
    --logging_steps 50 \
    --output_dir output
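To get a feel for what these step-related flags add up to, the arithmetic can be sketched as follows. The flag values come from the command above; num_gpus = 1 and dataset_size = 52000 (roughly the size of the Alpaca data, before any over-length rows are skipped) are assumptions made only for this illustration.

```python
# Values taken from the command line above.
per_device_train_batch_size = 6
gradient_accumulation_steps = 1
max_steps = 52000
save_steps = 1000
save_total_limit = 2
logging_steps = 50

# Assumptions for illustration only.
num_gpus = 1          # single machine, single GPU
dataset_size = 52000  # approximate number of training examples

# Examples consumed per optimizer step.
effective_batch = per_device_train_batch_size * gradient_accumulation_steps * num_gpus

# Total examples seen over the whole run, and the equivalent number of epochs.
samples_seen = effective_batch * max_steps
epochs = samples_seen / dataset_size

# Checkpoints written during the run versus checkpoints kept on disk.
checkpoints_written = max_steps // save_steps
checkpoints_kept = min(save_total_limit, checkpoints_written)

# Number of times training metrics are logged.
log_events = max_steps // logging_steps

print(effective_batch, samples_seen, epochs)
print(checkpoints_written, checkpoints_kept, log_events)
```

Under these assumptions the run passes over the data about six times. With more GPUs or a smaller batch size the same max_steps corresponds to a different number of epochs, so it is worth redoing this calculation for your own setup.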

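Finally, go ahead and train your own large model on a single machine with one GPU, a single machine with multiple GPUs, or multiple machines with multiple GPUs.

Project git repository: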

https://github.com/mymusise/ChatGLM-Tuning.git
