partial implementation of lqlora #8324
base: develop
Conversation
Thanks for your contribution!
Codecov Report

Attention: Patch coverage is 0.00% — none of the 41 added lines are covered by tests.

Additional details and impacted files:

```
@@            Coverage Diff             @@
##           develop    #8324      +/-  ##
===========================================
- Coverage    54.41%   54.39%    -0.03%
===========================================
  Files          632      633        +1
  Lines        99475    99516       +41
===========================================
  Hits         54127    54127
- Misses       45348    45389       +41
```
```python
lora_A = Ur @ paddle.diag(paddle.sqrt(Sr))
lora_B = paddle.diag(paddle.sqrt(Sr)) @ Vhr

Q = qlora_weight_quantize_dequantize(W - lora_A @ lora_B, double_quant=True)
```
double_quant=True should be an adjustable parameter, and the same applies to the other arguments of qlora_weight_quantize_dequantize.
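A minimal sketch of that suggestion, assuming a hypothetical `quant_kwargs` argument threaded through from the caller (the helper name `quantize_residual` is illustrative, not part of the PR):

```python
import paddle
from paddlenlp.quantization.qlora import qlora_weight_quantize_dequantize

# Sketch: expose the quantization options instead of hardcoding double_quant=True.
# `quant_kwargs` is a hypothetical parameter; the default keeps the PR's behavior.
def quantize_residual(W, lora_A, lora_B, quant_kwargs=None):
    if quant_kwargs is None:
        quant_kwargs = {"double_quant": True}
    # Quantize-dequantize the residual left after removing the low-rank part.
    return qlora_weight_quantize_dequantize(W - lora_A @ lora_B, **quant_kwargs)
```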
```python
Sr = S[:num_ranks]
Vhr = Vh[:num_ranks]

lora_A = Ur @ paddle.diag(paddle.sqrt(Sr))
```
The configuration needs to take the LoRA scaling into account; as written, it looks like the LoRA scaling can only be forced to 1.
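One way to handle this, sketched under the assumption that the layer computes `x @ W + scaling * (x @ lora_A @ lora_B)` with `scaling = lora_alpha / r`: fold `1 / scaling` into one factor so the effective update still equals the rank-r part of W (the function name is illustrative):

```python
import paddle

# Sketch: make the SVD initialization compatible with an arbitrary LoRA scaling
# by cancelling it inside lora_B. The PR as written implicitly assumes scaling == 1.
def svd_init_with_scaling(W, num_ranks, scaling):
    U, S, Vh = paddle.linalg.svd(W, full_matrices=False)
    Ur, Sr, Vhr = U[:, :num_ranks], S[:num_ranks], Vh[:num_ranks]
    lora_A = Ur @ paddle.diag(paddle.sqrt(Sr))
    lora_B = (paddle.diag(paddle.sqrt(Sr)) @ Vhr) / scaling  # cancels the layer's scaling
    return lora_A, lora_B
```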
```python
if W.dtype in [paddle.float16]:
    old_dtype = W.dtype
    W = paddle.cast(W, dtype=paddle.float32)
```
What is the reason for casting to fp32?
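For context, a plausible rationale (an assumption, not confirmed in the thread): SVD kernels generally require fp32/fp64 inputs, and half-precision SVD would be numerically unstable in any case, so the usual pattern is to factorize in fp32 and cast the factors back afterwards:

```python
import paddle

# Sketch of the cast-and-restore pattern around the SVD (assumed rationale:
# paddle.linalg.svd expects fp32/fp64, and fp16 SVD would be unstable anyway).
def svd_in_fp32(W, num_ranks):
    old_dtype = W.dtype
    if old_dtype == paddle.float16:
        W = paddle.cast(W, dtype=paddle.float32)
    U, S, Vh = paddle.linalg.svd(W, full_matrices=False)
    lora_A = U[:, :num_ranks] @ paddle.diag(paddle.sqrt(S[:num_ranks]))
    lora_B = paddle.diag(paddle.sqrt(S[:num_ranks])) @ Vh[:num_ranks]
    # Restore the original dtype so the factors match the rest of the model.
    return paddle.cast(lora_A, old_dtype), paddle.cast(lora_B, old_dtype)
```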
Are there any experimental results we can reference to see the effect?
Please fix the formatting issues before submitting.
```python
import paddle
from paddlenlp.quantization.qlora import qlora_weight_quantize_dequantize
```
Suggest wrapping the lqlora initialization process into an lqlora_init function, passing whether to use lqlora in via lora_config, and considering applying this lqlora_init to lora_module before line 621: https://github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/peft/lora/lora_model.py#L621
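A rough sketch of that refactor, assuming PaddleNLP-style LoRA layers with `lora_A` of shape `[in_features, r]` and `lora_B` of shape `[r, out_features]`; the function name and the config-driven gating follow the reviewer's suggestion, while the body mirrors the PR's logic:

```python
import paddle
from paddlenlp.quantization.qlora import qlora_weight_quantize_dequantize

def lqlora_init(module):
    # Only touch layers that actually carry LoRA parameters.
    if not hasattr(module, "lora_A"):
        return
    W = module.weight
    old_dtype = W.dtype
    if old_dtype == paddle.float16:
        W = paddle.cast(W, dtype=paddle.float32)
    U, S, Vh = paddle.linalg.svd(W, full_matrices=False)
    r = module.lora_A.shape[-1]
    lora_A = U[:, :r] @ paddle.diag(paddle.sqrt(S[:r]))
    lora_B = paddle.diag(paddle.sqrt(S[:r])) @ Vh[:r]
    # Quantize-dequantize the residual so that weight ≈ Q + lora_A @ lora_B.
    Q = qlora_weight_quantize_dequantize(W - lora_A @ lora_B, double_quant=True)
    module.weight.set_value(paddle.cast(Q, old_dtype))
    module.lora_A.set_value(paddle.cast(lora_A, old_dtype))
    module.lora_B.set_value(paddle.cast(lora_B, old_dtype))
```

Before line 621 this could then be applied with something like `if self.lora_config.lqlora: self.apply(lqlora_init)`, where the `lqlora` flag is the hypothetical config field being requested.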
```diff
@@ -477,6 +478,9 @@ def neft_post_hook(module, input, output):
     else:
         model = LoRAModel.from_pretrained(model=model, lora_path=model_args.lora_path)

+    if model_args.lqlora:
+        transform_lora_layers(model)
```
Control this through an lqlora option passed in lora_config instead.
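For illustration, usage under that suggestion might look like the following; note the `lqlora` field on LoRAConfig does not exist yet and is exactly the change being requested:

```python
from paddlenlp.peft import LoRAConfig, LoRAModel

# Hypothetical: `lqlora` as a LoRAConfig field, replacing model_args.lqlora.
lora_config = LoRAConfig(
    target_modules=[".*q_proj.*", ".*v_proj.*"],
    r=8,
    lora_alpha=16,
    lqlora=True,  # would trigger the LQ-LoRA initialization inside LoRAModel
)
model = LoRAModel(model, lora_config)
```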
PR types
PR changes
Description