
How to change to PEFT model dynamically? #1829

Closed
whr819987540 opened this issue Jun 5, 2024 · 4 comments

@whr819987540

python==3.7.12
PEFT==0.3.0

@BenjaminBossan

I fine-tune transformer layer 11 of BERT as shown below:

from peft import LoraConfig

target_modules = [
    "11.attention.self.query",
    "11.attention.self.value",
]

lora_config = LoraConfig(
    r=self.args.lora_rank,
    lora_alpha=self.args.lora_alpha,
    target_modules=target_modules,
    lora_dropout=0.05,
    bias="none",
)
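
For reference, a minimal sketch of how this config is applied to a base model (bert-base-uncased is only an illustrative checkpoint):

from transformers import AutoModel
from peft import get_peft_model

# Illustrative base model; any BERT checkpoint with the standard module names works the same way.
base_model = AutoModel.from_pretrained("bert-base-uncased")

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the layer-11 query/value LoRA weights should be trainable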

After training for a few epochs, I also want to fine-tune the first transformer layer (layer 0). How can I achieve this?

@BenjaminBossan
Member

This is not directly possible. What you could try is to add all the layers you eventually want to train to target_modules. Then, go through the modules and manually disable the gradients of those you don't want to train yet:

target_modules = [
    "0.attention.self.query", "0.attention.self.value",
    "11.attention.self.query", "11.attention.self.value",
]
...
model = get_peft_model(...)
for name, module in model.named_modules():
    if name ...:  # match the LoRA modules you don't want to train yet (e.g. layer 0)
        module.requires_grad_(False)

I've never tried this, so I'm not 100% sure it'll work, but it's worth a try.
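
Fleshed out a bit, that idea might look like the following untested sketch. The layer-qualified module names, the "lora_" / "layer.0." string checks, and the set_layer0_lora_trainable helper are only illustrative and assume the standard BERT module names, so verify them against model.named_parameters() on your side:

from transformers import AutoModel
from peft import LoraConfig, get_peft_model

base_model = AutoModel.from_pretrained("bert-base-uncased")  # illustrative checkpoint

target_modules = [
    "layer.0.attention.self.query", "layer.0.attention.self.value",
    "layer.11.attention.self.query", "layer.11.attention.self.value",
]
lora_config = LoraConfig(
    r=8, lora_alpha=16, target_modules=target_modules,
    lora_dropout=0.05, bias="none",
)
model = get_peft_model(base_model, lora_config)

def set_layer0_lora_trainable(model, trainable):
    # Freeze or unfreeze only the LoRA parameters that belong to layer 0.
    for name, param in model.named_parameters():
        if "lora_" in name and "layer.0." in name:
            param.requires_grad = trainable

# Phase 1: train only layer 11.
set_layer0_lora_trainable(model, False)
# ... train for a few epochs ...

# Phase 2: now also fine-tune layer 0.
set_layer0_lora_trainable(model, True)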

@whr819987540
Author

Actually, that's what I am doing. But a model created this way (fine-tuning just the last transformer layer) performs differently from a model created directly by passing only that layer's modules in the target_modules argument. I guess it's because the initialization of the additional LoRA layers changes the random number sequence.

@BenjaminBossan
Member

Yes, if you create more LoRA layers, the random number generator is advanced more often. But I don't understand what the big issue with that would be. Maybe you could show your code and explain what you expect versus what actually happens.
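
To illustrate the point about the random number sequence, here is a rough, untested sketch that builds both variants from the same seed and compares the initial lora_A weights of layer 11 (make_base_model and the parameter-name filter are only illustrative and assume standard BERT names):

import torch
from transformers import AutoModel
from peft import LoraConfig, get_peft_model

def make_base_model():
    # Illustrative helper: returns a fresh base model with identical pretrained weights each time.
    return AutoModel.from_pretrained("bert-base-uncased")

def build(target_modules, seed=0):
    # Create a PEFT model from a fixed seed; the random LoRA weight init consumes RNG draws.
    torch.manual_seed(seed)
    config = LoraConfig(r=8, lora_alpha=16, target_modules=target_modules,
                        lora_dropout=0.05, bias="none")
    return get_peft_model(make_base_model(), config)

only_layer_11 = build(["layer.11.attention.self.query", "layer.11.attention.self.value"])
both_layers = build([
    "layer.0.attention.self.query", "layer.0.attention.self.value",
    "layer.11.attention.self.query", "layer.11.attention.self.value",
])

def layer11_query_lora_A(model):
    # Grab the initial lora_A weight of layer 11's query projection.
    for name, param in model.named_parameters():
        if "layer.11." in name and "lora_A" in name and "query" in name:
            return param.detach().clone()

# If this prints False, the extra layer-0 initialization shifted the random number
# sequence, so layer 11 starts from different LoRA weights in the two setups.
print(torch.allclose(layer11_query_lora_A(only_layer_11), layer11_query_lora_A(both_layers)))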

@whr819987540
Author

I just wanted to know whether the random seed was the reason.

Thanks a lot for your reply.
