Consider the following LoraModel instance:

from transformers import AutoModelForImageClassification, TrainingArguments, Trainer
from peft import LoraConfig, LoraModel

model_checkpoint = "google/vit-base-patch16-224-in21k"

# label2id / id2label are assumed to be defined earlier from the dataset's labels
model = AutoModelForImageClassification.from_pretrained(
    model_checkpoint,
    label2id=label2id,
    id2label=id2label,
    ignore_mismatched_sizes=True,  # needed when fine-tuning an already fine-tuned checkpoint
)
config = LoraConfig(
    r=16,
    lora_alpha=16,
    target_modules=["query", "value"],
    lora_dropout=0.1,
    bias="none",
    modules_to_save=["classifier"],
)
lora_model = LoraModel(config, model)
If I call lora_model.save_pretrained("lora_vit"), the resulting state_dict is about the same size as that of model. Is this expected?
I would have expected to see only the trainable LoRA parameters alongside the modules_to_save ones. That would shrink the state dict and improve portability, especially for very large models. This is also how it's implemented in diffusers.
Also, PeftConfig is somehow unable to find the config.json here, whereas it's clearly there, as we can see. What am I missing?
This currently blocks the inference and sharing section of the notebook.
Hello Sayak, for the HF Hub utilities to work, you need to create a PeftModel object. These changes are shown in the notebook example that I shared offline.
Related to #56.
@younesbelkada @pacman100