
Support for LoRA? #34

Open
TimS-ml opened this issue Jul 19, 2023 · 1 comment

Comments

@TimS-ml commented Jul 19, 2023

Hi,

Thank you very much for open-sourcing this project!
I noticed that self.efficient_finetuning in be_great/great.py supports LoRA, but I've run into a few bugs that I could use some help with.

[1] GReaT.load_from_dir() leads to a state dict mismatch:

Missing key(s) in state_dict: "transformer.wte.weight" ...
Unexpected key(s) in state_dict: "base_model.model.transformer.wte.weight" ...
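
If it helps: the "base_model.model." prefix looks like it comes from PEFT wrapping the model, while load_from_dir() builds a plain GPT2LMHeadModel. As a rough sketch, the keys can be remapped like this (note this recovers only the frozen base weights; the lora_A/lora_B tensors are dropped, so the fine-tuning itself is lost):

import torch

# "model.pt" is the checkpoint the library saves; adapt the path as needed.
state_dict = torch.load("model.pt")

# Strip the PEFT wrapper prefix so the remaining keys match a plain
# GPT2LMHeadModel; LoRA-specific tensors have no counterpart there.
prefix = "base_model.model."
plain_state_dict = {
    k[len(prefix):]: v
    for k, v in state_dict.items()
    if k.startswith(prefix) and "lora_" not in k
}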

[2] net.sample(n_samples, k=50) raises

AttributeError: 'GPT2LMHeadModel' object has no attribute 'generation_config'
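
A possible workaround for [2], assuming the attribute is simply missing on this transformers/checkpoint combination, might be to attach one manually:

from transformers import GenerationConfig

# Assumption: net.model is the underlying GPT2LMHeadModel. Recent
# transformers versions look for model.generation_config when generating,
# so deriving one from the model config fills in the missing attribute.
net.model.generation_config = GenerationConfig.from_model_config(net.model.config)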

Thanks

@sebffischer (Contributor)

I just faced the same problem. The cause is that load_from_dir() does not recreate the PEFT-wrapped model before loading the state dict.

This is a workaround:

import json

import torch
from be_great import GReaT
from peft import LoraConfig, TaskType, get_peft_model

great = GReaT('distilgpt2')

# Define LoRA Config
lora_config = LoraConfig(
    r=16,  # only training 0.16% of the parameters of the model
    lora_alpha=32,
    target_modules=[
        "c_attn"
    ],  # specific to GPT-2; adapt for other architectures
    lora_dropout=0.05,
    bias="none",
    task_type=TaskType.CAUSAL_LM,  # causal-LM task, as for GPT-2; adapt as needed
)
# add LoRA adapter
great.model = get_peft_model(great.model, lora_config)
great.model.print_trainable_parameters()

# With the PEFT wrapper recreated, the checkpoint's
# "base_model.model.*" keys match the model again.
great.model.load_state_dict(torch.load("model.pt"))

# Load attributes
with open("config.json", "r") as f:
    attributes = json.load(f)

# Set all attributes
for k, v in attributes.items():
    setattr(great, k, v)
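
With the attributes restored, sampling should work again; a quick sanity check (the sample size is arbitrary, and this assumes issue [2] above is also resolved):

# sample() returns a pandas DataFrame of synthetic rows
samples = great.sample(n_samples=100, k=50)
print(samples.head())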
