Anyone using triton python backend? #633

YooSungHyun · 2023-06-26T10:39:46Z

System Info

python 3.10
tritonserver:23.05
torch 2.0.1
peft 0.3.0
transformers 4.29.2

Who can help?

No response

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder
My own task or dataset (give details below)

Reproduction

check, triton-inference-server/server#5989
i want to use PeftModel in tritonserver:23.05, that is not ocurring error, but don't working on....

Anyone else using it like me?
please, give me some tips 😭

Expected behavior

set_adapter is work good

The text was updated successfully, but these errors were encountered:

pacman100 · 2023-06-27T13:02:44Z

Hello @YooSungHyun, as per triton-inference-server/server#5989 (comment), the issue seems to stem from the fact that torch.compile is incompatible with Peft models at present. We will put the deep dive for this in our backlog but this will take time.

YooSungHyun · 2023-06-28T04:50:15Z

@pacman100 thx, if you need something, i will help you. plz just tagging me🤗

YooSungHyun · 2023-06-28T04:53:05Z

if i use like this.

model = AutoModelForCausalLM.from_pretrained(
    optional_config["model_path"],
    low_cpu_mem_usage=bool(strtobool(optional_config["low_cpu_mem_usage"])),
    load_in_8bit=bool(strtobool(optional_config["load_in_8bit"])),
    torch_dtype=getattr(torch, optional_config["torch_dtype"], None),
    device_map=device_map,
)

self.model = PeftModel.from_pretrained(
    model,
    optional_config["aa_lora_weights"],
    adapter_name=optional_config["aa_lora_name"],
    device_map=device_map,
)
self.model.load_adapter(
    optional_config["bb_lora_weights"],
    adapter_name=optional_config["bb_lora_name"],
    device_map=device_map,
)
model = torch.compile(model)
self.model.eval()

it work fine

pacman100 · 2023-06-28T05:28:38Z

Great! So, it works!

YooSungHyun · 2023-06-28T05:39:52Z

@pacman100 but, this is weired... why important torch.compile's implement line number...?

pacman100 · 2023-06-28T05:44:02Z

Torch.compile should always be at the end of all the preprocessing of the model. For example, when using DDP too, torch.compile is applied after wrapping the model in DDP.

YooSungHyun · 2023-06-28T05:50:44Z

@pacman100 oh thx

YooSungHyun · 2023-06-28T05:52:41Z

@pacman100 ah.. how about eval()?

model.eval()
model = torch.compile(model)

model = torch.compile(model)
model.eval()

each correct?

YooSungHyun · 2023-07-04T01:03:44Z

@pacman100 hello?

github-actions · 2023-07-28T15:03:28Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

YooSungHyun · 2023-08-03T06:55:33Z

@pacman100

github-actions · 2023-08-27T15:03:29Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

YooSungHyun changed the title ~~Any one using triton python backend?~~ Anyone using triton python backend? Jun 26, 2023

YooSungHyun closed this as completed Jun 28, 2023

YooSungHyun reopened this Jun 28, 2023

github-actions bot closed this as completed Sep 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Anyone using triton python backend? #633

Anyone using triton python backend? #633

Anyone using triton python backend? #633

Anyone using triton python backend? #633

Comments

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior