
drop stop words #1823

Merged · 5 commits · Jul 1, 2024

Conversation

grimoire
Collaborator

fix for #1754

Stop words should NOT be cached.

  1. Users should be able to get the very same result if they gather all input/output and recompute it. Since we don't return stop words to the user, we should not cache them either (a sketch of the intended trimming follows this list).
  2. Baichuan2 would forget the history if we put the eos in the cache.
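
A minimal sketch of the intended behavior, using hypothetical names (trim_stop_words is not an actual lmdeploy function): generated ids are appended to a session's cached history only after trailing stop-word/eos ids are stripped, so the cache stays consistent with what the user can reconstruct from the visible input/output.

from typing import List

def trim_stop_words(output_ids: List[int], stop_word_ids: List[int]) -> List[int]:
    """Drop trailing stop-word/eos ids before appending output to the session cache."""
    end = len(output_ids)
    while end > 0 and output_ids[end - 1] in stop_word_ids:
        end -= 1
    return output_ids[:end]

# e.g. eos_id == 2 is produced by the model but never shown to the user,
# so it is excluded from the cached history as well.
assert trim_stop_words([10, 42, 7, 2], stop_word_ids=[2]) == [10, 42, 7]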

@grimoire
Collaborator Author

@zhulinJulia24

@zhulinJulia24
Collaborator

@zhulinJulia24

Fixed!

@zhulinJulia24 zhulinJulia24 self-requested a review June 24, 2024 01:42
Collaborator
@zhulinJulia24 zhulinJulia24 left a comment


lgtm

@grimoire
Collaborator Author

@lvhan028 Is the behavior aligned with turbomind?

@lvhan028
Collaborator

@lvhan028 Is the behavior aligned with turbomind?

Turbomind caches the stop_words but not the eos_id.

if len(output) > 0 and output[-1].item() == self.eos_id \
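
Roughly, the quoted line trims only the trailing eos before the output is cached. A hedged sketch of that check with hypothetical surrounding values (not the actual TurboMind session code):

import torch

eos_id = 2
output = torch.tensor([10, 42, 7, eos_id])  # generated ids for one request

# Drop only the final eos token before the output is cached;
# ordinary stop words would be kept in the cache.
if len(output) > 0 and output[-1].item() == eos_id:
    output = output[:-1]

print(output.tolist())  # [10, 42, 7]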

@lvhan028
Collaborator

"Since we won't give stop words to user, we should not cache it too."
I don't think so.
In the non-stateful case, it is OK, since message2pormpt will add the stop_words in between.
So it means, in the stateful case, the stop_words should be saved.
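
To illustrate the non-stateful case, a hedged sketch (a hypothetical template, not the actual lmdeploy messages2prompt implementation): the full prompt is rebuilt from all messages every round and the stop word is re-inserted after each assistant turn, so the engine never needs it from the cache. In the stateful case only the newest turn reaches the engine, so the separator must already be present in the cached state.

STOP = '</s>'  # assumed stop word / eos text for this example

def messages2prompt(messages):
    # Rebuild the whole prompt from scratch every round; the stop word is
    # appended after each assistant turn regardless of what was cached.
    parts = []
    for m in messages:
        if m['role'] == 'user':
            parts.append(f"<user>{m['content']}")
        else:
            parts.append(f"<assistant>{m['content']}{STOP}")
    return ''.join(parts) + '<assistant>'

history = [
    {'role': 'user', 'content': 'Do you know John Wick?'},
    {'role': 'assistant', 'content': 'Yes, it is a movie.'},
    {'role': 'user', 'content': 'Tell me more about it.'},
]
print(messages2prompt(history))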

Regarding the Baichuan model, I think we should experiment with the eos_id token: add eos_id after each assistant's answer in a multi-round scenario and use transformers to do the inference. Let's check whether it loses the memory.

@grimoire
Collaborator Author

Regarding the Baichuan model, I think we should experiment with the eos_id token: add eos_id after each assistant's answer in a multi-round scenario and use transformers to do the inference. Let's check whether it loses the memory.

Sure it does

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM, GenerationConfig

def main():
    model_path = '/path/to/Baichuan2-13B-Chat/'

    # Toggle between the two runs: set eos = '' to reproduce the "output w/o eos"
    # below, or keep '</s>' to append an eos after the assistant's answer.
    eos = '</s>'
    messages = [
        {"role": "user", "content": "Do you know John Wick?"},
        {"role": "assistant", "content": f"Yes, it is a movie. {eos}"},
        {"role": "user", "content": "Tell me more about it."},
    ]

    tokenizer = AutoTokenizer.from_pretrained(model_path,
        revision="v2.0",
        use_fast=False,
        trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(model_path,
        revision="v2.0",
        device_map="auto",
        torch_dtype=torch.bfloat16,
        trust_remote_code=True)
    model.generation_config = GenerationConfig.from_pretrained(model_path, revision="v2.0")
    with torch.inference_mode():
        response = model.chat(tokenizer, messages)
        print(response)

if __name__ == '__main__':
    main()

output with eos

I'm sorry, I am not sure what you are referring to. Can you provide more context or clarification?

output w/o eos

"John Wick" is a 2014 action thriller film directed by Chad Stahelski and written by Derek Kolstad. The film stars Keanu Reeves as the title character, an ex-secret service agent who goes on a revenge mission after his car and dog are stolen at the behest of a Russian mobster (played by Alfie Allen). The film was released to positive reviews from critics, who praised Reeves' performance and the film's fast-paced action sequences. A sequel, "John Wick: Chapter 2", was released in 2017, and a third installment, "John Wick: Chapter 3 – Parabellum", was released in 2019.

@grimoire
Collaborator Author

I can align the behavior with TurboMind, but putting the logic in the template is more reasonable.

@lvhan028 lvhan028 merged commit 78d88d5 into InternLM:main Jul 1, 2024
4 of 5 checks passed