-
Notifications
You must be signed in to change notification settings - Fork 284
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add docs of support new vl model
documentation
Improvements or additions to documentation
#1332
opened Mar 22, 2024 by
irexyc
Loading…
fix: update api_server_backend.py to adapt latest gradio
improvement
#1541
opened May 3, 2024 by
kv-chiu
Loading…
[benchmark] optimize benchmark: counting tokenlizer tokens and error requests
#1607
opened May 17, 2024 by
NiuBlibing
Loading…
feat: skip invokeFlattenKV_v2_ when fp16 and bf16 with CacheType::kBlock
#1683
opened May 29, 2024 by
zhyncs
Loading…
Visualize layer activations and weights to simplify the quantization process.
#607
opened Oct 24, 2023 by
HIT-cwh
Loading…
Add tools to api_server for InternLM2 model
enhancement
New feature or request
#1763
opened Jun 12, 2024 by
AllentDan
Loading…
Support guided decoding for pytorch backend
enhancement
New feature or request
#1856
opened Jun 26, 2024 by
AllentDan
Loading…
feat: support llama2 and internlm2 on 910B
enhancement
New feature or request
#1889
opened Jul 1, 2024 by
yao-fengchen
Loading…
Fix index error when profiling token generation with
-ct 1
Bug:P1
#1898
opened Jul 2, 2024 by
lvhan028
Loading…
Remove deprecated arguments from API and clarify model_name and chat_template_name
BC-breaking
improvement
WIP
#1931
opened Jul 5, 2024 by
lvhan028
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.