InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 284
Star 3.2k

Code
Issues 177
Pull requests 26
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: InternLM/lmdeploy

Labels 32 Milestones 0

New pull request New

Clear current search query, filters, and sorts

26 Open 874 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add Jetson platform support (by docker)

#1820 opened Jun 21, 2024 by BestAnHongjun

Loading…

Compatible with Gradio 4.x improvement WIP

#1035 opened Jan 24, 2024 by irexyc

Loading…

[WIP] support Medusa

#1231 opened Mar 3, 2024 by zhyncs

Loading…

5 of 9 tasks

Add docs of support new vl model documentation

Improvements or additions to documentation

#1332 opened Mar 22, 2024 by irexyc

Loading…

Log stats enhancement

New feature or request

#1423 opened Apr 11, 2024 by AllentDan

Loading…

fix: update api_server_backend.py to adapt latest gradio improvement

#1541 opened May 3, 2024 by kv-chiu

Loading…

support AI4Chem/ChemLLM-7B-Chat-1_5-SFT WIP

#1552 opened May 7, 2024 by lvhan028

Loading…

Add tritonserver testcase

#1559 opened May 8, 2024 by ZhoujhZoe

Loading…

[benchmark] optimize benchmark: counting tokenlizer tokens and error requests

#1607 opened May 17, 2024 by NiuBlibing

Loading…

Check base64 image validation Bug:P2

#1615 opened May 20, 2024 by AllentDan

Loading…

support vl benchmark

#1662 opened May 27, 2024 by AllentDan

Loading…

feat: skip invokeFlattenKV_v2_ when fp16 and bf16 with CacheType::kBlock

#1683 opened May 29, 2024 by zhyncs

Loading…

Visualize layer activations and weights to simplify the quantization process.

#607 opened Oct 24, 2023 by HIT-cwh

Loading…

Add tools to api_server for InternLM2 model enhancement

New feature or request

#1763 opened Jun 12, 2024 by AllentDan

Loading…

Maybe a workaround for qwen2 quantization Nan error

#1844 opened Jun 25, 2024 by AllentDan • Draft

feat: decouple input_ids and output_ids

#1855 opened Jun 25, 2024 by zhyncs

Loading…

Support guided decoding for pytorch backend enhancement

New feature or request

#1856 opened Jun 26, 2024 by AllentDan

Loading…

feat: support llama2 and internlm2 on 910B enhancement

New feature or request

#1889 opened Jul 1, 2024 by yao-fengchen

Loading…

Fix index error when profiling token generation with -ct 1 Bug:P1

#1898 opened Jul 2, 2024 by lvhan028

Loading…

refactor sampling layer setup improvement

#1912 opened Jul 3, 2024 by irexyc

Loading…

PyTorch Engine AWQ support

#1913 opened Jul 3, 2024 by grimoire

Loading…

[ci] add internlm2.5 models into testcase

#1928 opened Jul 5, 2024 by zhulinJulia24

Loading…

Upgrade gradio improvement

#1930 opened Jul 5, 2024 by AllentDan

Loading…

Remove deprecated arguments from API and clarify model_name and chat_template_name BC-breaking improvement WIP

#1931 opened Jul 5, 2024 by lvhan028

Loading…

support internlm-xcomposer2d5-7b WIP

#1932 opened Jul 5, 2024 by irexyc

Loading…

1 of 5 tasks

Previous 1 2 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly