Uh oh!

There was an error while loading. Please reload this page.

InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 720
Star 8k

Code
Issues 534
Pull requests 76
Discussions
Actions
Projects
Security and quality 4
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security and quality
Insights

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

New pull request New

76 Open 2,362 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[Bugfix] Return the client-disconnect error instead of discarding it in generate and completions_v1

#4811 opened Jul 31, 2026 by ErenAta16

Loading…

3 tasks done

feat: support interns2 preview time-series encoder

#4810 opened Jul 31, 2026 by CUHKSZzxy Collaborator • Draft

perf(pytorch): add opt-in torch.compile for decode CUDA graphs

#4808 opened Jul 31, 2026 by grimoire Collaborator

Loading…

[ci] remove old models and refactor interface testcase

#4806 opened Jul 30, 2026 by zhulinJulia24 Collaborator • Draft

refactor(pytorch): derive CUDA step metadata from selected operators

#4805 opened Jul 30, 2026 by grimoire Collaborator

Loading…

fix(serve): reject empty/falsy prompt input in format_prompts and AsyncEngine.generate

#4803 opened Jul 29, 2026 by SuperMarioYL

Loading…

refactor: stream tool parameters incrementally improvement

#4802 opened Jul 29, 2026 by lvhan028 Collaborator

Loading…

Ssm prefix cache non aligned

#4799 opened Jul 29, 2026 by grimoire Collaborator • Draft

2 tasks

refactor: report cache usage directly improvement

#4798 opened Jul 29, 2026 by lvhan028 Collaborator

Loading…

refactor: split api server endpoints improvement

#4797 opened Jul 28, 2026 by lvhan028 Collaborator

Loading…

SM90 native BF16/FP8 GEMM kernels, fused-SiLU quantization, and linear test harness improvement

#4795 opened Jul 28, 2026 by lzhangzz Collaborator

Loading…

feat: add Rust TurboMind API server

#4794 opened Jul 28, 2026 by lvhan028 Collaborator • Draft

[Feat]: Support output input logprobs enhancement

New feature or request

#4793 opened Jul 28, 2026 by RunningLeon Collaborator

Loading…

Support dflash for qwen3.5

#4789 opened Jul 27, 2026 by RunningLeon Collaborator

Loading…

optimize and modularize SSM prefix caching improvement

#4788 opened Jul 27, 2026 by grimoire Collaborator

Loading…

Integrate DeepEPv2 enhancement

New feature or request

#4783 opened Jul 25, 2026 by irexyc Collaborator

Loading…

[Fix] generate: propagate the disconnect 400 instead of returning null

#4782 opened Jul 24, 2026 by AmirF194

Loading…

feat: support Intern-S2-Preview TS forecaster enhancement

New feature or request

#4780 opened Jul 24, 2026 by CUHKSZzxy Collaborator

Loading…

TEST: update turbomind qwen3.5 config

#4778 opened Jul 24, 2026 by littlegy Contributor

Loading…

fix(security): reject pickled update_weights payloads by default

#4765 opened Jul 21, 2026 by Solaris-star

Loading…

[Draft] Add PyTorch engine support for Hy3 BF16 and static FP8 (NO MTP)

#4763 opened Jul 20, 2026 by yidingcheng0206 Collaborator • Draft

5 of 7 tasks

Upgrade to cu130

#4753 opened Jul 15, 2026 by RunningLeon Collaborator

Loading…

feat: support GLM-5.2

#4737 opened Jul 7, 2026 by CUHKSZzxy Collaborator

Loading…

Add TurboMind ViT support for InternVL and Qwen VL models enhancement

New feature or request

#4719 opened Jun 29, 2026 by irexyc Collaborator

Loading…

refactor: rename quant policy to kv cache dtype

#4718 opened Jun 29, 2026 by CUHKSZzxy Collaborator • Draft

Previous 1 2 3 4 Next

Previous Next

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!